Toward AI That Understands Self and Others: A World-Model Theory of Cognitive Diversity and Alignment

Toru Takahashi

arxiv: 2605.29930 · v2 · pith:HVLR3IAXnew · submitted 2026-05-28 · 💻 cs.AI · cs.CY· cs.HC

Toward AI That Understands Self and Others: A World-Model Theory of Cognitive Diversity and Alignment

Toru Takahashi This is my paper

Pith reviewed 2026-06-29 07:43 UTC · model grok-4.3

classification 💻 cs.AI cs.CYcs.HC

keywords world modelscognitive diversityAI alignmentmulti-phase inferenceprocessabilityalignment mapstransformation loss

0 comments

The pith

Disagreement arises because observations become inferences only after constructing sufficient state representations under constraints.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that disagreement in societies is a late-stage phenomenon stemming from how different intelligences build world models from observations under finite constraints. It proposes that alignment in AI should focus on making these heterogeneous world models processable to each other while keeping their unique error-detection abilities intact. This shifts the view from forcing consensus to enabling communication across different inferential frameworks. The approach reconstructs recognition as the construction of approximate sufficient statistics, formalized through the Multi-Phase Inference Assumption and its mechanism.

Core claim

Recognition is the construction of approximate sufficient statistics under informational, representational, observational, and action constraints, formalized as the Multi-Phase Inference Assumption (MIA) and Mechanism (MIM). Alignment maps and transformation loss allow analysis of how world models communicate, making alignment processability rather than agreement: the design of AI systems that help heterogeneous forms of intelligence remain mutually processable while preserving their distinct error-detection capacities.

What carries the argument

The Multi-Phase Inference Mechanism (MIM), which reconstructs recognition by determining admissible targets through construction of state representations approximately sufficient for prediction, evaluation, or action.

If this is right

Heterogeneous world models can communicate without being collapsed into a single representation.
AI systems can be designed to preserve distinct error-detection capacities across forms of intelligence.
Alignment maps and transformation loss provide tools to quantify and manage communication between models.
Disagreement is analyzed as differing admissible targets rather than conflicts of values or beliefs.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The framework could extend to real-time AI interfaces that map between a user's world model and the system's own without requiring the user to adopt the system's representation.
It suggests testable designs for multi-agent environments where agents operate under deliberately varied observational constraints to measure processability.
Applications might include systems for resolving interpretive conflicts in policy or science by tracking transformation losses between models rather than seeking consensus.

Load-bearing premise

The premise that observation is not yet inference and that a possible target becomes admissible only when a state representation can be constructed that is approximately sufficient for prediction, evaluation, or action with respect to that target.

What would settle it

An experiment in which two agents given identical sufficient state representations for the same observation sequence still exhibit persistent disagreement on inferences, or conversely where differing representations produce no communication barrier.

read the original abstract

Modern societies possess more information than ever before, yet they do not converge toward a single shared understanding. The same events, facts, laws, technologies, or risks can be interpreted as evidence of freedom, danger, exclusion, injustice, responsibility, or unrealized possibility. Existing discussions often treat such disagreement as a conflict of values, preferences, or beliefs. This paper argues that disagreement is already a late-stage phenomenon. The central premise is simple but not trivial: observation is not yet inference. Not every observation becomes inferentially relevant, and not every possible object in an observation sequence becomes an estimation target. A possible target becomes admissible only when a state representation can be constructed that is approximately sufficient for prediction, evaluation, or action with respect to that target. This paper develops a world-model theory of cognitive diversity and alignment by reconstructing recognition as the construction of such approximate sufficient statistics under finite informational, representational, observational, and action constraints. It formulates this position as the Multi-Phase Inference Assumption (MIA) and defines its core internal mechanism as the Multi-Phase Inference Mechanism (MIM). The framework introduces alignment maps and transformation loss to analyze how heterogeneous world models communicate without being collapsed into a single representation. World-model alignment is therefore processability, not agreement: the design of AI systems that help heterogeneous forms of intelligence remain mutually processable while preserving their distinct error-detection capacities.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper reframes alignment as mutual processability across heterogeneous world models rather than consensus, but the argument stays definitional with no derivations or examples to make the distinction work.

read the letter

The paper's headline move is to treat disagreement as a downstream effect of different agents building their own approximate sufficient statistics from observations, then define alignment as keeping those models processable without forcing them to match. That distinction between processability and agreement is the one thing a colleague should take away.

It states the central premise cleanly: observation is not yet inference, and a target only becomes admissible once a representation sufficient for prediction or action can be built under real constraints. This connects the problem of societal disagreement on topics like justice or risk to bounded world-model construction, which is a reasonable way to organize the issue.

The new pieces are the Multi-Phase Inference Assumption, the Multi-Phase Inference Mechanism, alignment maps, and transformation loss. These give a vocabulary for talking about how models communicate while keeping their separate error-detection capacities intact.

The soft spot is exactly where the stress-test note points: the abstract introduces the terms and then states the conclusion without showing how the maps or loss produce processability instead of agreement. There are no equations, no worked example, and no link to existing formal results on sufficient statistics or world models that would let a reader verify the step. The argument therefore rests on the definitions rather than on demonstrated mechanics.

This is for readers who follow theoretical work on multi-agent alignment and cognitive diversity. Someone looking for formal results, falsifiable predictions, or implemented mechanisms will not find them here. It is worth sending to peer review because the premise is worth testing in a fuller version, but the authors would need to add concrete illustrations or derivations before the framework can be evaluated on its own terms.

Referee Report

2 major / 1 minor

Summary. The paper claims that societal disagreement over facts and events is a late-stage phenomenon because observation is not yet inference; a target becomes admissible only once an approximately sufficient state representation can be constructed for prediction, evaluation, or action. It reconstructs recognition under finite constraints via the Multi-Phase Inference Assumption (MIA) and its internal Multi-Phase Inference Mechanism (MIM), then introduces alignment maps and transformation loss to analyze communication between heterogeneous world models. The central conclusion is that world-model alignment consists in mutual processability rather than representational agreement, enabling AI systems to preserve distinct error-detection capacities across diverse intelligences.

Significance. If the framework were formalized with explicit derivations and operational examples, it could provide a useful conceptual shift in AI alignment research by reframing the goal as maintaining processability across heterogeneous models instead of enforcing consensus. The approach highlights the role of finite constraints in inference and offers a way to think about cognitive diversity without requiring unification. As currently presented, however, the contribution remains at the level of definitional reframing without demonstrated mechanisms or testable implications.

major comments (2)

[Abstract] Abstract: The assertion that 'World-model alignment is therefore processability, not agreement' is presented as following directly from the MIA, MIM, alignment maps, and transformation loss, yet no derivation, formal definition, or worked example is supplied showing how these constructs yield processability independently of representational agreement or how transformation loss quantifies the distinction.
[Abstract] Abstract: The central premise that 'a possible target becomes admissible only when a state representation can be constructed that is approximately sufficient for prediction, evaluation, or action' is introduced as foundational but is not derived from prior results or contrasted with alternative accounts of inference; this premise directly supports the subsequent claims about MIA/MIM and alignment, making its lack of justification load-bearing for the entire argument.

minor comments (1)

[Abstract] The abstract is highly compressed; expanding the description of how alignment maps and transformation loss function would improve readability even at the conceptual level.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive report. We address each major comment below, clarifying the logical role of the central constructs and indicating where the manuscript will be revised to improve explicitness.

read point-by-point responses

Referee: [Abstract] Abstract: The assertion that 'World-model alignment is therefore processability, not agreement' is presented as following directly from the MIA, MIM, alignment maps, and transformation loss, yet no derivation, formal definition, or worked example is supplied showing how these constructs yield processability independently of representational agreement or how transformation loss quantifies the distinction.

Authors: The MIA and MIM are defined in Section 2 as the assumption and mechanism governing construction of approximately sufficient statistics under finite constraints. Alignment maps and transformation loss are introduced in Section 3 as the means to quantify mappings that preserve predictive utility without requiring identical representations. Processability follows directly because transformation loss is defined as the residual error after the map is applied, which can be low even when the underlying sufficient statistics differ. We agree that the abstract is too terse; the revised manuscript will include a short derivation sketch relating the definitions to mutual processability and one worked numerical example of two models with distinct constraints communicating via an alignment map. revision: partial
Referee: [Abstract] Abstract: The central premise that 'a possible target becomes admissible only when a state representation can be constructed that is approximately sufficient for prediction, evaluation, or action' is introduced as foundational but is not derived from prior results or contrasted with alternative accounts of inference; this premise directly supports the subsequent claims about MIA/MIM and alignment, making its lack of justification load-bearing for the entire argument.

Authors: The premise is presented explicitly as the Multi-Phase Inference Assumption (MIA), i.e., an axiomatic starting point rather than a theorem derived from earlier results. It is motivated by the observation that not every datum becomes an estimation target under resource bounds, a point already implicit in bounded-rationality and statistical decision theory. We will revise the introduction to add an explicit contrast with classical accounts that assume unlimited representational capacity or treat all observations as immediately admissible, thereby clarifying why the premise is load-bearing. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected; framework is self-contained theoretical proposal.

full rationale

The paper states a central premise (observation is not yet inference; targets admissible only via sufficient state representations), formulates it as the Multi-Phase Inference Assumption (MIA) and Mechanism (MIM), introduces alignment maps and transformation loss, and concludes that alignment equals processability rather than agreement. No equations, fitted parameters, self-citations, uniqueness theorems, or ansatzes appear in the provided text. The conclusion is presented as following from the introduced terminology and premise without any reduction that makes the output equivalent to the inputs by construction. This matches the default case of a self-contained conceptual framework with no load-bearing circular steps.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 3 invented entities

Based solely on the abstract, the paper introduces several new concepts without external grounding or derivations. The central premise is presented as an assumption rather than derived from prior results.

axioms (1)

ad hoc to paper Multi-Phase Inference Assumption (MIA)
The paper explicitly formulates its position as this assumption in the abstract.

invented entities (3)

Multi-Phase Inference Mechanism (MIM) no independent evidence
purpose: Core internal mechanism of the framework
Introduced as the mechanism implementing the Multi-Phase Inference Assumption.
alignment maps no independent evidence
purpose: Analyze communication between heterogeneous world models
New construct introduced to study processability without collapse.
transformation loss no independent evidence
purpose: Quantify information change in model communication
New term introduced alongside alignment maps.

pith-pipeline@v0.9.1-grok · 5782 in / 1470 out tokens · 31388 ms · 2026-06-29T07:43:26.878531+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

101 extracted references · 4 canonical work pages · 4 internal anchors

[1]

Amari, S. (2016). Information Geometry and Its Applications. Springer

2016
[2]

Anderson, R. C. and Pichert, J. W. (1978). Recall of previously unrecallable information following a shift in perspective. Journal of Verbal Learning and Verbal Behavior, 17(1), 1--12

1978
[3]

Augustinaviciute, A. (1980). Socionics (Russian: Socionika). Manuscripts and lectures collected in Sochineniya, 2nd ed., Chernaya Belka Publishing, 2008

1980
[4]

Bai, Y. et al. (2022). Constitutional AI: Harmlessness from AI Feedback. arXiv:2212.08073

work page internal anchor Pith review Pith/arXiv arXiv 2022
[5]

Bail, C. A. (2021). Breaking the Social Media Prism: How to Make Our Platforms Less Polarizing. Princeton University Press

2021
[6]

Brohan, A. et al. (2023). RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control. arXiv:2307.15818

work page internal anchor Pith review Pith/arXiv arXiv 2023
[7]

Bruner, J. S. and Postman, L. (1947). Tension and tension-release as organizing factors in perception. Journal of Personality, 15(4), 300--308

1947
[8]

Buber, M. (1923). Ich und Du. Insel Verlag

1923
[9]

Aristotle. (c. 350 BCE). Nicomachean Ethics and Metaphysics. Various editions
[10]

Berlin, I. (1969). Four Essays on Liberty. Oxford University Press

1969
[11]

Burke, E. (1790). Reflections on the Revolution in France. J. Dodsley
[12]

Habermas, J. (1981). Theorie des kommunikativen Handelns. Suhrkamp

1981
[13]

Habermas, J. (1992). Faktizit\"at und Geltung. Suhrkamp

1992
[14]

Hayek, F. A. (1945). The use of knowledge in society. American Economic Review, 35(4), 519--530

1945
[15]

Hayek, F. A. (1960). The Constitution of Liberty. University of Chicago Press

1960
[16]

Hobbes, T. (1651). Leviathan. Andrew Crooke
[17]

Landemore, H. (2013). Democratic Reason: Politics, Collective Intelligence, and the Rule of the Many. Princeton University Press

2013
[18]

MacIntyre, A. (1981). After Virtue. University of Notre Dame Press

1981
[19]

Oakeshott, M. (1962). Rationalism in Politics and Other Essays. Methuen

1962
[20]

Ober, J. (2008). Democracy and Knowledge: Innovation and Learning in Classical Athens. Princeton University Press

2008
[21]

Plato. (c. 380 BCE). Republic. Various editions
[22]

Popper, K. (1945). The Open Society and Its Enemies. Routledge

1945
[23]

Rawls, J. (1971). A Theory of Justice. Harvard University Press

1971
[24]

Rawls, J. (1993). Political Liberalism. Columbia University Press

1993
[25]

Rousseau, J.-J. (1762). Du contrat social. Marc-Michel Rey
[26]

Sunstein, C. R. (2006). Infotopia: How Many Minds Produce Knowledge. Oxford University Press

2006
[27]

Taylor, C. (1989). Sources of the Self. Harvard University Press

1989
[28]

Taylor, C. (1992). The politics of recognition. In Gutmann, A. (ed.), Multiculturalism and the Politics of Recognition, pp. 25--73. Princeton University Press

1992
[29]

Clark, A. (2013). Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behavioral and Brain Sciences, 36(3), 181--204

2013
[30]

Costa, P. T. and McCrae, R. R. (1992). Revised NEO Personality Inventory (NEO-PI-R) and NEO Five-Factor Inventory (NEO-FFI) Professional Manual. Psychological Assessment Resources

1992
[31]

Dennett, D. C. (1991). Consciousness Explained. Little, Brown and Company

1991
[32]

Dewey, J. (1925). Experience and Nature. Open Court Publishing

1925
[33]

Foucault, M. (1966). Les mots et les choses: Une arch\'eologie des sciences humaines. Gallimard

1966
[34]

European Union. (2024). Regulation (EU) 2024/1689 of the European Parliament and of the Council of 13 June 2024 laying down harmonised rules on artificial intelligence. Official Journal of the European Union

2024
[35]

National Institute of Standards and Technology. (2023). Artificial Intelligence Risk Management Framework (AI RMF 1.0). NIST AI 100-1

2023
[36]

Organisation for Economic Co-operation and Development. (2024). Recommendation of the Council on Artificial Intelligence. OECD Legal Instruments, OECD/LEGAL/0449

2024
[37]

Friston, K. (2010). The free-energy principle: a unified brain theory? Nature Reviews Neuroscience, 11(2), 127--138

2010
[38]

Friston, K., FitzGerald, T., Rigoli, F., Schwartenbeck, P., and Pezzulo, G. (2017). Active inference: a process theory. Neural Computation, 29(1), 1--49

2017
[39]

Gibson, J. J. (1979). The Ecological Approach to Visual Perception. Houghton Mifflin

1979
[40]

Gmytrasiewicz, P. J. and Doshi, P. (2005). A framework for sequential planning in multi-agent settings. Journal of Artificial Intelligence Research, 24, 49--79

2005
[41]

Goldberg, L. R. (1990). An alternative ``description of personality'': The Big-Five factor structure. Journal of Personality and Social Psychology, 59(6), 1216--1229

1990
[42]

Grice, H. P. (1975). Logic and conversation. In Cole, P. and Morgan, J. L. (eds.), Syntax and Semantics, Vol. 3: Speech Acts, pp. 41--58. Academic Press

1975
[43]

World Models

Ha, D. and Schmidhuber, J. (2018). World Models. arXiv:1803.10122

work page internal anchor Pith review Pith/arXiv arXiv 2018
[44]

Hafner, D., Lillicrap, T., Ba, J., and Norouzi, M. (2020). Dream to Control: Learning Behaviors by Latent Imagination. International Conference on Learning Representations

2020
[45]

Hafner, D., Lillicrap, T., Norouzi, M., and Ba, J. (2021). Mastering Atari with Discrete World Models. International Conference on Learning Representations

2021
[46]

Hegel, G. W. F. (1807). Ph\"anomenologie des Geistes. Joseph Anton Goebhardt
[47]

Heidegger, M. (1927). Sein und Zeit. Max Niemeyer Verlag

1927
[48]

A., Madigan, D., Raftery, A

Hoeting, J. A., Madigan, D., Raftery, A. E., and Volinsky, C. T. (1999). Bayesian model averaging: A tutorial. Statistical Science, 14(4), 382-417

1999
[49]

Hume, D. (1748). An Enquiry Concerning Human Understanding. A. Millar
[50]

anomenologie und ph\

Husserl, E. (1913). Ideen zu einer reinen Ph\"anomenologie und ph\"anomenologischen Philosophie. Max Niemeyer

1913
[51]

James, W. (1907). Pragmatism: A New Name for Some Old Ways of Thinking. Longmans, Green, and Co

1907
[52]

Jung, C. G. (1921). Psychologische Typen. Rascher Verlag

1921
[53]

Kahneman, D. (2011). Thinking, Fast and Slow. Farrar, Straus and Giroux

2011
[54]

Kant, I. (1781). Kritik der reinen Vernunft. Johann Friedrich Hartknoch
[55]

Kepinski, A. (1972). Rytm zycia. Wydawnictwo Literackie

1972
[56]

Kierkegaard, S. (1849). Sygdommen til D den [The Sickness Unto Death]. C. A. Reitzel
[57]

Kuhn, T. S. (1962). The Structure of Scientific Revolutions. University of Chicago Press

1962
[58]

Lakoff, G. (1987). Women, Fire, and Dangerous Things: What Categories Reveal about the Mind. University of Chicago Press

1987
[59]

LeCun, Y. (2022). A Path Towards Autonomous Machine Intelligence. OpenReview preprint

2022
[60]

Lehmann, E. L. and Casella, G. (1998). Theory of Point Estimation, 2nd ed. Springer

1998
[61]

L\'evi-Strauss, C. (1958). Anthropologie structurale. Plon

1958
[62]

Levinas, E. (1961). Totalit\'e et infini: Essai sur l'ext\'eriorit\'e. Martinus Nijhoff

1961
[63]

Locke, J. (1690). An Essay Concerning Human Understanding. Thomas Bassett
[64]

Merleau-Ponty, M. (1945). Ph\'enom\'enologie de la perception. Gallimard

1945
[65]

Myers, I. B. and McCaulley, M. H. (1985). Manual: A Guide to the Development and Use of the Myers-Briggs Type Indicator. Consulting Psychologists Press

1985
[66]

Nisbett, R. E. (2003). The Geography of Thought: How Asians and Westerners Think Differently---and Why. Free Press

2003
[67]

Ouyang, L. et al. (2022). Training language models to follow instructions with human feedback. Advances in Neural Information Processing Systems, 35

2022
[68]

Pariser, E. (2011). The Filter Bubble: What the Internet Is Hiding from You. Penguin Press

2011
[69]

Parr, T., Pezzulo, G., and Friston, K. (2022). Active Inference: The Free Energy Principle in Mind, Brain, and Behavior. MIT Press

2022
[70]

Peirce, C. S. (1958). Collected Papers of Charles Sanders Peirce, Vols. 1--8, edited by C. Hartshorne, P. Weiss, and A. W. Burks. Harvard University Press

1958
[71]

Pietrak, K. (2018). The foundations of socionics -- a review. Cognitive Systems Research, 47, 1--11

2018
[72]

and Woodruff, G

Premack, D. and Woodruff, G. (1978). Does the chimpanzee have a theory of mind? Behavioral and Brain Sciences, 1(4), 515--526

1978
[73]

and Stengers, I

Prigogine, I. and Stengers, I. (1984). Order Out of Chaos: Man's New Dialogue with Nature. Bantam Books

1984
[74]

Quine, W. V. O. (1960). Word and Object. MIT Press

1960
[75]

Rabinowitz, N. C. et al. (2018). Machine theory of mind. Proceedings of the 35th International Conference on Machine Learning, 80, 4218--4227

2018
[76]

Rafailov, R. et al. (2023). Direct Preference Optimization: Your Language Model is Secretly a Reward Model. Advances in Neural Information Processing Systems, 36

2023
[77]

Rao, R. P. N. and Ballard, D. H. (1999). Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nature Neuroscience, 2(1), 79--87

1999
[78]

Sartre, J.-P. (1943). L'\^etre et le n\'eant: Essai d'ontologie ph\'enom\'enologique. Gallimard

1943
[79]

Sapir, E. (1929). The Status of Linguistics as a Science. Language, 5(4), 207--214

1929
[80]

Saussure, F. de. (1916). Cours de linguistique g\'en\'erale. Payot

1916

Showing first 80 references.

[1] [1]

Amari, S. (2016). Information Geometry and Its Applications. Springer

2016

[2] [2]

Anderson, R. C. and Pichert, J. W. (1978). Recall of previously unrecallable information following a shift in perspective. Journal of Verbal Learning and Verbal Behavior, 17(1), 1--12

1978

[3] [3]

Augustinaviciute, A. (1980). Socionics (Russian: Socionika). Manuscripts and lectures collected in Sochineniya, 2nd ed., Chernaya Belka Publishing, 2008

1980

[4] [4]

Bai, Y. et al. (2022). Constitutional AI: Harmlessness from AI Feedback. arXiv:2212.08073

work page internal anchor Pith review Pith/arXiv arXiv 2022

[5] [5]

Bail, C. A. (2021). Breaking the Social Media Prism: How to Make Our Platforms Less Polarizing. Princeton University Press

2021

[6] [6]

Brohan, A. et al. (2023). RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control. arXiv:2307.15818

work page internal anchor Pith review Pith/arXiv arXiv 2023

[7] [7]

Bruner, J. S. and Postman, L. (1947). Tension and tension-release as organizing factors in perception. Journal of Personality, 15(4), 300--308

1947

[8] [8]

Buber, M. (1923). Ich und Du. Insel Verlag

1923

[9] [9]

Aristotle. (c. 350 BCE). Nicomachean Ethics and Metaphysics. Various editions

[10] [10]

Berlin, I. (1969). Four Essays on Liberty. Oxford University Press

1969

[11] [11]

Burke, E. (1790). Reflections on the Revolution in France. J. Dodsley

[12] [12]

Habermas, J. (1981). Theorie des kommunikativen Handelns. Suhrkamp

1981

[13] [13]

Habermas, J. (1992). Faktizit\"at und Geltung. Suhrkamp

1992

[14] [14]

Hayek, F. A. (1945). The use of knowledge in society. American Economic Review, 35(4), 519--530

1945

[15] [15]

Hayek, F. A. (1960). The Constitution of Liberty. University of Chicago Press

1960

[16] [16]

Hobbes, T. (1651). Leviathan. Andrew Crooke

[17] [17]

Landemore, H. (2013). Democratic Reason: Politics, Collective Intelligence, and the Rule of the Many. Princeton University Press

2013

[18] [18]

MacIntyre, A. (1981). After Virtue. University of Notre Dame Press

1981

[19] [19]

Oakeshott, M. (1962). Rationalism in Politics and Other Essays. Methuen

1962

[20] [20]

Ober, J. (2008). Democracy and Knowledge: Innovation and Learning in Classical Athens. Princeton University Press

2008

[21] [21]

Plato. (c. 380 BCE). Republic. Various editions

[22] [22]

Popper, K. (1945). The Open Society and Its Enemies. Routledge

1945

[23] [23]

Rawls, J. (1971). A Theory of Justice. Harvard University Press

1971

[24] [24]

Rawls, J. (1993). Political Liberalism. Columbia University Press

1993

[25] [25]

Rousseau, J.-J. (1762). Du contrat social. Marc-Michel Rey

[26] [26]

Sunstein, C. R. (2006). Infotopia: How Many Minds Produce Knowledge. Oxford University Press

2006

[27] [27]

Taylor, C. (1989). Sources of the Self. Harvard University Press

1989

[28] [28]

Taylor, C. (1992). The politics of recognition. In Gutmann, A. (ed.), Multiculturalism and the Politics of Recognition, pp. 25--73. Princeton University Press

1992

[29] [29]

Clark, A. (2013). Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behavioral and Brain Sciences, 36(3), 181--204

2013

[30] [30]

Costa, P. T. and McCrae, R. R. (1992). Revised NEO Personality Inventory (NEO-PI-R) and NEO Five-Factor Inventory (NEO-FFI) Professional Manual. Psychological Assessment Resources

1992

[31] [31]

Dennett, D. C. (1991). Consciousness Explained. Little, Brown and Company

1991

[32] [32]

Dewey, J. (1925). Experience and Nature. Open Court Publishing

1925

[33] [33]

Foucault, M. (1966). Les mots et les choses: Une arch\'eologie des sciences humaines. Gallimard

1966

[34] [34]

European Union. (2024). Regulation (EU) 2024/1689 of the European Parliament and of the Council of 13 June 2024 laying down harmonised rules on artificial intelligence. Official Journal of the European Union

2024

[35] [35]

National Institute of Standards and Technology. (2023). Artificial Intelligence Risk Management Framework (AI RMF 1.0). NIST AI 100-1

2023

[36] [36]

Organisation for Economic Co-operation and Development. (2024). Recommendation of the Council on Artificial Intelligence. OECD Legal Instruments, OECD/LEGAL/0449

2024

[37] [37]

Friston, K. (2010). The free-energy principle: a unified brain theory? Nature Reviews Neuroscience, 11(2), 127--138

2010

[38] [38]

Friston, K., FitzGerald, T., Rigoli, F., Schwartenbeck, P., and Pezzulo, G. (2017). Active inference: a process theory. Neural Computation, 29(1), 1--49

2017

[39] [39]

Gibson, J. J. (1979). The Ecological Approach to Visual Perception. Houghton Mifflin

1979

[40] [40]

Gmytrasiewicz, P. J. and Doshi, P. (2005). A framework for sequential planning in multi-agent settings. Journal of Artificial Intelligence Research, 24, 49--79

2005

[41] [41]

Goldberg, L. R. (1990). An alternative ``description of personality'': The Big-Five factor structure. Journal of Personality and Social Psychology, 59(6), 1216--1229

1990

[42] [42]

Grice, H. P. (1975). Logic and conversation. In Cole, P. and Morgan, J. L. (eds.), Syntax and Semantics, Vol. 3: Speech Acts, pp. 41--58. Academic Press

1975

[43] [43]

World Models

Ha, D. and Schmidhuber, J. (2018). World Models. arXiv:1803.10122

work page internal anchor Pith review Pith/arXiv arXiv 2018

[44] [44]

Hafner, D., Lillicrap, T., Ba, J., and Norouzi, M. (2020). Dream to Control: Learning Behaviors by Latent Imagination. International Conference on Learning Representations

2020

[45] [45]

Hafner, D., Lillicrap, T., Norouzi, M., and Ba, J. (2021). Mastering Atari with Discrete World Models. International Conference on Learning Representations

2021

[46] [46]

Hegel, G. W. F. (1807). Ph\"anomenologie des Geistes. Joseph Anton Goebhardt

[47] [47]

Heidegger, M. (1927). Sein und Zeit. Max Niemeyer Verlag

1927

[48] [48]

A., Madigan, D., Raftery, A

Hoeting, J. A., Madigan, D., Raftery, A. E., and Volinsky, C. T. (1999). Bayesian model averaging: A tutorial. Statistical Science, 14(4), 382-417

1999

[49] [49]

Hume, D. (1748). An Enquiry Concerning Human Understanding. A. Millar

[50] [50]

anomenologie und ph\

Husserl, E. (1913). Ideen zu einer reinen Ph\"anomenologie und ph\"anomenologischen Philosophie. Max Niemeyer

1913

[51] [51]

James, W. (1907). Pragmatism: A New Name for Some Old Ways of Thinking. Longmans, Green, and Co

1907

[52] [52]

Jung, C. G. (1921). Psychologische Typen. Rascher Verlag

1921

[53] [53]

Kahneman, D. (2011). Thinking, Fast and Slow. Farrar, Straus and Giroux

2011

[54] [54]

Kant, I. (1781). Kritik der reinen Vernunft. Johann Friedrich Hartknoch

[55] [55]

Kepinski, A. (1972). Rytm zycia. Wydawnictwo Literackie

1972

[56] [56]

Kierkegaard, S. (1849). Sygdommen til D den [The Sickness Unto Death]. C. A. Reitzel

[57] [57]

Kuhn, T. S. (1962). The Structure of Scientific Revolutions. University of Chicago Press

1962

[58] [58]

Lakoff, G. (1987). Women, Fire, and Dangerous Things: What Categories Reveal about the Mind. University of Chicago Press

1987

[59] [59]

LeCun, Y. (2022). A Path Towards Autonomous Machine Intelligence. OpenReview preprint

2022

[60] [60]

Lehmann, E. L. and Casella, G. (1998). Theory of Point Estimation, 2nd ed. Springer

1998

[61] [61]

L\'evi-Strauss, C. (1958). Anthropologie structurale. Plon

1958

[62] [62]

Levinas, E. (1961). Totalit\'e et infini: Essai sur l'ext\'eriorit\'e. Martinus Nijhoff

1961

[63] [63]

Locke, J. (1690). An Essay Concerning Human Understanding. Thomas Bassett

[64] [64]

Merleau-Ponty, M. (1945). Ph\'enom\'enologie de la perception. Gallimard

1945

[65] [65]

Myers, I. B. and McCaulley, M. H. (1985). Manual: A Guide to the Development and Use of the Myers-Briggs Type Indicator. Consulting Psychologists Press

1985

[66] [66]

Nisbett, R. E. (2003). The Geography of Thought: How Asians and Westerners Think Differently---and Why. Free Press

2003

[67] [67]

Ouyang, L. et al. (2022). Training language models to follow instructions with human feedback. Advances in Neural Information Processing Systems, 35

2022

[68] [68]

Pariser, E. (2011). The Filter Bubble: What the Internet Is Hiding from You. Penguin Press

2011

[69] [69]

Parr, T., Pezzulo, G., and Friston, K. (2022). Active Inference: The Free Energy Principle in Mind, Brain, and Behavior. MIT Press

2022

[70] [70]

Peirce, C. S. (1958). Collected Papers of Charles Sanders Peirce, Vols. 1--8, edited by C. Hartshorne, P. Weiss, and A. W. Burks. Harvard University Press

1958

[71] [71]

Pietrak, K. (2018). The foundations of socionics -- a review. Cognitive Systems Research, 47, 1--11

2018

[72] [72]

and Woodruff, G

Premack, D. and Woodruff, G. (1978). Does the chimpanzee have a theory of mind? Behavioral and Brain Sciences, 1(4), 515--526

1978

[73] [73]

and Stengers, I

Prigogine, I. and Stengers, I. (1984). Order Out of Chaos: Man's New Dialogue with Nature. Bantam Books

1984

[74] [74]

Quine, W. V. O. (1960). Word and Object. MIT Press

1960

[75] [75]

Rabinowitz, N. C. et al. (2018). Machine theory of mind. Proceedings of the 35th International Conference on Machine Learning, 80, 4218--4227

2018

[76] [76]

Rafailov, R. et al. (2023). Direct Preference Optimization: Your Language Model is Secretly a Reward Model. Advances in Neural Information Processing Systems, 36

2023

[77] [77]

Rao, R. P. N. and Ballard, D. H. (1999). Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nature Neuroscience, 2(1), 79--87

1999

[78] [78]

Sartre, J.-P. (1943). L'\^etre et le n\'eant: Essai d'ontologie ph\'enom\'enologique. Gallimard

1943

[79] [79]

Sapir, E. (1929). The Status of Linguistics as a Science. Language, 5(4), 207--214

1929

[80] [80]

Saussure, F. de. (1916). Cours de linguistique g\'en\'erale. Payot

1916