What Type of Inference is Active Inference?
Pith reviewed 2026-06-28 06:43 UTC · model grok-4.3
The pith
Active inference requires both entropy corrections and a planning correction to achieve full expected free energy minimization.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Expected free energy minimization equals variational free energy minimization on a generative model augmented with epistemic priors; the augmented variational free energy decomposes into the predictive model's variational free energy plus explicit entropy-correction terms; full EFE-based planning further requires a planning correction that turns marginal inference into policy optimization, yielding a complete variational characterization together with an associated message-passing scheme.
What carries the argument
The decomposition of the variational free energy of the epistemically augmented model into predictive variational free energy plus explicit entropy-correction terms, together with the planning correction that converts marginal inference into policy optimization.
If this is right
- The variational free energy of the augmented model equals the predictive model's variational free energy plus explicit entropy-correction terms.
- Full EFE-based planning combines the epistemic corrections with a planning correction.
- The decomposition yields a message-passing scheme for EFE-based planning and simpler ablations.
- In grid-world tasks the full combination outperforms ablations that omit either correction.
Where Pith is reading between the lines
- The explicit separation of corrections may allow implementers to add or remove information-seeking behavior without rewriting the entire planner.
- The same structure could be used to derive message-passing updates for other free-energy objectives that mix goal-directed and epistemic terms.
- Because the corrections are stated in terms of standard variational quantities, they may transfer to continuous or high-dimensional state spaces where exact marginals are unavailable.
Load-bearing premise
That expected free energy minimization can be expressed as variational free energy minimization on a generative model augmented with epistemic priors.
What would settle it
An ablation experiment in which an agent performs full EFE planning without the planning correction and still matches the performance and policy distribution of the complete version.
Figures
read the original abstract
Active inference casts decision-making as inference, with the Expected Free Energy (EFE) unifying goal-directed and information-seeking behavior. Recent work showed that EFE minimization can be written as Variational Free Energy (VFE) minimization on a generative model augmented with epistemic priors. We prove that the VFE of the augmented model can be rewritten as the VFE of the predictive model plus explicit entropy-correction terms, making the EFE contribution transparent. We then show that proper EFE-based planning requires combining these epistemic corrections with a planning correction that turns marginal inference into policy optimization, yielding a full variational characterization of EFE-based planning. This clarifies which corrections are needed for cross-entropy planning and for full EFE-based planning. The same entropy-corrected formulation leads to a detailed message-passing scheme for EFE-based planning together with simpler ablations. Experiments on three grid-world environments show that full EFE-based planning outperforms ablations that omit either the planning correction or the epistemic corrections.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims that Expected Free Energy (EFE) minimization can be expressed as Variational Free Energy (VFE) minimization on a generative model augmented with epistemic priors. It proves that the VFE on the augmented model equals the VFE on the predictive model plus explicit entropy-correction terms. It further demonstrates that EFE-based planning requires an additional planning correction to transform marginal inference into policy optimization, providing a complete variational characterization. This leads to a message-passing scheme for EFE-based planning and ablations. Experiments in three grid-world environments show that the full EFE-based planning outperforms ablations missing either the planning or epistemic corrections.
Significance. If the results hold, this work provides a significant clarification on the nature of inference in active inference by decomposing EFE into standard VFE plus specific corrections. The explicit entropy corrections and the planning correction are useful for understanding and implementing EFE-based planning. Credit is given for deriving the message-passing scheme from the corrected formulation and for the ablation studies in grid-worlds that support the necessity of both corrections. This could advance the field by making the contributions of different terms transparent.
minor comments (2)
- [Abstract] The abstract mentions 'cross-entropy planning' without a brief definition or reference, which may confuse readers unfamiliar with the term.
- The paper would benefit from including the specific names or characteristics of the three grid-world environments in the abstract or early introduction for better context.
Simulated Author's Rebuttal
We thank the referee for the positive and accurate summary of our work, the recognition of its significance, and the recommendation for minor revision. No specific major comments were raised in the report.
Circularity Check
No significant circularity in derivation chain
full rationale
The paper starts from standard VFE and EFE definitions (as stated in the abstract), references recent work only for the initial augmented-model rewriting, and then derives explicit entropy-correction terms plus a planning correction that converts marginal inference to policy optimization. These steps are presented as independent proofs and rewritings that produce a message-passing scheme and ablation comparisons; no load-bearing step reduces by construction to a fitted input, self-citation chain, or renamed ansatz. The central claim therefore remains self-contained against the stated definitions.
Axiom & Free-Parameter Ledger
axioms (1)
- standard math Standard variational inference assumptions that a variational distribution approximates the true posterior and that free-energy minimization yields useful inference.
Reference graph
Works this paper leans on
-
[1]
Adamiat, Sepideh and Kouw, Wouter M. and. Message. Active. doi:10.1007/978-3-031-77138-5_14 , abstract =
-
[2]
and Shipp, Stewart and Friston, Karl J
Adams, Rick A. and Shipp, Stewart and Friston, Karl J. , year = 2013, journal =. Predictions Not Commands: Active Inference in the Motor System , shorttitle =. doi:10.1007/s00429-012-0475-5 , url =
-
[3]
and Murphy, Kevin , year = 2018, month = jul, pages =
Alemi, Alexander and Poole, Ben and Fischer, Ian and Dillon, Joshua and Saurous, Rif A. and Murphy, Kevin , year = 2018, month = jul, pages =. Fixing a. Proceedings of the 35th
2018
-
[4]
Computationally Efficient Convolved Multiple Output
Alvarez, Mauricio A and Lawrence, Neil D , year = 2011, journal =. Computationally Efficient Convolved Multiple Output
2011
-
[5]
and Rosasco, Lorenzo and Lawrence, Neil D
Alvarez, Mauricio A. and Rosasco, Lorenzo and Lawrence, Neil D. , year = 2012, month = apr, number =. Kernels for. arXiv , langid =:1106.6251 , primaryclass =
Pith/arXiv arXiv 2012
-
[6]
, year = 2012, month = jun, pages =
Anandkumar, Animashree and Hsu, Daniel and Kakade, Sham M. , year = 2012, month = jun, pages =. A. Proceedings of the 25th
2012
-
[7]
Anil Meera, Ajith and Lanillos, Pablo , editor =. Towards. Active. doi:10.1007/978-3-031-47958-8_3 , abstract =
-
[8]
Natural Hierarchy Emerges from Energy Dispersal , author =. Biosystems , volume =. doi:10.1016/j.biosystems.2008.10.008 , url =
-
[9]
Antoy, Sergio and Hanus, Michael , editor =. Declarative. Logic. doi:10.1007/11680093_2 , abstract =
-
[10]
and Nouri, Ali and Wingate, David , year = 2009, month = jun, series =
Asmuth, John and Li, Lihong and Littman, Michael L. and Nouri, Ali and Wingate, David , year = 2009, month = jun, series =. A. Proceedings of the
2009
-
[11]
A Bayesian Sampling Approach to Exploration in Reinforcement Learning
Asmuth, John and Li, Lihong and Littman, Michael L. and Nouri, Ali and Wingate, David , year = 2012, month = may, number =. A. doi:10.48550/arXiv.1205.2664 , url =. arXiv , keywords =:1205.2664 , primaryclass =
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1205.2664 2012
-
[12]
Planning by Probabilistic Inference , booktitle =
Attias, Hagai , year = 2003, pages =. Planning by Probabilistic Inference , booktitle =
2003
-
[13]
Reinforcement
Azizzadenesheli, Kamyar and Lazaric, Alessandro and Anandkumar, Animashree , year = 2016, month = jun, pages =. Reinforcement. Conference on
2016
-
[14]
Bagaev, Dmitry and. Reactive. doi:10.48550/arXiv.2112.13251 , url =. arXiv , keywords =:2112.13251 , primaryclass =
-
[15]
Bagaev, Dmitry and De Vries, Bert , editor =. Reactive. Scientific Programming , volume =. doi:10.1155/2023/6601690 , url =
-
[16]
Bagaev, Dmitry and. Software Impacts , volume =. doi:10.1016/j.simpa.2022.100299 , url =
-
[17]
Rocket.Jl:
Bagaev, Dmitry , year = 2020, url =. Rocket.Jl:
2020
-
[18]
doi:10.21105/joss.05161 , url =
Bagaev, Dmitry and Podusenko, Albert and de Vries, Bert , year = 2023, month = apr, journal =. doi:10.21105/joss.05161 , url =
-
[19]
ACM Computing Surveys (CSUR) , volume =
A Survey on Reactive Programming , author =. ACM Computing Surveys (CSUR) , volume =. doi:10.1145/2501654.2501666 , url =
-
[20]
and Simpson, Daniel and Rue, H
Bakka, Haakon and Vanhatalo, Jarno and Illian, Janine B. and Simpson, Daniel and Rue, H. Non-Stationary. Spatial Statistics , volume =. doi:10.1016/j.spasta.2019.01.002 , url =
-
[21]
and Daulton, Samuel and Letham, Benjamin and Wilson, Andrew Gordon and Bakshy, Eytan , year = 2020, month = dec, series =
Balandat, Maximilian and Karrer, Brian and Jiang, Daniel R. and Daulton, Samuel and Letham, Benjamin and Wilson, Andrew Gordon and Bakshy, Eytan , year = 2020, month = dec, series =. Proceedings of the 34th
2020
-
[22]
arXiv , keywords =:2111.10530 , primaryclass =
Kalman Filters as the Steady-State Solution of Gradient Descent on Variational Free Energy , author =. arXiv , keywords =:2111.10530 , primaryclass =
-
[23]
Reactive Probabilistic Programming , booktitle =
Baudart, Guillaume and Mandel, Louis and Atkinson, Eric and Sherman, Benjamin and Pouzet, Marc and Carbin, Michael , year = 2020, month = jun, series =. Reactive Probabilistic Programming , booktitle =. doi:10.1145/3385412.3386009 , url =
-
[24]
Beck, Jeff and Ramstead, Maxwell J. D. , year = 2025, month = feb, number =. Dynamic. doi:10.48550/arXiv.2502.21217 , url =. arXiv , keywords =:2502.21217 , primaryclass =
-
[25]
Bellman, Richard , year = 1966, journal =. Dynamic. 1719695 , eprinttype =
1966
-
[26]
Bulletin of the American Mathematical Society , volume =
The Theory of Dynamic Programming , author =. Bulletin of the American Mathematical Society , volume =. doi:10.1090/S0002-9904-1954-09848-8 , url =
-
[27]
Robustness in Identification and Control , author =
Robust Model Predictive Control:. Robustness in Identification and Control , author =. doi:10.1007/BFb0109870 , abstract =
-
[28]
Multidimensional Binary Search Trees Used for Associative Searching , author =. Commun. ACM , volume =. doi:10.1145/361002.361007 , url =
-
[29]
Dynamic Programming and Optimal Control:
Bertsekas, Dimitri , year = 2012, volume =. Dynamic Programming and Optimal Control:
2012
-
[30]
Julia: A Fresh Approach to Numerical Computing
Bezanson, Jeff and Edelman, Alan and Karpinski, Stefan and Shah, Viral B. , year = 2015, month = jul, number =. Julia:. doi:10.48550/arXiv.1411.1607 , url =. arXiv , keywords =:1411.1607 , primaryclass =
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1411.1607 2015
-
[31]
Bingham, Eli and Chen, Jonathan P and Jankowiak, Martin and Obermeyer, Fritz and Pradhan, Neeraj and Karaletsos, Theofanis and Singh, Rohit and Szerlip, Paul and Horsfall, Paul and Goodman, Noah D , year = 2019, journal =. Pyro:
2019
-
[32]
Pattern Recognition and Machine Learning , author =
-
[33]
, year = 2011, month = dec, journal =
Blackmore, Lars and Ono, Masahiro and Williams, Brian C. , year = 2011, month = dec, journal =. Chance-. doi:10.1109/TRO.2011.2161160 , url =
-
[34]
Blei, David M. and Jordan, Michael I. , year = 2006, month = mar, journal =. Variational Inference for. doi:10.1214/06-BA104 , url =
-
[35]
and Kucukelbir, Alp and McAuliffe, Jon D
Blei, David M. and Kucukelbir, Alp and McAuliffe, Jon D. , year = 2017, month = apr, journal =. Variational. doi:10.1080/01621459.2017.1285773 , url =
-
[36]
Bliznyuk, Nikolay and Ruppert, David and Shoemaker, Christine and Regis, Rommel and Wild, Stefan and Mugunthan, Pradeep , year = 2008, month = jun, journal =. Bayesian. doi:10.1198/106186008X320681 , url =
-
[37]
Biosystems Engineering , volume =
Minimising the Non-Working Distance Travelled by Machines Operating in a Headland Field Pattern , author =. Biosystems Engineering , volume =. doi:10.1016/j.biosystemseng.2008.06.008 , url =
-
[38]
Bolin, David and Lindgren, Finn , year = 2011, journal =. Spatial. 23024839 , eprinttype =
2011
-
[39]
Artificial Intelligence , volume =
Planning as Heuristic Search , author =. Artificial Intelligence , volume =. doi:10.1016/S0004-3702(01)00108-4 , url =
-
[40]
Multi-Task
Bonilla, Edwin V and Chai, Kian and Williams, Christopher , year = 2007, volume =. Multi-Task. Advances in
2007
-
[41]
The International Journal of Robotics Research , volume =
Closing the Learning-Planning Loop with Predictive State Representations , author =. The International Journal of Robotics Research , volume =. doi:10.1177/0278364911404092 , url =
-
[42]
Mat\'ern
Borovitskiy, Viacheslav and Terenin, Alexander and Mostowsky, Peter and Deisenroth, Marc Peter , year = 2020, month = dec, series =. Mat\'ern. Proceedings of the 34th
2020
-
[43]
Trends in Cognitive Sciences , volume =
Planning as Inference , author =. Trends in Cognitive Sciences , volume =. doi:10.1016/j.tics.2012.08.006 , url =
-
[44]
Generating Sentences from a Continuous Space , booktitle =
Bowman, Samuel and Vilnis, Luke and Vinyals, Oriol and Dai, Andrew and Jozefowicz, Rafal and Bengio, Samy , year = 2016, pages =. Generating Sentences from a Continuous Space , booktitle =
2016
-
[45]
Bradbury, James and Frostig, Roy and Hawkins, Peter and Johnson, Matthew James and Leary, Chris and Maclaurin, Dougal and Necula, George and Paszke, Adam and VanderPlas, Jake and
-
[46]
Brockman, Greg and Cheung, Vicki and Pettersson, Ludwig and Schneider, Jonas and Schulman, John and Tang, Jie and Zaremba, Wojciech , year = 2016, month = jun, number =. doi:10.48550/arXiv.1606.01540 , url =. arXiv , keywords =:1606.01540 , primaryclass =
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1606.01540 2016
-
[47]
Scalable
Bruinsma, Wessel and Perim, Eric and Tebbutt, William and Hosking, Scott and Solin, Arno and Turner, Richard , year = 2020, month = nov, pages =. Scalable. Proceedings of the 37th
2020
-
[48]
and Kim, Chang Sub and McGregor, Simon and Seth, Anil K
Buckley, Christopher L. and Kim, Chang Sub and McGregor, Simon and Seth, Anil K. , year = 2017, month = dec, journal =. The Free Energy Principle for Action and Perception:. doi:10.1016/j.jmp.2017.09.004 , url =
-
[49]
and Nguyen, Cuong V
Bui, Thang D. and Nguyen, Cuong V. and Turner, Richard E. , year = 2017, month = dec, series =. Streaming Sparse. Proceedings of the 31st
2017
-
[50]
Finding the Outliers in Scanpath Data , booktitle =
Burch, Michael and Kumar, Ayush and Mueller, Klaus and Kervezee, Titus and Nuijten, Wouter and Oostenbach, Rens and Peeters, Lucas and Smit, Gijs , year = 2019, month = jun, series =. Finding the Outliers in Scanpath Data , booktitle =. doi:10.1145/3317958.3318225 , url =
-
[51]
Carpenter, Bob and Gelman, Andrew and Hoffman, Matthew D. and Lee, Daniel and Goodrich, Ben and Betancourt, Michael and Brubaker, Marcus and Guo, Jiqiang and Li, Peter and Riddell, Allen , year = 2017, journal =. doi:10.18637/jss.v076.i01 , url =
-
[52]
Stochastic Versions of the Em Algorithm: An Experimental Study in the Mixture Case , shorttitle =
Celeux,. Stochastic Versions of the Em Algorithm: An Experimental Study in the Mixture Case , shorttitle =. Journal of Statistical Computation and Simulation , volume =. doi:10.1080/00949659608811772 , url =
-
[53]
Champion, Th. Branching. Neural Networks , volume =. doi:10.1016/j.neunet.2022.03.036 , url =
-
[54]
Chang, Huiwen and Zhang, Han and Barber, Jarred and Maschinot, A. J. and Lezama, Jose and Jiang, Lu and Yang, Ming-Hsuan and Murphy, Kevin and Freeman, William T. and Rubinstein, Michael and Li, Yuanzhen and Krishnan, Dilip , year = 2023, month = jan, number =. Muse:. doi:10.48550/arXiv.2301.00704 , url =. arXiv , keywords =:2301.00704 , primaryclass =
-
[55]
Chen, Fan and Wang, Huan and Xiong, Caiming and Mei, Song and Bai, Yu , year = 2023, month = jul, pages =. Lower. Proceedings of the 40th
2023
-
[56]
Loop Calculus in Statistical Physics and Information Science , author =. Physical Review E , volume =. doi:10.1103/PhysRevE.73.065102 , url =
-
[57]
Advances in Neural Information Processing Systems , volume =
Minigrid & Miniworld:. Advances in Neural Information Processing Systems , volume =
-
[58]
Corenflos, Adrien and Zhao, Zheng and S. Temporal. 2022 25th. doi:10.23919/FUSION49751.2022.9841306 , url =
-
[59]
Introduction to Algorithms , author =
-
[60]
BaLoRA: Bayesian Low-Rank Adaptation of Large Scale Models
Coscia, Dario and L. doi:10.48550/arXiv.2605.08110 , url =. arXiv , keywords =:2605.08110 , primaryclass =
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2605.08110
-
[61]
International
Cox, Marco and. International
-
[62]
American journal of physics , volume =
Probability, Frequency and Reasonable Expectation , author =. American journal of physics , volume =
-
[63]
and Ramaker, B
Cutler, Richard R. and Ramaker, B. L. , year = 1979, journal =. Dynamic
1979
-
[64]
Active Inference on Discrete State-Spaces:
Da Costa, Lancelot and Parr, Thomas and Sajid, Noor and Veselic, Sebastijan and Neacsu, Victorita and Friston, Karl , year = 2020, month = dec, journal =. Active Inference on Discrete State-Spaces:. doi:10.1016/j.jmp.2020.102447 , url =
-
[65]
Da Costa, Lancelot and Tenka, Samuel and Zhao, Dominic and Sajid, Noor , year = 2024, month = jan, number =. Active. doi:10.48550/arXiv.2401.12917 , url =. arXiv , keywords =:2401.12917 , primaryclass =
-
[66]
doi:10.5194/essd-15-317-2023 , url =
Earth System Science Data , volume =. doi:10.5194/essd-15-317-2023 , url =
-
[67]
IEEE Transactions on Geoscience and Remote Sensing , volume =
Bayesian. IEEE Transactions on Geoscience and Remote Sensing , volume =. doi:10.1109/TGRS.2024.3434443 , url =
-
[68]
Differentiable Expected Hypervolume Improvement for Parallel Multi-Objective
Daulton, Samuel and Balandat, Maximilian and Bakshy, Eytan , year = 2020, month = dec, series =. Differentiable Expected Hypervolume Improvement for Parallel Multi-Objective. Proceedings of the 34th
2020
-
[69]
Parallel
Daulton, Samuel and Balandat, Maximilian and Bakshy, Eytan , year = 2021, month = dec, series =. Parallel. Proceedings of the 35th
2021
-
[70]
, year = 2007, month = jun, pages =
Dauwels, J. , year = 2007, month = jun, pages =. On. doi:10.1109/ISIT.2007.4557602 , abstract =
-
[71]
doi:10.1109/MMAR.2012.6347921 , url =
Consistent Control Hierarchies with Top Layers Represented by Timed Event Graphs , booktitle =. doi:10.1109/MMAR.2012.6347921 , url =
-
[72]
Spatial and. Entropy , volume =. doi:10.3390/e26010083 , url =
-
[73]
De Vries, Bert , year = 2026, month = mar, number =. Active. doi:10.48550/arXiv.2603.20927 , url =. arXiv , keywords =:2603.20927 , primaryclass =
-
[74]
De Vries, Bert and Nuijten, Wouter and. Expected. doi:10.48550/arXiv.2504.14898 , url =. arXiv , keywords =:2504.14898 , primaryclass =
-
[75]
Journal of neural engineering , volume =
The Neural Optimal Control Hierarchy for Motor Control , author =. Journal of neural engineering , volume =
-
[76]
Dillon, Joshua V. and Langmore, Ian and Tran, Dustin and Brevdo, Eugene and Vasudevan, Srinivas and Moore, Dave and Patton, Brian and Alemi, Alex and Hoffman, Matt and Saurous, Rif A. , year = 2017, month = nov, number =. doi:10.48550/arXiv.1711.10604 , url =. arXiv , keywords =:1711.10604 , primaryclass =
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1711.10604 2017
-
[77]
IJCAI : proceedings of the conference , volume =
Hidden. IJCAI : proceedings of the conference , volume =
-
[78]
Stochastic
Dritsas, Ioannis , year = 2011, month = feb, publisher =. Stochastic
2011
-
[79]
Drusch, M. and Del Bello, U. and Carlier, S. and Colin, O. and Fernandez, V. and Gascon, F. and Hoersch, B. and Isola, C. and Laberinti, P. and Martimort, P. and Meygret, A. and Spoto, F. and Sy, O. and Marchese, F. and Bargellini, P. , year = 2012, month = may, journal =. Sentinel-2:. doi:10.1016/j.rse.2011.11.026 , url =
-
[80]
Duane, Simon and Kennedy, A. D. and Pendleton, Brian J. and Roweth, Duncan , year = 1987, month = sep, journal =. Hybrid. doi:10.1016/0370-2693(87)91197-X , url =
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.