Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
Pith reviewed 2026-05-22 21:37 UTC · model grok-4.3
The pith
Foundation agents gain structure from modular architectures that map directly onto human brain functions for reasoning, adaptation, collaboration, and safety.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Intelligent agents are productively understood through modular, brain-inspired architectures that integrate cognitive science and computational principles, with core components including memory, world modeling, reward processing, goal setting, and emotion; these architectures support self-enhancement via automated optimization, collective intelligence through multi-agent interactions, and safety via intrinsic and extrinsic mitigation strategies.
What carries the argument
modular, brain-inspired architectures that map cognitive, perceptual, and operational modules onto human brain functionalities
If this is right
- Agents can achieve continual learning by autonomously refining capabilities through automated optimization paradigms.
- Collective intelligence emerges when multiple agents interact, cooperate, and form societal structures.
- Security threats can be addressed by combining intrinsic mechanisms with extrinsic alignment and robustness techniques.
- Research efforts can be directed toward harmonizing module-level advances with overall societal benefit.
Where Pith is reading between the lines
- The same modular breakdown could be used to design new benchmarks that test each brain-like function separately rather than end-to-end performance alone.
- Neuroscience findings on specific brain processes could be translated into concrete module upgrades without requiring full biological fidelity.
- Real-world deployment pipelines might adopt the four-part structure to audit agents for safety before scaling.
Load-bearing premise
Mapping agent modules onto human brain functionalities supplies a productive organizing framework for future agent research.
What would settle it
A controlled comparison in which non-brain-mapped modular agent designs consistently outperform brain-mapped ones on the same tasks and benchmarks would falsify the framework's utility.
Figures
read the original abstract
The advent of large language models (LLMs) has catalyzed a transformative shift in artificial intelligence, paving the way for advanced intelligent agents capable of sophisticated reasoning, robust perception, and versatile action across diverse domains. As these agents increasingly drive AI research and practical applications, their design, evaluation, and continuous improvement present intricate, multifaceted challenges. This book provides a comprehensive overview, framing intelligent agents within modular, brain-inspired architectures that integrate principles from cognitive science, neuroscience, and computational research. We structure our exploration into four interconnected parts. First, we systematically investigate the modular foundation of intelligent agents, systematically mapping their cognitive, perceptual, and operational modules onto analogous human brain functionalities and elucidating core components such as memory, world modeling, reward processing, goal, and emotion. Second, we discuss self-enhancement and adaptive evolution mechanisms, exploring how agents autonomously refine their capabilities, adapt to dynamic environments, and achieve continual learning through automated optimization paradigms. Third, we examine multi-agent systems, investigating the collective intelligence emerging from agent interactions, cooperation, and societal structures. Finally, we address the critical imperative of building safe and beneficial AI systems, emphasizing intrinsic and extrinsic security threats, ethical alignment, robustness, and practical mitigation strategies necessary for trustworthy real-world deployment. By synthesizing modular AI architectures with insights from different disciplines, this survey identifies key research challenges and opportunities, encouraging innovations that harmonize technological advancement with meaningful societal benefit.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper is a survey that provides a comprehensive overview of foundation agents by framing them within modular, brain-inspired architectures integrating cognitive science, neuroscience, and computational research. It is structured into four parts: (1) modular foundations mapping agent components such as memory, world modeling, reward processing, goal, and emotion onto human brain functionalities; (2) self-enhancement, adaptive evolution, and continual learning mechanisms; (3) multi-agent systems and collective intelligence from interactions; and (4) safety, ethical alignment, robustness, and mitigation strategies. The central claim is that this synthesis identifies key research challenges and opportunities to advance agents while ensuring societal benefit.
Significance. If the synthesis is balanced and representative, the survey offers a structured interdisciplinary lens that could help organize the rapidly growing literature on agent architectures. It explicitly highlights the value of modular designs and the imperative for safe systems. The brain-inspired mapping is used as an organizational scaffold rather than a falsifiable hypothesis, so the reader's noted concern about its productivity as a framework does not constitute a load-bearing flaw for the survey's contribution. No new theorems, empirical results, or parameter-free derivations are claimed, which is appropriate for this genre but limits the strength of the significance assessment.
minor comments (1)
- [Abstract] Abstract: The text refers to 'this book provides a comprehensive overview' while the work is presented as an arXiv paper (arXiv:2504.01990). Clarifying the intended format (survey paper vs. book chapter) would avoid potential reader confusion about scope and publication venue.
Simulated Author's Rebuttal
We thank the referee for the positive and constructive summary of our survey. We appreciate the recommendation for minor revision and the recognition that the brain-inspired modular framing serves as an organizational scaffold rather than a falsifiable hypothesis, which aligns with the survey genre. Since no specific major comments were enumerated in the report, we have no point-by-point revisions to address at this stage but remain ready to incorporate any editorial suggestions.
Circularity Check
No significant circularity; survey is organizational synthesis only
full rationale
The paper is a survey that structures existing literature around a brain-inspired modular framework without asserting new derivations, equations, quantitative predictions, or theorems. The abstract and structure describe an organizational scaffold for reviewing cognitive modules, self-enhancement, multi-agent systems, and safety, with no load-bearing steps that reduce to fitted parameters, self-definitions, or self-citation chains. No equations or falsifiable claims are present that could exhibit the required reduction to inputs by construction. The mapping to brain functionalities functions as a descriptive lens rather than a premise whose validity is internally derived or assumed without external support.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Principles from cognitive science, neuroscience, and computational research can be integrated into modular architectures for intelligent agents.
Forward citations
Cited by 31 Pith papers
-
Beyond Individual Intelligence: Surveying Collaboration, Failure Attribution, and Self-Evolution in LLM-based Multi-Agent Systems
A survey that unifies prior work on multi-agent LLM systems via the LIFE framework, mapping dependencies across collaboration, failure attribution, and autonomous self-evolution while identifying cross-stage challenges.
-
Harnessing Agentic Evolution
AEvo introduces a meta-agent that edits the evolution procedure or agent context based on accumulated state, outperforming baselines by 26% relative improvement on agentic benchmarks and achieving SOTA on open-ended tasks.
-
AgentForesight: Online Auditing for Early Failure Prediction in Multi-Agent Systems
AgentForesight trains a 7B model to perform online auditing of multi-agent LLM trajectories, detecting early decisive errors and outperforming larger models on custom and external benchmarks.
-
AgentForesight: Online Auditing for Early Failure Prediction in Multi-Agent Systems
AgentForesight introduces an online auditor model that predicts decisive errors in multi-agent trajectories at the earliest step using a coarse-to-fine reinforcement learning recipe on a new curated dataset AFTraj-2K.
-
Post Reasoning: Improving the Performance of Non-Thinking Models at No Cost
Post-Reasoning boosts LLM accuracy by reversing the usual answer-after-reasoning order, delivering mean relative gains of 17.37% across 117 model-benchmark pairs with zero extra cost.
-
ReCast: Recasting Learning Signals for Reinforcement Learning in Generative Recommendation
ReCast repairs all-zero groups and uses contrastive updates on strongest positives and hardest negatives to improve RL in generative recommendation, yielding up to 36.6% better Pass@1 with only 4.1% of baseline rollou...
-
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning
VideoDR is a new benchmark for open-web video deep research that tests multimodal models on cross-frame visual anchor extraction, interactive retrieval, and multi-hop reasoning over joint video-web evidence.
-
FOREVER: Forgetting Curve-Inspired Memory Replay for Language Model Continual Learning
FOREVER aligns replay intervals in LLM continual learning with a model-centric time based on optimizer update magnitudes and an Ebbinghaus-inspired forgetting curve to reduce catastrophic forgetting.
-
Mem-$\pi$: Adaptive Memory through Learning When and What to Generate
Mem-π is a framework using a dedicated model and decision-content decoupled RL to generate context-specific guidance on demand for LLM agents, outperforming retrieval baselines by over 30% on web navigation.
-
Rewarding Beliefs, Not Actions: Consistency-Guided Credit Assignment for Long-Horizon Agents
ReBel uses belief-consistency supervision and belief-aware grouping to improve credit assignment in long-horizon RL for LLM agents, achieving up to 20.4 percentage points higher success and 2.1x better sample efficien...
-
LPDS: Evaluating LLM Robustness Through Logic-Preserving Difficulty Scaling
LPDS quantifies difficulty of logic-preserving problem variations and searches for the hardest ones, producing up to 5x larger performance drops than random sampling and better robustness gains from fine-tuning on dif...
-
FuzzAgent: Multi-Agent System for Evolutionary Library Fuzzing
FuzzAgent deploys specialized agents that collaborate on harness generation, execution, and crash triage to evolve fuzzing campaigns, delivering 45-191% more branch coverage than four baselines on 20 C/C++ libraries a...
-
CHAL: Council of Hierarchical Agentic Language
CHAL is a multi-agent dialectic system that performs structured belief optimization over defeasible domains using Bayesian-inspired graph representations and configurable meta-cognitive value system hyperparameters.
-
MemORAI: Memory Organization and Retrieval via Adaptive Graph Intelligence for LLM Conversational Agents
MemORAI combines selective filtering, provenance tracking in multi-relational graphs, and dynamic weighted PageRank retrieval to achieve state-of-the-art memory retrieval and personalized responses in LLM agents on LO...
-
Learning to Communicate: Toward End-to-End Optimization of Multi-Agent Language Systems
DiffMAS jointly optimizes latent communication and reasoning in multi-agent LLM systems via parameter-efficient supervised training on trajectories, yielding consistent gains over baselines on math, science, and code ...
-
Do LLMs Need to See Everything? A Benchmark and Study of Failures in LLM-driven Smartphone Automation using Screentext vs. Screenshots
A new benchmark shows LLM smartphone agents achieve comparable success with screen text alone as with screenshots, but both fail often due to UI accessibility and reasoning gaps.
-
Agent-GWO: Collaborative Agents for Dynamic Prompt Optimization in Large Language Models
Agent-GWO uses collaborative grey-wolf-inspired agents to jointly optimize LLM prompts and decoding settings, yielding higher accuracy and stability than prior single-agent prompt optimization methods on math and hybr...
-
ADAM: A Systematic Data Extraction Attack on Agent Memory via Adaptive Querying
ADAM extracts data from LLM agent memory with up to 100% attack success rate by estimating data distribution and selecting queries via entropy guidance.
-
Memory in the Age of AI Agents
The paper maps agent memory research via three forms (token-level, parametric, latent), three functions (factual, experiential, working), and dynamics of formation/evolution/retrieval, plus benchmarks and future directions.
-
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Survey that defines agentic RL for LLMs via POMDPs, introduces a taxonomy of planning/tool-use/memory/reasoning capabilities and domains, and compiles open environments from over 500 papers.
-
Scalable Environments Drive Generalizable Agents
Generalizable agents require environment scaling via diverse executable rule-sets, distinguished from trajectory and task scaling in a new taxonomy.
-
Beyond Individual Intelligence: Surveying Collaboration, Failure Attribution, and Self-Evolution in LLM-based Multi-Agent Systems
The survey proposes the LIFE framework to unify fragmented research on collaboration, failure attribution, and self-evolution in LLM multi-agent systems into a progression toward self-organizing intelligence.
-
SynthAgent: Adapting Web Agents with Synthetic Supervision
SynthAgent uses dual refinement of synthetic tasks and trajectories to produce higher-quality training data that improves web agent adaptation to target environments.
-
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
UI-TARS-2 reaches 88.2 on Online-Mind2Web, 47.5 on OSWorld, 50.6 on WindowsAgentArena, and 73.3 on AndroidWorld while attaining 59.8 mean normalized score on a 15-game suite through multi-turn RL and scalable data generation.
-
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
A comprehensive review of self-evolving AI agents that improve themselves over time, organized via a framework of inputs, agent system, environment, and optimizers, with domain-specific and safety discussions.
-
Advancing Multi-Agent RAG Systems with Minimalist Reinforcement Learning
Mujica-MyGo decomposes multi-turn RAG interactions via multi-agent workflows and applies minimalist policy gradient optimization to improve performance on QA benchmarks while avoiding long-context problems.
-
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
A survey organizing techniques to achieve efficient reasoning in LLMs by shortening chain-of-thought outputs.
-
Multi-Agent Systems: From Classical Paradigms to Large Foundation Model-Enabled Futures
A survey comparing classical multi-agent systems with large foundation model-enabled multi-agent systems, showing how the latter enables semantic-level collaboration and greater adaptability.
-
Perspective on Bias in Biomedical AI: Preventing Downstream Healthcare Disparities
Omics datasets show low ancestry reporting and strong European bias, which biomedical foundation models risk perpetuating into downstream healthcare disparities unless addressed through provenance, openness, and evalu...
-
Agentic AI Security: Threats, Defenses, Evaluation, and Open Challenges
A survey that taxonomizes threats to agentic AI, reviews benchmarks and evaluation methods, discusses technical and governance defenses, and identifies open challenges.
-
Bridging Brains and Machines: A Unified Frontier in Neuroscience, Artificial Intelligence, and Neuromorphic Systems
A position and survey paper that identifies convergence between neuroscience, AGI, and neuromorphic computing and outlines four key integration challenges.
Reference graph
Works this paper leans on
- [1]
-
[2]
Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et al. Language models are few-shot learners.Advances in neural information processing systems, 33:1877–1901, 2020
work page 1901
-
[3]
Russell and Peter Norvig.Artificial Intelligence: A Modern Approach
Stuart J. Russell and Peter Norvig.Artificial Intelligence: A Modern Approach. Prentice Hall, Englewood Cliffs, NJ, 1 edition, 1995. ISBN 0-13-103805-2
work page 1995
-
[4]
Gps, a program that simulates human thought.Rand Corporation Santa Monica, CA, 1961
Allen Newell and Herbert Alexander Simon. Gps, a program that simulates human thought.Rand Corporation Santa Monica, CA, 1961
work page 1961
-
[5]
Rodney Brooks. A robust layered control system for a mobile robot.IEEE journal on robotics and automation, 2(1):14–23, 1986
work page 1986
-
[6]
Michael Wooldridge.An introduction to multiagent systems. John wiley & sons, 2009
work page 2009
-
[7]
Introducing chatgpt.https://openai.com/blog/chatgpt/, 2022
OpenAI. Introducing chatgpt.https://openai.com/blog/chatgpt/, 2022
work page 2022
-
[8]
Aixin Liu, Bei Feng, Bing Xue, Bingxuan Wang, Bochao Wu, Chengda Lu, Chenggang Zhao, Chengqi Deng,ChenyuZhang,ChongRuan,etal. Deepseek-v3technicalreport. arXivpreprintarXiv:2412.19437 , 2024
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[9]
Claude: The next step in helpful ai.https://www.anthropic.com, 2023
Anthropic. Claude: The next step in helpful ai.https://www.anthropic.com, 2023. Accessed: 2024-12-01
work page 2023
-
[10]
AnYang,BaosongYang,BeichenZhang,BinyuanHui,BoZheng,BowenYu,ChengyuanLi,Dayiheng Liu, Fei Huang, Haoran Wei, et al. Qwen2. 5 technical report.arXiv preprint arXiv:2412.15115, 2024
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[11]
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron, Thibaut Lavril, Gautier Izacard, Xavier Martinet, Marie-Anne Lachaux, Timothée Lacroix, Baptiste Rozière, Naman Goyal, Eric Hambro, Faisal Azhar, et al. Llama: Open and efficient foundation language models.arXiv preprint arXiv:2302.13971, 2023
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[12]
Trainingahelpfulandharmless assistant with rlhf.OpenAI Technical Report, 2022
YuntaoBai,SauravKadavath,SandipanKundu,AmandaAskell,etal. Trainingahelpfulandharmless assistant with rlhf.OpenAI Technical Report, 2022
work page 2022
-
[13]
Principles of neural science, 2013
Eric R Kandel, James H Schwartz, Thomas Jessell, Steven A Siegelbaum, and AJ Hudspeth. Principles of neural science, 2013
work page 2013
-
[14]
Dale Purves, George J Augustine, David Fitzpatrick, William Hall, Anthony-Samuel LaMantia, and Leonard White.Neurosciences. De Boeck Supérieur, 2019
work page 2019
-
[15]
Wikipedia contributors. Phineas Gage. https://en.wikipedia.org/wiki/Phineas_Gage, 2025. Accessed: June 8, 2025
work page 2025
-
[16]
Noga Larry, Gil Zur, and Mati Joshua. Organization of reward and movement signals in the basal ganglia and cerebellum.Nature Communications, 15(1):2119, 2024. 296 Bibliography
work page 2024
-
[17]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. Attention is all you need. InAdvances in Neural Information Processing Systems (NeurIPS), volume 30, pages 5998–6008. Curran Associates, Inc., 2017
work page 2017
-
[18]
Attention mechanisms in computer vision: A survey.Computational visual media, 8(3):331–368, 2022
Meng-Hao Guo, Tian-Xing Xu, Jiang-Jiang Liu, Zheng-Ning Liu, Peng-Tao Jiang, Tai-Jiang Mu, Song-Hai Zhang, Ralph R Martin, Ming-Ming Cheng, and Shi-Min Hu. Attention mechanisms in computer vision: A survey.Computational visual media, 8(3):331–368, 2022
work page 2022
-
[19]
Maurizio Corbetta and Gordon L Shulman. Control of goal-directed and stimulus-driven attention in the brain.Nature reviews neuroscience, 3(3):201–215, 2002
work page 2002
-
[20]
Deep learning.nature, 521(7553):436–444, 2015
Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. Deep learning.nature, 521(7553):436–444, 2015
work page 2015
-
[21]
Wernicke’s area.https://en.wikipedia.org/wiki/Wernicke%27s_area,
Wikipedia contributors. Wernicke’s area.https://en.wikipedia.org/wiki/Wernicke%27s_area,
-
[22]
Accessed: June 8, 2025
work page 2025
-
[23]
Cocktail party effect.https://en.wikipedia.org/wiki/Cocktail_party_ effect, 2025
Wikipedia contributors. Cocktail party effect.https://en.wikipedia.org/wiki/Cocktail_party_ effect, 2025. Accessed: June 8, 2025
work page 2025
-
[24]
James Knierim. Chapter 5: Cerebellum. https://nba.uth.tmc.edu/neuroscience/m/s3/ chapter05.html, 2020. Last review: October 20, 2020; Accessed: June 8, 2025
work page 2020
-
[25]
Torgeir Moberget and Richard B Ivry. Cerebellar contributions to motor control and language comprehension: searching for common computational principles.Annals of the New York Academy of Sciences, 1369(1):154–171, 2016
work page 2016
-
[26]
Kenji Doya. What are the computations of the cerebellum, the basal ganglia and the cerebral cortex? Neural networks, 12(7-8):961–974, 1999
work page 1999
-
[27]
Brainstem: What It Is, Function, Anatomy & Location
Cleveland Clinic. Brainstem: What It Is, Function, Anatomy & Location. https://my. clevelandclinic.org/health/body/21598-brainstem, 2024. Last reviewed: June 12, 2024; Ac- cessed: June 8, 2025
work page 2024
-
[28]
Cleveland Clinic. Thalamus: What It Is, Function & Disorders.https://my.clevelandclinic.org/ health/body/22652-thalamus, 2022. Last reviewed: March 30, 2022; Accessed: June 8, 2025
work page 2022
-
[29]
Thalamic control of sensory selection in divided attention.Nature, 526(7575):705–709, 2015
Ralf D Wimmer, L Ian Schmitt, Thomas J Davidson, Miho Nakajima, Karl Deisseroth, and Michael M Halassa. Thalamic control of sensory selection in divided attention.Nature, 526(7575):705–709, 2015
work page 2015
-
[30]
Limbic system.https://en.wikipedia.org/wiki/Limbic_system, 2025
Wikipedia contributors. Limbic system.https://en.wikipedia.org/wiki/Limbic_system, 2025. Accessed: June 8, 2025
work page 2025
-
[31]
Sanchez Jimenez and Orlando De Jesus
Jose G. Sanchez Jimenez and Orlando De Jesus. Hypothalamic Dysfunction.StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2025 Jan–, 2023. URLhttps://www.ncbi.nlm.nih.gov/ books/NBK560743/. Last update: August 23, 2023; Accessed: June 8, 2025
work page 2025
-
[32]
Yoshua Bengio, Michael Cohen, Damiano Fornasiere, Joumana Ghosn, Pietro Greiner, Matt MacDer- mott,SörenMindermann,AdamOberman,JesseRichardson,OliverRichardson,etal. Superintelligent agents pose catastrophic risks: Can scientist ai offer a safer path?arXiv preprint arXiv:2502.15657, 2025
-
[33]
OpenAI. Chatgpt (gpt-4). urlhttps://chat.openai.com/, 2023. Large language model
work page 2023
-
[34]
React: Synergizing reasoning and acting in language models
Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, and Yuan Cao. React: Synergizing reasoning and acting in language models. InInternational Conference on Learning Representations (ICLR), 2023
work page 2023
-
[35]
Cognitive architectures for language agents.Transactions on Machine Learning Research, 2024
Theodore Sumers, Shunyu Yao, Karthik Narasimhan, and Thomas Griffiths. Cognitive architectures for language agents.Transactions on Machine Learning Research, 2024. Bibliography 297
work page 2024
-
[36]
Generative agents: Interactive simulacra of human behavior
Joon Sung Park, Joseph O’Brien, Carrie Jun Cai, Meredith Ringel Morris, Percy Liang, and Michael S Bernstein. Generative agents: Interactive simulacra of human behavior. InProceedings of the 36th annual acm symposium on user interface software and technology, pages 1–22, 2023
work page 2023
-
[37]
James Kirkpatrick, Razvan Pascanu, Neil Rabinowitz, Joel Veness, Guillaume Desjardins, Andrei A Rusu, Kieran Milan, John Quan, Tiago Ramalho, Agnieszka Grabska-Barwinska, et al. Overcoming catastrophic forgetting in neural networks.Proceedings of the national academy of sciences, 114(13): 3521–3526, 2017
work page 2017
-
[38]
Richard S Sutton, Andrew G Barto, et al.Reinforcement learning: An introduction, volume 1. MIT press Cambridge, 1998
work page 1998
-
[39]
David Ha and Jürgen Schmidhuber. World models.arXiv preprint arXiv:1803.10122, 2018
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[40]
Pattie Maes. Artificial life meets entertainment: lifelike autonomous agents.Communications of the ACM, 38(11):108–114, 1995
work page 1995
-
[41]
Isitanagent,orjustaprogram?: Ataxonomyforautonomousagents
StanFranklinandArtGraesser. Isitanagent,orjustaprogram?: Ataxonomyforautonomousagents. In International workshop on agent theories, architectures, and languages, pages 21–35. Springer, 1997
work page 1997
-
[42]
Concrete Problems in AI Safety
Dario Amodei, Chris Olah, Jacob Steinhardt, Paul Christiano, John Schulman, and Dan Mané. Concrete problems in ai safety.arXiv preprint arXiv:1606.06565, 2016
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[43]
Superintelligence: Paths, dangers, strategies, 2014
Bostrom Nick. Superintelligence: Paths, dangers, strategies, 2014
work page 2014
-
[44]
Continual lifelong learning with neural networks: A review.Neural networks, 113:54–71, 2019
German I Parisi, Ronald Kemker, Jose L Part, Christopher Kanan, and Stefan Wermter. Continual lifelong learning with neural networks: A review.Neural networks, 113:54–71, 2019
work page 2019
- [45]
-
[46]
Oxford University Press, USA, 2019
Gyorgy Buzsaki.The brain from inside out. Oxford University Press, USA, 2019
work page 2019
-
[47]
Action and behavior: a free-energy formulation
Karl J Friston, Jean Daunizeau, James Kilner, and Stefan J Kiebel. Action and behavior: a free-energy formulation. Biological cybernetics, 102:227–260, 2010
work page 2010
-
[48]
Xiaoqian Liu, Xingzhou Lou, Jianbin Jiao, and Junge Zhang. Position: Foundation agents as the paradigm shift for decision making.arXiv preprint arXiv:2405.17009, 2024
-
[49]
Memory and the hippocampus: a synthesis from findings with rats, monkeys, and humans
Larry R Squire. Memory and the hippocampus: a synthesis from findings with rats, monkeys, and humans. Psychological review, 99(2):195, 1992
work page 1992
-
[50]
Jones & Bartlett Learning, 2020
Mark Bear, Barry Connors, and Michael A Paradiso.Neuroscience: exploring the brain, enhanced edition: exploring the brain. Jones & Bartlett Learning, 2020
work page 2020
-
[51]
RajeshPNRaoandDanaHBallard. Predictivecodinginthevisualcortex: afunctionalinterpretation of some extra-classical receptive-field effects.Nature neuroscience, 2(1):79–87, 1999
work page 1999
-
[52]
The emotional brain: The mysterious underpinnings of emotional life
Joseph E LeDoux. The emotional brain: The mysterious underpinnings of emotional life. Simon and Schuster, 1998
work page 1998
-
[53]
Damasio.Descartes’ Error: Emotion, Reason, and the Human Brain
Antonio R. Damasio.Descartes’ Error: Emotion, Reason, and the Human Brain. Putnam, 1994
work page 1994
-
[54]
Earl K Miller and Jonathan D Cohen. An integrative theory of prefrontal cortex function.Annual review of neuroscience, 24(1):167–202, 2001
work page 2001
-
[55]
Cognitive control, hierarchy, and the rostro–caudal organization of the frontal lobes
David Badre. Cognitive control, hierarchy, and the rostro–caudal organization of the frontal lobes. Trends in cognitive sciences, 12(5):193–200, 2008
work page 2008
-
[56]
A neural substrate of prediction and reward
Wolfram Schultz, Peter Dayan, and P Read Montague. A neural substrate of prediction and reward. Science, 275(5306):1593–1599, 1997
work page 1997
-
[57]
Academic Press, 4th edition, 2008
Joaquin M Fuster.The Prefrontal Cortex. Academic Press, 4th edition, 2008
work page 2008
-
[58]
The organisation of mind.Oxford Psychology Series, 32, 2011
Tim Shallice and Richard P Cooper. The organisation of mind.Oxford Psychology Series, 32, 2011. 298 Bibliography
work page 2011
-
[59]
Mindstorms in natural language-based societies of mind.arXiv preprint arXiv:2305.17066, 2023
Mingchen Zhuge, Haozhe Liu, Francesco Faccio, Dylan R Ashley, Róbert Csordás, Anand Gopalakr- ishnan, Abdullah Hamdi, Hasan Abed Al Kader Hammoud, Vincent Herrmann, Kazuki Irie, et al. Mindstorms in natural language-based societies of mind.arXiv preprint arXiv:2305.17066, 2023
-
[60]
Agent AI: Surveying the Horizons of Multimodal Interaction
Zane Durante, Qiuyuan Huang, Naoki Wake, Ran Gong, Jae Sung Park, Bidipta Sarkar, Rohan Taori, Yusuke Noda, Demetri Terzopoulos, Yejin Choi, Katsushi Ikeuchi, Hoi Vo, Li Fei-Fei, and Jianfeng Gao. AGENT AI: SURVEYING THE HORIZONS OF MULTIMODAL INTERACTION.arXiv preprint arXiv:2401.03568, 2024
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[61]
PositionPaper: Agent AI Towards a HolisticIntelligence, 2024
Qiuyuan Huang, Naoki Wake, Bidipta Sarkar, Zane Durante, Ran Gong, Rohan Taori, Yusuke Noda, Demetri Terzopoulos, Noboru Kuno, Ade Famoti, Ashley Llorens, John Langford, Hoi Vo, Li Fei-Fei, Katsu Ikeuchi, and Jianfeng Gao. PositionPaper: Agent AI Towards a HolisticIntelligence, 2024. URL http://arxiv.org/abs/2403.00833
-
[62]
The rise and potential of large language model based agents: A survey, 2023
Zhiheng Xi, Wenxiang Chen, Xin Guo, Wei He, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, et al. The rise and potential of large language model based agents: A survey, 2023
work page 2023
-
[63]
A Survey on Large Language Model based Autonomous Agents
Lei Wang, Chen Ma, Xueyang Feng, Zeyu Zhang, Hao Yang, Jingsen Zhang, Zhiyuan Chen, Jiakai Tang, Xu Chen, Yankai Lin, Wayne Xin Zhao, Zhewei Wei, and Ji-Rong Wen. A Survey on Large Language Model based Autonomous Agents, 2023. URLhttp://arxiv.org/abs/2308.11432
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[64]
Language agents: Foundations, prospects, and risks
Yu Su, Diyi Yang, Shunyu Yao, and Tao Yu. Language agents: Foundations, prospects, and risks. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Tutorial Abstracts, pages 17–24, Miami, Florida, USA, November 2024. Association for Computational Linguistics. URL https://aclanthology.org/2024.emnlp-tutorials.3
work page 2024
-
[65]
The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey
Tula Masterman, Sandi Besen, Mason Sawtell, and Alex Chao. The landscape of emerging ai agent architectures for reasoning, planning, and tool calling: A survey.arXiv preprint arXiv:2404.11584, 2024
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[66]
Chawla, Olaf Wiest, and Xiangliang Zhang
TaichengGuo,XiuyingChen,YaqiWang,RuidiChang,ShichaoPei,NiteshV.Chawla,OlafWiest,and XiangliangZhang. Largelanguagemodelbasedmulti-agents: Asurveyofprogressandchallenges. In Kate Larson, editor,Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, IJCAI-24, pages 8048–8057. International Joint Conferences on Artificial I...
-
[67]
A Survey on the Memory Mechanism of Large Language Model based Agents
Zeyu Zhang, Xiaohe Bo, Chen Ma, Rui Li, Xu Chen, Quanyu Dai, Jieming Zhu, Zhenhua Dong, and Ji-Rong Wen. A survey on the memory mechanism of large language model based agents.arXiv preprint arXiv:2404.13501, 2024
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[68]
Miao Yu, Fanci Meng, Xinyun Zhou, Shilong Wang, Junyuan Mao, Linsey Pang, Tianlong Chen, Kun Wang, Xinfeng Li, Yongfeng Zhang, Bo An, and Qingsong Wen. A survey on trustworthy llm agents: Threats and countermeasures.arXiv preprint arXiv:2503.09648, 2025
-
[69]
Finetuned Language Models Are Zero-Shot Learners
Jason Wei, Maarten Bosma, Vincent Y Zhao, Kelvin Guu, Adams Wei Yu, Brian Lester, Nan Du, Andrew M Dai, and Quoc V Le. Finetuned language models are zero-shot learners.arXiv preprint arXiv:2109.01652, 2021
work page internal anchor Pith review Pith/arXiv arXiv 2021
-
[70]
Parameter-efficient transfer learning for nlp
NeilHoulsby,AndreiGiurgiu,StanislawJastrzebski,BrunaMorrone,QuentinDeLaroussilhe,Andrea Gesmundo, Mona Attariyan, and Sylvain Gelly. Parameter-efficient transfer learning for nlp. In International conference on machine learning, pages 2790–2799. PMLR, 2019
work page 2019
-
[71]
Long Ouyang, Jeffrey Wu, Xu Jiang, Diogo Almeida, Carroll Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, et al. Training language models to follow instructions with human feedback.Advances in neural information processing systems, 35:27730–27744, 2022. Bibliography 299
work page 2022
-
[72]
Reft: Representation finetuning for language models
TrungQuocLuong,XinboZhang,ZhanmingJie,PengSun,XiaoranJin,andHangLi. Reft: Reasoning withreinforcedfine-tuning. In Proceedingsofthe62ndAnnualMeetingoftheAssociationforComputational Linguistics, 2024. URLhttps://arxiv.org/abs/2404.03592
-
[73]
R1-searcher: Incentivizing the search capability in llms via reinforcement learning,
Huatong Song, Jinhao Jiang, Yingqian Min, Jie Chen, Zhipeng Chen, Wayne Xin Zhao, Lei Fang, and Ji-Rong Wen. R1-searcher: Incentivizing the search capability in llms via reinforcement learning,
-
[74]
URL https://arxiv.org/abs/2503.05592
work page internal anchor Pith review Pith/arXiv arXiv
-
[75]
Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed H. Chi, Quoc V. Le, and Denny Zhou. Chain-of-thought prompting elicits reasoning in large language models. In Sanmi Koyejo, S. Mohamed, A. Agarwal, Danielle Belgrave, K. Cho, and A. Oh, editors, Advances in Neural Information Processing Systems 35: Annual Conference on Neura...
work page 2022
-
[76]
Voyager: An Open-Ended Embodied Agent with Large Language Models
Guanzhi Wang, Yuqi Xie, Yunfan Jiang, Ajay Mandlekar, Chaowei Xiao, Yuke Zhu, Linxi Fan, and Anima Anandkumar. Voyager: An open-ended embodied agent with large language models.arXiv preprint arXiv:2305.16291, 2023
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[77]
Reflexion: language agents with verbal reinforcement learning
Noah Shinn, Federico Cassano, Beck Labash, Ashwin Gopinath, Karthik Narasimhan, and Shunyu Yao. Reflexion: language agents with verbal reinforcement learning. InNeural Information Processing Systems, 2023. URLhttps://api.semanticscholar.org/CorpusID:258833055
work page 2023
-
[79]
Learning transferable visual models from natural language supervision
AlecRadford, JongWookKim, ChrisHallacy, AdityaRamesh, GabrielGoh, SandhiniAgarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, and Ilya Sutskever. Learning transferable visual models from natural language supervision. InICML, volume 139 ofProceedings of Machine Learning Research, pages 8748–8763. PMLR, 2021
work page 2021
-
[80]
Haotian Liu, Chunyuan Li, Qingyang Wu, and Yong Jae Lee. Visual instruction tuning. InNeurIPS, 2023
work page 2023
-
[81]
Weihan Wang, Qingsong Lv, Wenmeng Yu, Wenyi Hong, Ji Qi, Yan Wang, Junhui Ji, Zhuoyi Yang, Lei Zhao, Song XiXuan, et al. Cogvlm: Visual expert for pretrained language models.Advances in Neural Information Processing Systems, 37:121475–121499, 2025
work page 2025
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.