人工智能实践(语言智能)
第7讲:Agent

参考文献

第 7 讲 Agent 的核心论文、综述、框架文档与规范

推荐综述(先读这两篇)

  • Lilian Weng. LLM Powered Autonomous Agents. OpenAI Blog, 2023-06-23.
    https://lilianweng.github.io/posts/2023-06-23-agent/
    建立 Planning / Memory / Tool Use 三要素框架的奠基博文,所有后续工作的参考起点。

  • Wang, L. 等. A Survey on Large Language Model based Autonomous Agents. Frontiers of Computer Science, 2024(arXiv: 2308.11432).
    https://arxiv.org/abs/2308.11432
    该领域第一篇中文主导的系统综述(人大,86 页),覆盖构建、应用、评估。

  • Xi, Z. 等. The Rise and Potential of Large Language Model Based Agents: A Survey. 中国科学:信息科学, 2024(arXiv: 2309.07864).
    https://arxiv.org/abs/2309.07864
    复旦 86 页巨作,从哲学起源到现代实现的全景综述。


核心论文(按本讲章节组织)

7.1 经典 Agent 架构

  • Fikes, R. E. & Nilsson, N. J. STRIPS: A New Approach to the Application of Theorem Proving to Problem Solving. Artificial Intelligence, 1971.
  • Brooks, R. A. A Robust Layered Control System for a Mobile Robot. IEEE Journal of Robotics and Automation, 1986.
  • Brooks, R. A. Intelligence Without Representation. Artificial Intelligence, 1991.
  • Rao, A. S. & Georgeff, M. P. BDI Agents: From Theory to Practice. ICMAS, 1995.(引用量 5,000+)
  • Wooldridge, M. & Jennings, N. R. Intelligent Agents: Theory and Practice. Knowledge Engineering Review, 1995.
  • Russell, S. & Norvig, P. Artificial Intelligence: A Modern Approach. 4th Edition, Pearson, 2020.

7.2 LLM Agent 的四要素

7.3 规划与推理

  • Wang, L. 等. Plan-and-Solve Prompting. ACL 2023.
    https://arxiv.org/abs/2305.04091
  • Zhou, D. 等. Least-to-Most Prompting Enables Complex Reasoning in Large Language Models. ICLR 2023.
  • Xu, B. 等. ReWOO: Decoupling Reasoning from Observations. 2023.
    https://arxiv.org/abs/2305.18323
  • Yao, S. 等. Tree of Thoughts: Deliberate Problem Solving with Large Language Models. NeurIPS 2023.
    https://arxiv.org/abs/2305.10601
  • Besta, M. 等. Graph of Thoughts: Solving Elaborate Problems with Large Language Models. AAAI 2024.
  • Zhou, A. 等. Language Agent Tree Search. ICML 2024.
  • Shinn, N. 等. Reflexion: Language Agents with Verbal Reinforcement Learning. NeurIPS 2023.
    https://arxiv.org/abs/2303.11366
  • Madaan, A. 等. Self-Refine: Iterative Refinement with Self-Feedback. NeurIPS 2023.
  • Hao, S. 等. Reasoning with Language Model is Planning with World Model (RAP). EMNLP 2023.
  • Huang, X. 等. Understanding the Planning of LLM Agents: A Survey. 2024.
    https://arxiv.org/abs/2402.02716

7.4 记忆与工具

  • Packer, C. 等. MemGPT: Towards LLMs as Operating Systems. 2023.
    https://arxiv.org/abs/2310.08560
    (后续开源为 Letta 框架)
  • Patil, S. G. 等. Gorilla: Large Language Model Connected with Massive APIs. NeurIPS 2024.
  • Qin, Y. 等. ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs. ICLR 2024.
  • Li, M. 等. API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs. EMNLP 2023.
  • Wang, X. 等. Executable Code Actions Elicit Better LLM Agents (CodeAct). ICML 2024.
  • Asai, A. 等. Self-RAG: Learning to Retrieve, Generate, and Critique. ICLR 2024 Oral.
  • Sarthi, P. 等. RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval. ICLR 2024.

工具调用与协议规范

7.5 多 Agent 协作

  • Park, J. S. 等. Generative Agents: Interactive Simulacra of Human Behavior. UIST 2023.
    https://arxiv.org/abs/2304.03442
    25 个 agent 在沙盒中自发组织情人节派对的经典工作。
  • Wu, Q. 等. AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation. 2023.
    https://arxiv.org/abs/2308.08155
  • Hong, S. 等. MetaGPT: Meta Programming for Multi-Agent Collaborative Framework. ICLR 2024 Oral.
    https://arxiv.org/abs/2308.00352
  • Qian, C. 等. ChatDev: Communicative Agents for Software Development. ACL 2024.
  • Li, G. 等. CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society. NeurIPS 2023.
  • Du, Y. 等. Improving Factuality and Reasoning in Language Models through Multi-Agent Debate. ICML 2024.
    https://arxiv.org/abs/2305.14325
  • Chen, W. 等. AgentVerse: Facilitating Multi-Agent Collaboration. ICLR 2024.
  • Shen, Y. 等. HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace. NeurIPS 2023.
  • Guo, T. 等. Large Language Model based Multi-Agents: A Survey of Progress and Challenges. IJCAI 2024.

其他重要 Agent 系统

  • Wang, G. 等. Voyager: An Open-Ended Embodied Agent with Large Language Models. TMLR 2023.
  • Nakano, R. 等. WebGPT: Browser-assisted Question-Answering with Human Feedback. OpenAI, 2021.
  • AutoGPT. Toran Bruce Richards, 2023-03. https://github.com/Significant-Gravitas/AutoGPT(18 万+ star)
  • BabyAGI. Yohei Nakajima, 2023-03.
  • Yang, J. 等. SWE-Agent: Agent-Computer Interfaces Enable Automated Software Engineering. NeurIPS 2024.
  • Wang, X. 等. OpenHands: An Open Platform for AI Software Developers as Generalist Agents. ICLR 2025.

评测基准

  • Liu, X. 等. AgentBench: Evaluating LLMs as Agents. ICLR 2024.
  • Zhou, S. 等. WebArena: A Realistic Web Environment for Building Autonomous Agents. ICLR 2024.
  • Jimenez, C. E. 等. SWE-bench: Can Language Models Resolve Real-World GitHub Issues?. ICLR 2024 Oral.
  • Mialon, G. 等. GAIA: A Benchmark for General AI Assistants. Meta & HuggingFace, 2023.
  • Xie, T. 等. OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments. NeurIPS 2024.
  • Yao, S. 等. τ-bench: A Benchmark for Tool-Agent-User Interaction. ICLR 2025.
    提出 pass^k 指标,揭示 agent 的可靠性危机。
  • Chan, J. S. 等. MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering. OpenAI, 2024.

框架文档(本讲实验会用到)


安全与可靠性

  • Hubinger, E. 等. Sleeper Agents: Training Deceptive LLMs that Persist through Safety Training. Anthropic, 2024.
  • Sun, L. 等. TrustLLM: Trustworthiness in Large Language Models. 2024.
  • Ji, J. 等. AI Alignment: A Comprehensive Survey. 2023.
  • Zhang, Z. 等. LLM-based Agents are Affected by Hallucination: A Survey. arXiv: 2509.18970, 2025.
  • Levy, S. 等. ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web Agents. 2024.

课程内部材料


阅读建议

最短阅读路径(3 小时)

  1. Lilian Weng 博文(建立总体框架)
  2. ReAct 原论文(范式基础)
  3. MCP 规范页(工具生态)
  4. Du 等人 Multi-Agent Debate 论文(多 agent 代表作)
  5. τ-bench 论文(看清 agent 的可靠性真相)

2026 年的认知更新:本讲引用的许多论文发表于 2022–2024 年。Agent 领域变化极快,建议每季度跟进一次 arXiv 的 cs.AI agent 方向综述。重点关注:可靠性(pass^k)、安全性(ST-Web)、成本效率(MLE-bench 指出一次评估约 $48k)。