参考文献

核心论文（按本讲章节组织）

7.1 经典 Agent 架构

Fikes, R. E. & Nilsson, N. J. STRIPS: A New Approach to the Application of Theorem Proving to Problem Solving. Artificial Intelligence, 1971.
Brooks, R. A. A Robust Layered Control System for a Mobile Robot. IEEE Journal of Robotics and Automation, 1986.
Brooks, R. A. Intelligence Without Representation. Artificial Intelligence, 1991.
Rao, A. S. & Georgeff, M. P. BDI Agents: From Theory to Practice. ICMAS, 1995.（引用量 5,000+）
Wooldridge, M. & Jennings, N. R. Intelligent Agents: Theory and Practice. Knowledge Engineering Review, 1995.
Russell, S. & Norvig, P. Artificial Intelligence: A Modern Approach. 4th Edition, Pearson, 2020.

7.2 LLM Agent 的四要素

Wei, J. 等. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. NeurIPS 2022.
https://arxiv.org/abs/2201.11903
Yao, S. 等. ReAct: Synergizing Reasoning and Acting in Language Models. ICLR 2023.
https://arxiv.org/abs/2210.03629
Schick, T. 等. Toolformer: Language Models Can Teach Themselves to Use Tools. NeurIPS 2023.
https://arxiv.org/abs/2302.04761
Karpas, E. 等. MRKL Systems: A Modular, Neuro-Symbolic Architecture. AI21 Labs, 2022.
https://arxiv.org/abs/2205.00445
"Modular Reasoning, Knowledge and Language"，工具使用范式的早期形式化。

7.3 规划与推理

Wang, L. 等. Plan-and-Solve Prompting. ACL 2023.
https://arxiv.org/abs/2305.04091
Zhou, D. 等. Least-to-Most Prompting Enables Complex Reasoning in Large Language Models. ICLR 2023.
Xu, B. 等. ReWOO: Decoupling Reasoning from Observations. 2023.
https://arxiv.org/abs/2305.18323
Yao, S. 等. Tree of Thoughts: Deliberate Problem Solving with Large Language Models. NeurIPS 2023.
https://arxiv.org/abs/2305.10601
Besta, M. 等. Graph of Thoughts: Solving Elaborate Problems with Large Language Models. AAAI 2024.
Zhou, A. 等. Language Agent Tree Search. ICML 2024.
Shinn, N. 等. Reflexion: Language Agents with Verbal Reinforcement Learning. NeurIPS 2023.
https://arxiv.org/abs/2303.11366
Madaan, A. 等. Self-Refine: Iterative Refinement with Self-Feedback. NeurIPS 2023.
Hao, S. 等. Reasoning with Language Model is Planning with World Model (RAP). EMNLP 2023.
Huang, X. 等. Understanding the Planning of LLM Agents: A Survey. 2024.
https://arxiv.org/abs/2402.02716

7.4 记忆与工具

Packer, C. 等. MemGPT: Towards LLMs as Operating Systems. 2023.
https://arxiv.org/abs/2310.08560
（后续开源为 Letta 框架）
Patil, S. G. 等. Gorilla: Large Language Model Connected with Massive APIs. NeurIPS 2024.
Qin, Y. 等. ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs. ICLR 2024.
Li, M. 等. API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs. EMNLP 2023.
Wang, X. 等. Executable Code Actions Elicit Better LLM Agents (CodeAct). ICML 2024.
Asai, A. 等. Self-RAG: Learning to Retrieve, Generate, and Critique. ICLR 2024 Oral.
Sarthi, P. 等. RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval. ICLR 2024.

工具调用与协议规范：

OpenAI Function Calling 文档: https://platform.openai.com/docs/guides/function-calling
Model Context Protocol (MCP) 规范（Anthropic, 2024-11）: https://modelcontextprotocol.io
"AI 的 USB-C"，2025 年已被 OpenAI、Google DeepMind 采用，2025-12 捐赠至 Linux Foundation。
MCP GitHub: https://github.com/modelcontextprotocol

7.5 多 Agent 协作

Park, J. S. 等. Generative Agents: Interactive Simulacra of Human Behavior. UIST 2023.
https://arxiv.org/abs/2304.03442
25 个 agent 在沙盒中自发组织情人节派对的经典工作。
Wu, Q. 等. AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation. 2023.
https://arxiv.org/abs/2308.08155
Hong, S. 等. MetaGPT: Meta Programming for Multi-Agent Collaborative Framework. ICLR 2024 Oral.
https://arxiv.org/abs/2308.00352
Qian, C. 等. ChatDev: Communicative Agents for Software Development. ACL 2024.
Li, G. 等. CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society. NeurIPS 2023.
Du, Y. 等. Improving Factuality and Reasoning in Language Models through Multi-Agent Debate. ICML 2024.
https://arxiv.org/abs/2305.14325
Chen, W. 等. AgentVerse: Facilitating Multi-Agent Collaboration. ICLR 2024.
Shen, Y. 等. HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace. NeurIPS 2023.
Guo, T. 等. Large Language Model based Multi-Agents: A Survey of Progress and Challenges. IJCAI 2024.

其他重要 Agent 系统

Wang, G. 等. Voyager: An Open-Ended Embodied Agent with Large Language Models. TMLR 2023.
Nakano, R. 等. WebGPT: Browser-assisted Question-Answering with Human Feedback. OpenAI, 2021.
AutoGPT. Toran Bruce Richards, 2023-03. https://github.com/Significant-Gravitas/AutoGPT（18 万+ star）
BabyAGI. Yohei Nakajima, 2023-03.
Yang, J. 等. SWE-Agent: Agent-Computer Interfaces Enable Automated Software Engineering. NeurIPS 2024.
Wang, X. 等. OpenHands: An Open Platform for AI Software Developers as Generalist Agents. ICLR 2025.

评测基准

Liu, X. 等. AgentBench: Evaluating LLMs as Agents. ICLR 2024.
Zhou, S. 等. WebArena: A Realistic Web Environment for Building Autonomous Agents. ICLR 2024.
Jimenez, C. E. 等. SWE-bench: Can Language Models Resolve Real-World GitHub Issues?. ICLR 2024 Oral.
Mialon, G. 等. GAIA: A Benchmark for General AI Assistants. Meta & HuggingFace, 2023.
Xie, T. 等. OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments. NeurIPS 2024.
Yao, S. 等. τ-bench: A Benchmark for Tool-Agent-User Interaction. ICLR 2025.
提出 pass^k 指标，揭示 agent 的可靠性危机。
Chan, J. S. 等. MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering. OpenAI, 2024.

框架文档（本讲实验会用到）

LangGraph 文档: https://langchain-ai.github.io/langgraph/
AutoGen 文档: https://microsoft.github.io/autogen/
CrewAI 文档: https://docs.crewai.com
smolagents 文档: https://huggingface.co/docs/smolagents
OpenAI Agents SDK: https://platform.openai.com/docs/guides/agents
Google Agent Development Kit (ADK): https://google.github.io/adk-docs/（2025-04 开源，Apache 2.0）
Microsoft Agent Framework 1.0: 2026-04 整合 AutoGen + Semantic Kernel

安全与可靠性

Hubinger, E. 等. Sleeper Agents: Training Deceptive LLMs that Persist through Safety Training. Anthropic, 2024.
Sun, L. 等. TrustLLM: Trustworthiness in Large Language Models. 2024.
Ji, J. 等. AI Alignment: A Comprehensive Survey. 2023.
Zhang, Z. 等. LLM-based Agents are Affected by Hallucination: A Survey. arXiv: 2509.18970, 2025.
Levy, S. 等. ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web Agents. 2024.

课程内部材料

AI智能体文献综述_从经典架构到LLM驱动的自主系统.md — 本讲的上游材料，覆盖从 STRIPS 到 MCP 的完整脉络，建议作为课后精读材料。

阅读建议

最短阅读路径（3 小时）：

Lilian Weng 博文（建立总体框架）
ReAct 原论文（范式基础）
MCP 规范页（工具生态）
Du 等人 Multi-Agent Debate 论文（多 agent 代表作）
τ-bench 论文（看清 agent 的可靠性真相）

2026 年的认知更新：本讲引用的许多论文发表于 2022–2024 年。Agent 领域变化极快，建议每季度跟进一次 arXiv 的 cs.AI agent 方向综述。重点关注：可靠性（pass^k）、安全性（ST-Web）、成本效率（MLE-bench 指出一次评估约 $48k）。

推荐综述（先读这两篇）

核心论文（按本讲章节组织）

7.1 经典 Agent 架构

7.2 LLM Agent 的四要素

7.3 规划与推理

7.4 记忆与工具

7.5 多 Agent 协作

其他重要 Agent 系统

评测基准

框架文档（本讲实验会用到）

安全与可靠性

课程内部材料

阅读建议

On this page