AI Agent 残酷共学

介绍

本轮残酷共学聚焦 「AI Agent」 —— 由大型语言模型（LLM）驱动、能自主规划、调用工具并完成任务的新一代智能体。我们将以顶尖课程 CS598 Topics in LLM Agents（UIUC）中涵盖的前沿研究论文为主要学习材料，结合最新研究进展，在三周高强度的论文研读、总结与讨论中，一起深入理解 Agent 的核心概念、关键能力、主流框架及未来挑战。本次共学将特别强调论文的精读、理解与个人观察的总结。

共学目标

深入理解：系统性学习并掌握 LLM Agent 在推理、记忆、规划、工具调用、多模态处理等关键模块的核心论文和主流方法。
总结观察：针对每周研读的论文，独立完成思考，总结个人观察、批判性见解和潜在的启发。
批判与展望：基于论文学习，评估当前 Agent 技术的局限性（如数据、对齐、安全、人机协作等），并对下一代 Agent / AGI 的发展方向形成自己的思考与展望。

关键词

AI Agent, LLM, Tool Use, Planning, Reasoning, Paper Reading, Self-Reflection, Memory

面向人群

想系统性学习或快速深入理解 Agent 相关前沿研究的 Web3 / AI 开发者、产品经理、研究者
有志于将 LLM Agent 的理论知识应用于实际业务流程、DAO 治理或创意生产的实践者
喜欢高密度论文阅读、深度思考、总结提炼与热烈讨论的共学型选手

报名时间

开始时间：2025-05-25
结束时间：2025-05-31

共学时间

开始时间：2025-06-01
结束时间：2025-06-21

发起人

姓名：Dex
GitHub ID：dexhunter
Telegram：dexhunt3r
Email：[email protected]

发起组织

LXDAO

请假规则

每周请假 2 次

社群

Telegram：https://t.me/LXDAO/24843

学习资料/课程安排

提示：核心阅读量较大，请务必提前预习。鼓励深入研读指定论文，并积极总结个人观察与思考。

Week 1: AI Agent 基础、核心能力（推理、规划与记忆）

核心议题： 理解 AI Agent 的基本概念、发展现状以及构成 Agent 智能的关键能力。

必读论文：

Overview & Foundations:
- Language Agents: Foundations, Prospects, and Risks (Slides) (了解LLM Agent的基本构成、潜力与风险)
- Section 1-3 of How far are we from AGI? (对AGI的探讨及LLM Agent在其中的定位)
Reasoning (推理):
- ReAct: Synergizing Reasoning and Acting in Language Models (经典范式：结合推理与行动)
- Tree of Thoughts: Deliberate Problem Solving with Large Language Models (通过思维树进行更复杂的推理)
Planning (规划):
- LLM+P: Empowering Large Language Models with Optimal Planning Proficiency (LLM与规划器结合)
- Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models (LATS框架)
Memory (记忆):
- Cognitive Architectures for Language Agents (认知架构视角下的Agent记忆)
- HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models (受神经生物学启发的长时记忆)

选读/辅助理解：

Week 2: Agent 能力拓展（多模态）、框架（工具使用、RAG、多智能体）与评估

核心议题： 探索 Agent 在多模态理解方面的能力，学习构建和增强 Agent 的关键框架，并了解如何评估 Agent 的性能。

必读论文：

Multimodal Understanding (多模态理解):
- Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs (多模态LLM的视觉短板)
- VisualWebArena: Evaluating Multimodal Agents on Realistic Visually Grounded Web Tasks (多模态Agent在真实网络任务上的评估)
Tool Use (工具使用):
- ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs (让LLM掌握大量真实API)
- Gorilla: Large Language Model Connected with Massive APIs (LLM与大规模API的连接)
Retrieval-Augmented Generation (RAG - 检索增强生成):
- Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity (自适应RAG)
- Corrective Retrieval-Augmented Generation (纠正性RAG)
Multi-Agent Systems (多智能体系统):
- AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework (通过多智能体对话构建应用)
- CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society (探索LLM社会性的交互式智能体)
Agent Evaluation (Agent评估):
- Autonomous Evaluation and Refinement of Digital Agents (数字Agent的自主评估与改进)
- Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference (基于人类偏好的LLM评估平台)

选读/辅助理解：

Week 3: Agent 应用、挑战与未来展望

核心议题： 了解 AI Agent 在不同领域的具体应用，深入探讨其面临的关键挑战（如数据、安全、对齐），并展望其向 AGI 发展的路径。

必读论文：

Agent Application (Agent应用):
- Auto-research: ResearchTown: Simulator of Human Research Community (模拟人类研究社区)
- Coding Agents: If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents (代码如何赋能LLM作为智能体)
- Social Agents: Generative Agents: Interactive Simulacra of Human Behavior (可交互的人类行为模拟)
- Gaming Agents: Voyager: An Open-Ended Embodied Agent with Large Language Models (开放式具身智能体)
Challenges from Agents to AGI (从Agent到AGI的挑战):
- Data: BAGEL: Bootstrapping Agents by Guiding Exploration with Language (通过语言引导探索来引导Agent)
- Safety:
  - Universal and Transferable Adversarial Attacks on Aligned Language Models (对对齐语言模型的通用和可迁移对抗攻击)
  - DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models (GPT模型可信度的综合评估)
- Alignment:
  - Training Language Models to Follow Instructions with Human Feedback (RLHF经典论文)
  - Direct Preference Optimization: Your Language Model is Secretly a Reward Model (DPO)
Future Perspectives & Broader Impact:
- Section 4-7 of How far are we from AGI? (再次回顾AGI的讨论，结合已学内容进行思考)

选读/辅助理解：

The Emerged Security and Privacy of LLM Agent: A Survey with Case Studies
Position: A Roadmap to Pluralistic Alignment
Why Johnny Can’t Prompt: How Non-AI Experts Try (and Fail) to Design LLM Prompts (关于人机交互与Prompt工程的思考)

拓展资源

UIUC CS598 Topics in LLM Agents 完整 Slides & Paper List
OpenAI Cookbook (包含Agent相关代码示例)
LangGraph 官方教程 (学习Agentic应用的构建)

共学激励

让我们一起在三周里「快速变态成长」，把纸面知识内化为深刻洞见，共同探索 AI Agent 的无限可能 🚀

打卡成功者福利：https://www.notion.so/lxdao/3eab258b4df44c9cb97319452b2be13b

报名和打卡规则

因为残酷共学的报名和打卡是基于 GitHub 进行开展的，如果你是非开发者或者对 git 操作不熟悉，请先阅读此文档：残酷共学 GitHub 新手教程

报名:
- Step01：Fork 本仓库。
- Step02：复制 Template.md 创建你的个人笔记文件，并根据文档指引填写你的信息，并将文件重命名为你的 GitHub ID：YourGitHubID.md。
- Step03：创建一个 PR 到当前仓库，本残酷共学助教会对你的 PR 进行 review，review 通过后，你的 PR 会被 merge 到 main 分支，这个时候你会收到邀请加入这个仓库 contribution 的邮件，接受邀请后，你会自动获得 main 分支的 push 权限。
- Step04：完成以上三个步骤，恭喜你报名成功，后续就可以将你的学习记录直接 push 到 main 分支进行更新。
- 请加入 https://t.me/LXDAO/24843 群组保持交流。加入群组后请在群里报到一下方便助教记录。
打卡：
- 报名成功后，你将拥有 main 分支的 push 权限，你需要将每天学习笔记按日期更新到你的 YourName.md 文档中，提交更新后，我们会自动更新你的打卡状态到下面的打卡记录表。
- 如果你不在 UTC+8 时区，需要添加时区 code 到你的 YourName.md 文件的开始，错误的时区设置可能会使自动化打卡脚本错误计算打卡时间，具体请参考：https://github.com/IntensiveCoLearning/template/blob/main/Template.md?plain=1#L1
- 当你提交笔记时，请确保以下几点，否则打卡可能会失败：
  - 在 YourName.md 文档，请将笔记内容放到以下代码块中，且  和  不能删除:
```

### 日期
笔记内容

```
  - 日期格式为 ### 2024.07.11，请不要随意更改

残酷共学打卡记录表

✅ = Done ⭕️ = Missed ❌ = Failed

Name	6.01	6.02	6.03	6.04	6.05	6.06	6.07	6.08	6.09	6.10	6.11	6.12	6.13	6.14
dexhunter	✅	✅	✅	✅	✅	✅	✅	✅	✅	✅	✅	⭕️	⭕️	❌
Vap0RSh4wN	⭕️	⭕️	✅	✅	✅	⭕️	❌
universe-ron	✅	✅	✅	✅	✅	✅	✅	✅	✅	✅	✅	✅	✅	⭕️
fuhaooo	⭕️	⭕️	⭕️	❌
chesley666	✅	✅	✅	⭕️	⭕️	✅	✅	❌
StarryDesert	⭕️	⭕️	⭕️	❌
apkaisaw	✅	⭕️	⭕️	❌
MartinYeung5	⭕️	⭕️	⭕️	❌
shendongyang05	⭕️	⭕️	⭕️	❌
Bitpeng-YT	✅	✅	✅	⭕️	⭕️	❌
CHENFANGC	⭕️	⭕️	⭕️	❌
fffuuuming	✅	✅	⭕️	✅	✅	✅	⭕️	✅	✅	✅	✅	✅	✅	✅
nocb	✅	✅	⭕️	⭕️	❌
voidpainter	⭕️	✅	⭕️	✅	⭕️	❌

统计数据

总参与人数: 0
完成人数: 0
完成用户:
全勤用户:
淘汰人数: 0
淘汰率: 0.00%
Fork人数: 0

Name		Name	Last commit message	Last commit date
Latest commit History 203 Commits
.github/workflows		.github/workflows
Bitpeng-YT.md		Bitpeng-YT.md
CHENFANGC.md		CHENFANGC.md
MartinYeung5.md		MartinYeung5.md
README.md		README.md
StarryDesert.md		StarryDesert.md
Template.md		Template.md
Vap0RSh4wN.md		Vap0RSh4wN.md
apkaisaw.md		apkaisaw.md
censhui		censhui
chesley666.md		chesley666.md
dexhunter.md		dexhunter.md
fffuuuming.md		fffuuuming.md
fuhaooo.md		fuhaooo.md
nocb.md		nocb.md
shendongyang05.md		shendongyang05.md
sync_status_readme.py		sync_status_readme.py
universe-ron.md		universe-ron.md
voidpainter.md		voidpainter.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AI Agent 残酷共学

介绍

共学目标

关键词

面向人群

报名时间

共学时间

发起人

发起组织

请假规则

社群

学习资料/课程安排

Week 1: AI Agent 基础、核心能力（推理、规划与记忆）

Week 2: Agent 能力拓展（多模态）、框架（工具使用、RAG、多智能体）与评估

Week 3: Agent 应用、挑战与未来展望

拓展资源

共学激励

更多信息

报名和打卡规则

残酷共学打卡记录表

统计数据

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 17

Uh oh!

Languages

IntensiveCoLearning/ai-agent

Folders and files

Latest commit

History

Repository files navigation

AI Agent 残酷共学

介绍

共学目标

关键词

面向人群

报名时间

共学时间

发起人

发起组织

请假规则

社群

学习资料/课程安排

Week 1: AI Agent 基础、核心能力（推理、规划与记忆）

Week 2: Agent 能力拓展（多模态）、框架（工具使用、RAG、多智能体）与评估

Week 3: Agent 应用、挑战与未来展望

拓展资源

共学激励

更多信息

报名和打卡规则

残酷共学打卡记录表

统计数据

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 17

Uh oh!

Languages

Packages