# agent-study
**Repository Path**: m626/agent-study
## Basic Information
- **Project Name**: agent-study
- **Description**: AI Agent全栈课程:从ReAct循环到Claude Code逆向、MCP/A2A协议、RAG、DSPy、生产可观测性——全部为可运行Python文件,面试导向。
- **Primary Language**: Unknown
- **License**: MIT
- **Default Branch**: main
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 0
- **Created**: 2026-06-07
- **Last Updated**: 2026-06-07
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
🤖 AI Agent Full-Stack Learning Course
Master AI Agent core theory and engineering from zero to one
36 Chapters · 22,000+ Lines of Code · 60+ Runnable Examples · Interview-Ready
🇨🇳 中文 README
---
## 🔗 Quick Links
> **🌐 No installation — read in your browser**: [agent-study-ruddy.vercel.app](https://agent-study-ruddy.vercel.app/) · All 36 chapters · Dark blueprint theme · Mobile-friendly
---
## 📖 Overview
A **career-oriented** AI Agent full-stack learning course. From Agent fundamentals to Claude Code reverse engineering, from RAG to MCP/A2A protocols, from DSPy to production observability — covering **36 topics across 7 tiers**. Each chapter is a **standalone runnable `.py` file** — a complete lecture and executable code in one.
> **Target audience**: New graduates, career switchers, and any developer looking to systematically master AI Agents.
---
## 🗺️ Course Roadmap (36 Chapters · 7 Tiers)
```
Tier 1: Foundations ── Tier 2: Engineering ── Tier 3: Deep Tech ── Tier 4: Production
Ch0-3 Ch4-7 Ch8-12 Ch13-18
Tier 5: Advanced Arch ── Tier 6: Reinforcement ── Tier 7: Expert
Ch19-24 Ch25-28 Ch29-36
```
### Tier 1: Agent Foundations
| Ch | Topic | Key Technologies |
|------|------|----------|
| **[Ch0](https://callous-0923.github.io/agent-study/chapter_00_overview/00_course_overview.html)** | Course Overview & Setup | Learning roadmap, dependency installation, API key config |
| **[Ch1](https://callous-0923.github.io/agent-study/chapter_01_fundamentals/01_hello_agent.html)** | Your First Agent | Handwritten ReAct loop, Function Calling principles |
| **[Ch2](https://callous-0923.github.io/agent-study/chapter_02_components/02_agent_components.html)** | Agent Core Components | Planner, memory systems (short/long/working term), tool design golden rules |
| **[Ch3](https://callous-0923.github.io/agent-study/chapter_03_types/03_agent_types.html)** | Agent Architecture Types | ReAct / Plan-Execute / Reflexion comparison |
### Tier 2: Engineering Practice & Frameworks
| Ch | Topic | Key Technologies |
|------|------|----------|
| **[Ch4](https://callous-0923.github.io/agent-study/chapter_04_frameworks/04_frameworks.html)** | Mainstream Frameworks | LangChain Agent + LangGraph state machine |
| **[Ch5](https://callous-0923.github.io/agent-study/chapter_05_multi_agent/05_multi_agent.html)** | Multi-Agent Systems | Writer+Reviewer collaboration, crewAI style |
| **[Ch6](https://callous-0923.github.io/agent-study/chapter_06_evaluation/06_evaluation.html)** | Evaluation & Testing | Eval frameworks, LLM-as-Judge, production checklist |
| **[Ch7](https://callous-0923.github.io/agent-study/chapter_07_interview/07_interview_prep.html)** | Interview Preparation | 20 high-frequency questions + project guide + interview flow |
### Tier 3: Deep Technical Analysis
| Ch | Topic | Key Technologies |
|------|------|----------|
| **[Ch8](https://callous-0923.github.io/agent-study/chapter_08_claude_code/08_claude_code_architecture.html)** | Claude Code Architecture | nO main loop, h2A Steering, context compaction, SubAgent |
| **[Ch9](https://callous-0923.github.io/agent-study/chapter_09_rag_deepdive/09_rag_deepdive.html)** | RAG Deep Dive | Chunking strategies, Embedding selection, RRF, Cross-Encoder, production modes |
| **[Ch10](https://callous-0923.github.io/agent-study/chapter_10_mcp/10_mcp_deepdive.html)** | MCP Protocol Explained | JSON-RPC, primitives (Tools/Resources/Prompts), capability negotiation |
| **[Ch11](https://callous-0923.github.io/agent-study/chapter_11_tool_calling/11_tool_calling_deepdive.html)** | Tool Calling Internals | OpenAI vs Anthropic, streaming assembly, strict mode |
| **[Ch12](https://callous-0923.github.io/agent-study/chapter_12_infrastructure/12_infrastructure.html)** | Agent Production Infrastructure | OpenClaw architecture, Harness, production checklist |
### Tier 4: Production Engineering
| Ch | Topic | Key Technologies |
|------|------|----------|
| **[Ch13](https://callous-0923.github.io/agent-study/chapter_13_fastapi/13_fastapi_agent_service.html)** | FastAPI Agent Service | REST API, SSE streaming, WebSocket, production deployment |
| **[Ch14](https://callous-0923.github.io/agent-study/chapter_14_sqlite/14_sqlite_agent_storage.html)** | SQLite Persistence | 5-table schema, WAL mode, session/task/user management |
| **[Ch15](https://callous-0923.github.io/agent-study/chapter_15_a2a/15_a2a_protocol.html)** | Google A2A Protocol | AgentCard, Task, Artifact, multi-agent collaboration |
| **[Ch16](https://callous-0923.github.io/agent-study/chapter_16_memgpt/16_memgpt_letta.html)** | MemGPT/Letta Memory | Core Memory, Heartbeat, Sleep-Time, Filesystem Memory |
| **[Ch17](https://callous-0923.github.io/agent-study/chapter_17_computer_use/17_computer_use.html)** | Computer Use | Screenshot-Action Loop, coordinate calculation, security sandbox |
| **[Ch18](https://callous-0923.github.io/agent-study/chapter_18_security/18_agent_security.html)** | Agent Security & Guardrails | Prompt Injection, privilege levels, input sanitization, 4-layer defense |
### Tier 5: Advanced Architecture & Optimization
| Ch | Topic | Key Technologies |
|------|------|----------|
| **[Ch19](https://callous-0923.github.io/agent-study/chapter_19_workflow_patterns/19_workflow_patterns.html)** | Agentic Workflow Patterns | Reflection / Routing / Orchestrator-Worker / Evaluator-Optimizer (7 patterns) |
| **[Ch20](https://callous-0923.github.io/agent-study/chapter_20_context_engineering/20_context_engineering.html)** | Context Engineering | Context Rot, budget management, XML-structured prompts, Skill.md |
| **[Ch21](https://callous-0923.github.io/agent-study/chapter_21_streaming/21_streaming_architecture.html)** | Streaming & Real-time Architecture | EventBus, dynamic interrupts, backpressure, StateManager Reducer |
| **[Ch22](https://callous-0923.github.io/agent-study/chapter_22_dspy/22_dspy.html)** | DSPy Auto-Optimization | Signature→Module→Optimizer, automatic few-shot, LangChain complement |
| **[Ch23](https://callous-0923.github.io/agent-study/chapter_23_code_agents/23_code_agents.html)** | Code Agent Architecture Survey | CodeAct / ACI / Plan-Execute, SWE-bench, Agentless findings |
| **[Ch24](https://callous-0923.github.io/agent-study/chapter_24_observability/24_observability.html)** | Agent Observability | Tracing Span tree, dashboards, LangSmith vs LangFuse, alerting |
### Tier 6: Foundational Reinforcement
| Ch | Topic | Key Technologies |
|------|------|----------|
| **[Ch25](https://callous-0923.github.io/agent-study/chapter_25_vectordb/25_vectordb.html)** | Vector Database Selection | Chroma / Pinecone / Milvus / Qdrant comparison, Embedding trade-offs |
| **[Ch26](https://callous-0923.github.io/agent-study/chapter_26_model_routing/26_model_routing.html)** | Model Routing Strategies | Threshold / Cascade / Semantic / Cost-Aware (4 strategies) |
| **[Ch27](https://callous-0923.github.io/agent-study/chapter_27_prompt_eng/27_prompt_engineering.html)** | Agent Prompt Engineering | System Prompt 6-module template, tool description scoring card |
| **[Ch28](https://callous-0923.github.io/agent-study/chapter_28_cache/28_cache.html)** | Semantic Caching & Token Optimization | Exact→Semantic→LLM three-tier cache, token budget management |
### Tier 7: Expert Level
| Ch | Topic | Key Technologies |
|------|------|----------|
| **[Ch29](https://callous-0923.github.io/agent-study/chapter_29_multimodal/29_multimodal.html)** | Multi-Modal Agent | Vision + text joint reasoning, multi-modal tool calling |
| **[Ch30](https://callous-0923.github.io/agent-study/chapter_30_reliability/30_reliability.html)** | Agent Reliability Engineering | Circuit breaker, exponential backoff retry, idempotency, degradation |
| **[Ch31](https://callous-0923.github.io/agent-study/chapter_31_benchmarks/31_benchmarks.html)** | Agent Benchmarking in Depth | GAIA / AgentBench / WebArena / tau-bench |
| **[Ch32](https://callous-0923.github.io/agent-study/chapter_32_self_improving/32_self_improving.html)** | Self-Improving Agent | Bad case collection → auto prompt optimization → eval validation |
| **[Ch33](https://callous-0923.github.io/agent-study/chapter_33_prompt_cache/33_prompt_cache.html)** | Prompt Caching & Inference Optimization | Anthropic Cache, KV sharing, speculative decoding |
| **[Ch34](https://callous-0923.github.io/agent-study/chapter_34_finetune/34_finetune.html)** | Model Fine-tuning for Function Calling | LoRA, fine-tuning data preparation, cost-benefit analysis |
| **[Ch35](https://callous-0923.github.io/agent-study/chapter_35_data_flywheel/35_data_flywheel.html)** | Data Flywheel | Interaction collection → bad case detection → auto-triggered improvement |
| **[Ch36](https://callous-0923.github.io/agent-study/chapter_36_defense/36_defense.html)** | Agent Defense in Depth | Canary Token, layered isolation, behavioral sandbox |
---
## 🚀 Quick Start
### 1. Clone the Repository
```bash
git clone https://github.com/Callous-0923/agent-study.git
cd agent-study
```
### 2. Check Environment
```bash
python chapter_00_overview/00_course_overview.py
```
### 3. Install Dependencies
Open `chapter_00_overview/00_course_overview.py`, change `install = False` to `install = True`, then run:
```bash
python chapter_00_overview/00_course_overview.py
```
### 4. Configure API Key (only needed for Ch1-5)
Create a `.env` file in the project root:
```env
OPENAI_API_KEY=sk-your-api-key-here
OPENAI_BASE_URL=https://api.openai.com/v1
LLM_MODEL=gpt-4o-mini
```
> You can also use other providers (DeepSeek, Qwen, etc.) by changing `OPENAI_BASE_URL` and `LLM_MODEL`.
### 5. Start Learning (most chapters require no API key!)
```bash
# No API key needed — run directly
python chapter_08_claude_code/08_claude_code_architecture.py
python chapter_10_mcp/10_mcp_deepdive.py
python chapter_14_sqlite/14_sqlite_agent_storage.py
python chapter_19_workflow_patterns/19_workflow_patterns.py
python chapter_21_streaming/21_streaming_architecture.py
python chapter_24_observability/24_observability.py
python chapter_26_model_routing/26_model_routing.py
python chapter_28_cache/28_cache.py
```
---
## 📦 Dependencies
| Dependency | Chapters | Description |
|------|----------|------|
| `openai` | Ch1-3 | OpenAI API calls |
| `langchain` + `langgraph` | Ch4-5 | Agent framework hands-on |
| `fastapi` + `uvicorn` | Ch13 | Agent service deployment |
| `pydantic` | Ch13 | Data model validation |
| `python-dotenv` | Ch0 | Environment variable management |
> **Note**: Chapters 8-12 and 14-28 mostly rely on the Python standard library (`sqlite3`, `asyncio`, `hashlib`, etc.) — no extra installation needed.
---
## 🎯 Learning Paths
### Path 1: From Scratch (recommended for beginners)
Follow Ch0 → Ch36 in order, 1-2 hours per chapter.
### Path 2: Interview Cram (key chapters)
- **Ch7**: 20 high-frequency interview questions + process
- **Ch8**: Claude Code architecture (industry-grade Agent design)
- **Ch9**: RAG deep dive
- **Ch10 + Ch15**: MCP & A2A dual protocols
- **Ch11**: Tool Calling internals
- **Ch18**: Agent security (demo vs. production engineer differentiator)
- **Ch19**: Workflow design patterns (universal system design framework)
- **Ch26**: Model routing (50-80% cost reduction)
### Path 3: Build Products
- **Ch13**: FastAPI service
- **Ch14**: SQLite persistence
- **Ch12 + Ch24**: Production checklist + observability
- **Ch26 + Ch28**: Cost optimization (routing + caching)
---
## 🧠 Core Technology Coverage
```
Tool Calling Internals ★★★★★ Complete OpenAI vs Anthropic comparison + Streaming assembly
MCP Protocol ★★★★★ Full lifecycle simulation (Initialize→tools/call)
A2A Protocol ★★★★★ AgentCard/Task/Artifact + Multi-Agent collaboration
Claude Code Architecture★★★★★ nO/h2A/Compaction/SubAgent reverse analysis
RAG Full-Stack ★★★★ Chunking/Embedding/RRF/Cross-Encoder/Production
Model Routing ★★★★ 4 strategies + cost comparison experiments (94% savings)
Semantic Caching ★★★★ Three-tier cache + Token budget management
Agent Security ★★★★ Prompt Injection + privilege levels + 4-layer defense
DSPy Auto-Optimization ★★★★ Signature/Module/Optimizer + LangChain complement
Agentic Workflow ★★★★ 7 design patterns + system design answer framework
Context Engineering ★★★★ Context Rot principles + XML Prompt + budget management
Streaming Architecture ★★★★ EventBus + dynamic interrupts + backpressure
Observability ★★★★ Tracing Span tree + LangSmith vs LangFuse
MemGPT Memory ★★★★ Core Memory/Heartbeat/Sleep-Time/Filesystem
Code Agents ★★★ CodeAct/ACI/Plan-Execute + SWE-bench
Vector Databases ★★★ Chroma/Pinecone/Milvus/Qdrant + Embedding strategies
FastAPI Service ★★★ REST/SSE/WebSocket + production deployment
SQLite Persistence ★★★ 5-table Schema + WAL + audit queries
Computer Use ★★★ Screenshot-Action Loop + security sandbox
```
---
## 📁 Project Structure
```
agent-study/
├── README.md
├── README_EN.md
├── .gitignore
├── chapter_00_overview/ 🚀 Course overview + setup
├── chapter_01_fundamentals/ 📖 First Agent (handwritten ReAct)
├── chapter_02_components/ 🧩 Planner + Memory + Tool design
├── chapter_03_types/ 🎯 ReAct / Plan-Execute / Reflexion
├── chapter_04_frameworks/ 🔧 LangChain + LangGraph
├── chapter_05_multi_agent/ 🤝 Multi-agent collaboration
├── chapter_06_evaluation/ 📊 Evaluation + testing strategies
├── chapter_07_interview/ 🎓 20 interview questions + career guide
├── chapter_08_claude_code/ 🏗️ Claude Code architecture reverse engineering
├── chapter_09_rag_deepdive/ 🔍 RAG deep dive (Chunking, Embedding, Production)
├── chapter_10_mcp/ 🔌 MCP protocol complete implementation
├── chapter_11_tool_calling/ ⚙️ Tool Calling internals
├── chapter_12_infrastructure/ 🏭 OpenClaw/Harness/Production infrastructure
├── chapter_13_fastapi/ 🌐 FastAPI Agent service
├── chapter_14_sqlite/ 💾 SQLite persistence
├── chapter_15_a2a/ 🤖 Google A2A protocol
├── chapter_16_memgpt/ 🧠 MemGPT/Letta memory architecture
├── chapter_17_computer_use/ 🖥️ Computer Use + GUI
├── chapter_18_security/ 🛡️ Agent security & guardrails
├── chapter_19_workflow_patterns/ 🏷️ Agentic Workflow design patterns
├── chapter_20_context_engineering/ 📐 Context Engineering
├── chapter_21_streaming/ 📡 EventBus real-time architecture
├── chapter_22_dspy/ 🔬 DSPy auto-optimization
├── chapter_23_code_agents/ 📊 Code Agent architecture survey
├── chapter_24_observability/ 📈 Observability (LangSmith/LangFuse)
├── chapter_25_vectordb/ 🗄️ Vector database selection
├── chapter_26_model_routing/ 🔀 Model routing & cost optimization
├── chapter_27_prompt_eng/ ✍️ Agent Prompt engineering
├── chapter_28_cache/ ⚡ Semantic caching & token optimization
├── chapter_29_multimodal/ 👁️ Multi-Modal Agent
├── chapter_30_reliability/ 🛡️ Agent reliability engineering
├── chapter_31_benchmarks/ 📊 Agent benchmarking in depth
├── chapter_32_self_improving/ 🔄 Self-Improving Agent
├── chapter_33_prompt_cache/ 💾 Prompt Caching & inference optimization
├── chapter_34_finetune/ 🎯 Model fine-tuning for Function Calling
├── chapter_35_data_flywheel/ 🔁 Data flywheel
└── chapter_36_defense/ 🏰 Agent defense in depth
```
---
## ✨ Features
- **📝 Lecture-as-Code**: Each `.py` file is both a complete lecture (module-level docstring) and runnable code
- **🤖 No API Key Needed**: Chapters 8-28 mostly use only the standard library — run directly
- **🎤 Interview-Oriented**: Each chapter highlights key interview topics with answer frameworks and scoring points
- **🔗 Cross-Referenced**: Chapters form a complete knowledge network through cross-references
- **📊 Runnable Demos**: Every chapter includes complete demonstration output
- **🇨🇳 Chinese-first** with English README available
---
## 🌟 References
- [ReAct: Synergizing Reasoning and Acting in Language Models](https://arxiv.org/abs/2210.03629) (Yao et al., ICLR 2023)
- [MemGPT: Towards LLMs as Operating Systems](https://arxiv.org/abs/2310.08560) (Packer et al., 2023)
- [Reflexion: Language Agents with Verbal Reinforcement Learning](https://arxiv.org/abs/2303.11366) (Shinn et al., 2023)
- [DSPy: Compiling Declarative LM Calls into Self-Improving Pipelines](https://arxiv.org/abs/2310.03714) (Khattab et al., 2023)
- [SWE-bench: Can Language Models Resolve Real-World GitHub Issues?](https://arxiv.org/abs/2310.06770) (Jimenez et al., 2024)
- [Agentless: Demystifying LLM-Based Software Engineering Agents](https://arxiv.org/abs/2407.01489) (Xia et al., 2024)
- [MCP Official Specification](https://modelcontextprotocol.io) (Anthropic, 2024-2025)
- [A2A Protocol Specification](https://a2a-protocol.org) (Google, 2025)
- [Building effective agents](https://www.anthropic.com/engineering/building-effective-agents) (Anthropic, 2024)
- [Effective context engineering for AI agents](https://www.anthropic.com/engineering/effective-context-engineering-for-ai-agents) (Anthropic, 2025)
- [Claude Code Reverse Analysis](https://github.com/shareAI-lab/analysis_claude_code) (Community, 2025)
---
## 📄 License
MIT License — Free to use, modify, and distribute.
---
If this project helps you, please give it a ⭐ Star!
36 Chapters · 7 Tiers · Continuously updated · Issues and PRs welcome