The lecture explores practical techniques for letting Large Language Models (LLMs) interact with external systems, focusing on Retrieval Augmented Generation (RAG), tool calling, and agents.

RAG augments prompts with relevant information retrieved from external knowledge bases, working around LLMs' knowledge cutoff dates and context length constraints. The discussion covers chunking strategies, embedding models (such as SentenceBERT), and retrieval methods, including semantic similarity search and BM25.

Tool calling lets LLMs complete tasks by accessing external resources through structured data and function APIs, exemplified by finding a teddy bear using location data.

The lecture also introduces agents: autonomous systems that pursue goals through iterative processes. It highlights the ReAct framework and agent-to-agent communication protocols, and addresses safety concerns such as data exfiltration.
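To make the retrieval side concrete, here is a minimal sketch of the BM25 scoring mentioned above, implemented from scratch over whitespace-tokenized documents. The corpus, the query, and the `bm25_scores` function name are illustrative assumptions, not part of the lecture; real systems typically use a search engine or a library rather than this hand-rolled version.

```python
import math
from collections import Counter

def bm25_scores(query_terms, docs, k1=1.5, b=0.75):
    """Score each document against the query with Okapi BM25.

    k1 controls term-frequency saturation; b controls length
    normalization. Tokenization here is naive lowercase splitting.
    """
    N = len(docs)
    tokenized = [doc.lower().split() for doc in docs]
    avgdl = sum(len(toks) for toks in tokenized) / N
    # Document frequency: in how many documents each term appears.
    df = Counter()
    for toks in tokenized:
        for term in set(toks):
            df[term] += 1
    scores = []
    for toks in tokenized:
        tf = Counter(toks)
        score = 0.0
        for term in query_terms:
            if term not in tf:
                continue
            idf = math.log((N - df[term] + 0.5) / (df[term] + 0.5) + 1)
            score += idf * tf[term] * (k1 + 1) / (
                tf[term] + k1 * (1 - b + b * len(toks) / avgdl))
        scores.append(score)
    return scores

# Toy corpus of chunks (hypothetical); the first chunk should win.
docs = [
    "rag augments prompts with retrieved passages",
    "tool calling lets the model invoke external apis",
    "agents pursue goals through iterative reasoning",
]
print(bm25_scores(["retrieved", "passages"], docs))
```

In a full RAG pipeline this lexical score is often combined with semantic similarity (e.g. cosine similarity over SentenceBERT embeddings) so that both exact-match and paraphrased chunks are retrieved.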
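The tool-calling and ReAct ideas above can be sketched as a single loop: the model emits either an action (a structured tool call) or a final answer, and each tool result is appended to the transcript as an observation. Everything here is a hypothetical stand-in: the `get_location` tool, the `Action:`/`Observation:` line format, and `scripted_model`, which replaces a real LLM API call with canned replies so the loop is runnable.

```python
import json

# Hypothetical tool registry: each tool maps an argument dict to a result.
TOOLS = {
    "get_location": lambda args: {"item": args["item"],
                                  "place": "under the bed"},
}

def scripted_model(transcript):
    """Stand-in for an LLM. A real agent would send `transcript`
    to a model API; here we emit one tool call, then an answer."""
    if "Observation:" not in transcript:
        return 'Action: {"tool": "get_location", "args": {"item": "teddy bear"}}'
    return "Final Answer: The teddy bear is under the bed."

def react_loop(question, model, max_steps=3):
    """ReAct-style loop: alternate model turns and tool observations."""
    transcript = f"Question: {question}"
    for _ in range(max_steps):
        reply = model(transcript)
        if reply.startswith("Final Answer:"):
            return reply[len("Final Answer:"):].strip()
        if reply.startswith("Action:"):
            call = json.loads(reply[len("Action:"):])
            observation = TOOLS[call["tool"]](call["args"])
            transcript += f"\n{reply}\nObservation: {json.dumps(observation)}"
    return "No answer within step budget."

print(react_loop("Where is the teddy bear?", scripted_model))
```

The step budget (`max_steps`) is one simple guard against runaway iteration; the safety concerns raised in the lecture, such as data exfiltration through tool arguments, would additionally require validating what the model is allowed to send to each tool.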