
Large language models are built in a multi-stage pipeline: text is split into tokens via byte-pair encoding, then a neural network is pre-trained on massive internet datasets and optimized to predict the next token. The resulting models are best understood as stochastic token simulators rather than sentient entities: a fixed context window serves as working memory, while static parameters store long-term knowledge. Despite solving complex Olympiad-grade problems, they exhibit "Swiss cheese" capabilities, often failing at simple tasks like counting or basic arithmetic because only a finite amount of computation is spent per token. Hallucinations and reasoning gaps can be mitigated by integrating external tools such as web search and code interpreters. Advanced "thinking" models, developed through reinforcement learning, further enhance performance by discovering emergent reasoning strategies, such as backtracking and re-evaluating intermediate steps, which allow them to solve problems beyond the limits of simple expert imitation.
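To make the tokenization step concrete, here is a minimal sketch of the byte-pair encoding idea: repeatedly find the most frequent adjacent pair of tokens and merge it into a single new token. This is an illustrative toy on characters, not the production byte-level tokenizers real models use; the function name and greedy tie-breaking are assumptions for the example.

```python
from collections import Counter

def byte_pair_merge(tokens, num_merges):
    """Toy BPE sketch: repeatedly merge the most frequent adjacent token pair.

    `tokens` is a sequence of strings (here, single characters); real
    tokenizers operate on bytes over a large training corpus.
    """
    tokens = list(tokens)
    merges = []  # record of learned merge rules, in order
    for _ in range(num_merges):
        # Count how often each adjacent pair occurs.
        pairs = Counter(zip(tokens, tokens[1:]))
        if not pairs:
            break
        (a, b), count = pairs.most_common(1)[0]
        if count < 2:
            break  # no pair repeats, nothing useful left to merge
        merges.append((a, b))
        # Replace every occurrence of the pair (a, b) with the merged token.
        merged, i = [], 0
        while i < len(tokens):
            if i + 1 < len(tokens) and tokens[i] == a and tokens[i + 1] == b:
                merged.append(a + b)
                i += 2
            else:
                merged.append(tokens[i])
                i += 1
        tokens = merged
    return tokens, merges

tokens, merges = byte_pair_merge("aaabdaaabac", 3)
print(tokens)  # ['aaab', 'd', 'aaab', 'a', 'c']
print(merges)  # [('a', 'a'), ('aa', 'a'), ('aaa', 'b')]
```

After three merges the recurring substring "aaab" has been compressed into one token, which is the core mechanism by which BPE builds a vocabulary of common character sequences.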