In this two-part video series, Neel introduces transformers, the architecture behind modern language models, and aims to provide a deep understanding of how they work by coding one from scratch. Neel, a researcher in mechanistic interpretability, explains why understanding the internals of these models is crucial, especially as they achieve human-level language capabilities. The first part focuses on conceptually explaining transformers, their components, and their purpose, targeting those new to the topic but familiar with neural networks. Neel discusses the inputs (tokens) and outputs (logits) of a transformer, emphasizing that transformers are sequence modeling engines that process information in parallel at each sequence position and use attention to move information between positions. The tutorial also covers tokenization, embeddings, layer normalization, and positional information, providing a comprehensive overview of transformer architecture.
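The interface described above (token ids in, one vector of logits per sequence position out) can be illustrated with a minimal sketch. This is not the video's actual code; names such as ToyTransformer, d_model, and n_heads are illustrative assumptions, and the standard PyTorch modules stand in for the from-scratch components Neel builds.

```python
# Minimal sketch (assumed names, not the video's code): tokens -> logits,
# with embeddings, positional information, attention blocks, and layer norm.
import torch
import torch.nn as nn

class ToyTransformer(nn.Module):
    def __init__(self, vocab_size=1000, d_model=64, n_heads=4, n_layers=2, max_len=128):
        super().__init__()
        self.token_embed = nn.Embedding(vocab_size, d_model)   # tokens -> vectors
        self.pos_embed = nn.Embedding(max_len, d_model)        # positional information
        layer = nn.TransformerEncoderLayer(
            d_model, n_heads, dim_feedforward=4 * d_model,
            batch_first=True, norm_first=True)
        self.blocks = nn.TransformerEncoder(layer, n_layers)   # attention + MLP blocks
        self.final_norm = nn.LayerNorm(d_model)                # layer normalization
        self.unembed = nn.Linear(d_model, vocab_size)          # vectors -> logits

    def forward(self, tokens):                                 # tokens: (batch, seq_len) ints
        positions = torch.arange(tokens.shape[1], device=tokens.device)
        x = self.token_embed(tokens) + self.pos_embed(positions)
        # Causal mask: each position attends only to itself and earlier positions.
        mask = nn.Transformer.generate_square_subsequent_mask(tokens.shape[1])
        x = self.blocks(x, mask=mask)
        return self.unembed(self.final_norm(x))                # (batch, seq_len, vocab_size)

tokens = torch.randint(0, 1000, (1, 10))   # a batch with 10 token ids
logits = ToyTransformer()(tokens)          # one logit vector per position
print(logits.shape)                        # torch.Size([1, 10, 1000])
```

Every position is processed in parallel; only the attention layers inside the blocks move information between positions, which is the point the summary highlights.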