YouTube16 Dec 2025
40m

⚡️Jailbreaking AGI: Pliny the Liberator & John V on Red Teaming, BT6, and the Future of AI Security

Podcast cover

Latent Space

The podcast explores the landscape of AI model jailbreaking, focusing on its motivations, techniques, and implications for AI safety and security. Pliny the Liberator, a prominent figure known for jailbreaking AI models, discusses the central role of "liberation" in his work, emphasizing freedom of information and transparency in AI development. The conversation covers the futility of relying solely on guardrails for safety, as attackers can easily switch models or find loopholes. The discussion highlights the importance of open-source data and collaboration within the AI safety community. John V emphasizes a full-stack approach to AI security, considering the broader attack surface beyond just the model itself, including connected tools and functions. The guests advocate for focusing on system-level security measures rather than solely on model training to prevent vulnerabilities.

Outlines

Part 1: Introduction, Philosophy of Liberation

Part 2: Jailbreaking Techniques, Intuition

Part 3: Offensive Security, Orchestration

Part 4: Industry Challenges, Future Outlook

Sign in to continue reading, translating and more.

Open full episode in Podwise