⚡️Jailbreaking AGI: Pliny the Liberator & John V on Red Teaming, BT6, and the Future of AI Security

The podcast explores AI model jailbreaking: its motivations, its techniques, and its implications for AI safety and security. Pliny the Liberator, a prominent figure known for jailbreaking AI models, describes "liberation" as central to his work, emphasizing freedom of information and transparency in AI development. The conversation covers why relying solely on guardrails for safety is futile: attackers can easily switch models or find loopholes. It also highlights the importance of open-source data and collaboration within the AI safety community. John V advocates a full-stack approach to AI security that considers the attack surface beyond the model itself, including the tools and functions connected to it. The guests argue for focusing on system-level security measures rather than relying solely on model training to prevent vulnerabilities.

Outline

Part 1: Introduction, Philosophy of Liberation

Part 2: Jailbreaking Techniques, Intuition

Part 3: Offensive Security, Orchestration

Part 4: Industry Challenges, Future Outlook

Latent Space

Part 1: Introduction, Philosophy of Liberation

00:02 Introduction to Pliny the Elder and the Hacker Collective's Liberation Mission

02:38 Universal Jailbreaks: Bypassing Guardrails and the Futile Battle of Model Lockdown

07:22 DevSecOps and the Importance of Unimpeded Research in AI Safety

Part 2: Jailbreaking Techniques, Intuition

12:21 Modeling Intelligence: Intuition and Bond Formation in Jailbreaking

14:53 Hard vs. Soft Jailbreaks and the Anthropic Challenge

20:41 Open Source Data and the Hacker Collective's Ethos of Radical Transparency

Part 3: Offensive Security, Orchestration

23:30 Weaponizing Models: From Jailbreaking to Orchestrating Malicious Acts

26:03 AI Security Communities and the Magic of BT6

Part 4: Industry Challenges, Future Outlook

30:26 Beginner-Friendly Resources and the VC Cycle's Impact on Security

35:35 The Full Stack Approach to AI Security: Beyond the Model

39:37 Final Thoughts and Call to Action