AI incidents, audits, and the limits of benchmarks

The podcast explores the landscape of AI incidents, verification, and evaluation with Sean McGregor, co-founder and lead research engineer at the AI Verification and Evaluation Research Institute, and founder of the AI Incident Database. McGregor discusses the necessity of collecting usable datasets to motivate safety practices, drawing parallels with aviation and food safety industries. The conversation highlights the challenges in defining AI safety and incidents, especially with the rise of general-purpose AI systems. Sourcing information on AI incidents primarily relies on journalistic reporting, supplemented by direct submissions. McGregor emphasizes the importance of third-party audits and meta-evaluation of benchmarks to ensure the reliability and safety of AI models in real-world applications, referencing an incident involving a traffic citation issued to a woman due to a misread license plate.

Outlines

Part 1: Introduction, Guest Background

Part 2: Defining AI Incidents, Safety

Part 3: Auditing, Verification, Benchmarks

Part 4: Security, Safety, DEF CON Experiment

Part 5: Future Outlook, Conclusion

Sign in to continue reading, translating and more.

Continue

Practical AI

Part 1: Introduction, Guest Background

Introduction to Practical AI Podcast: Making AI Accessible and Productive

Introducing Sean McGregor: AI Incidents, Verification, and Evaluation

Sean McGregor's Journey: From Wildfire Suppression to AI Safety

Part 2: Defining AI Incidents, Safety

Defining AI Incidents and Safety Terminology

The Evolution and Scale of AI Incidents

Sourcing AI Incident Data and the Need for Mandatory Reporting

Part 3: Auditing, Verification, Benchmarks

Third-Party Auditing and the Importance of Verifying AI Safety Claims

Real-World AI Incident: The Case of the Misidentified License Plate

Analogies to Financial Audits: Building Trust in AI Systems

Meta-Evaluation of Benchmarks: Checking the Receipts in AI

Part 4: Security, Safety, DEF CON Experiment

Underestimated Risks: Security vs. Safety Perspectives in AI

DEF CON Experiment: Testing AI Models Against Hackers

Requiring Rigor: Why Statistics Matter in AI Security

Unexpected Failure Modes: Exploiting the Handoff Between Models

Value of the DEF CON Exercise: Clear Flaw Reporting

Part 5: Future Outlook, Conclusion

Future Directions: Scaling AI Safety and Measurement

Conclusion and Podcast Information

AI incidents, audits, and the limits of benchmarks

Practical AI

Part 1: Introduction, Guest Background

00:03Introduction to Practical AI Podcast: Making AI Accessible and Productive

Introduction to Practical AI Podcast: Making AI Accessible and Productive

00:48Introducing Sean McGregor: AI Incidents, Verification, and Evaluation

Introducing Sean McGregor: AI Incidents, Verification, and Evaluation

02:34Sean McGregor's Journey: From Wildfire Suppression to AI Safety

Sean McGregor's Journey: From Wildfire Suppression to AI Safety

Part 2: Defining AI Incidents, Safety

05:16Defining AI Incidents and Safety Terminology

Defining AI Incidents and Safety Terminology

08:55The Evolution and Scale of AI Incidents

The Evolution and Scale of AI Incidents

11:18Sourcing AI Incident Data and the Need for Mandatory Reporting

Sourcing AI Incident Data and the Need for Mandatory Reporting

Part 3: Auditing, Verification, Benchmarks

14:19Third-Party Auditing and the Importance of Verifying AI Safety Claims

Third-Party Auditing and the Importance of Verifying AI Safety Claims

17:10Real-World AI Incident: The Case of the Misidentified License Plate

Real-World AI Incident: The Case of the Misidentified License Plate

18:19Analogies to Financial Audits: Building Trust in AI Systems

Analogies to Financial Audits: Building Trust in AI Systems

20:47Meta-Evaluation of Benchmarks: Checking the Receipts in AI

Meta-Evaluation of Benchmarks: Checking the Receipts in AI

Part 4: Security, Safety, DEF CON Experiment

24:49Underestimated Risks: Security vs. Safety Perspectives in AI

Underestimated Risks: Security vs. Safety Perspectives in AI

27:23DEF CON Experiment: Testing AI Models Against Hackers

DEF CON Experiment: Testing AI Models Against Hackers

32:06Requiring Rigor: Why Statistics Matter in AI Security

Requiring Rigor: Why Statistics Matter in AI Security

34:17Unexpected Failure Modes: Exploiting the Handoff Between Models

Unexpected Failure Modes: Exploiting the Handoff Between Models

36:24Value of the DEF CON Exercise: Clear Flaw Reporting

Value of the DEF CON Exercise: Clear Flaw Reporting

Part 5: Future Outlook, Conclusion

38:57Future Directions: Scaling AI Safety and Measurement

Future Directions: Scaling AI Safety and Measurement

41:50Conclusion and Podcast Information

Conclusion and Podcast Information