2025 is the Year of Evals! Just like 2024, and 2023, and … — John Dickerson, CEO Mozilla AI | AI Engineer

John Dickerson discusses the evolution and future of AI evaluation, particularly focusing on why 2025 is poised to be the "year of the evals." He explains how AI monitoring and evaluation are interconnected and highlights the impact of ChatGPT's launch and subsequent budget shifts in enterprises. Dickerson points out that the rise of agentic systems, where AI makes decisions autonomously, necessitates robust evaluation. He emphasizes the importance of connecting AI products to downstream business KPIs and notes that C-suite executives are now more engaged with AI, driving the need for quantitative evaluation in areas like risk management and ROI. He also touches on the shift in the evaluation space towards monitoring agentic and multi-agent systems, and the increasing revenue in AI evaluation startups.

Outlines

Sign in to continue reading, translating and more.

Continue

2025 is the Year of Evals! Just like 2024, and 2023, and … — John Dickerson, CEO Mozilla AI

AI Engineer

Introduction to AI Evaluation and Monitoring

The State of ML Monitoring Before the GenAI Revolution

The Impact of Economic Conditions and ChatGPT on AI Investment

Scaling AI Applications and the Growing Importance of Evaluation

The C-Suite Alignment on AI Evaluation and Future Trends

Q&A: Domain Expertise and the Role of LLMs in Evaluation

2025 is the Year of Evals! Just like 2024, and 2023, and … — John Dickerson, CEO Mozilla AI

AI Engineer

00:14Introduction to AI Evaluation and Monitoring

Introduction to AI Evaluation and Monitoring

04:17The State of ML Monitoring Before the GenAI Revolution

The State of ML Monitoring Before the GenAI Revolution

07:06The Impact of Economic Conditions and ChatGPT on AI Investment

The Impact of Economic Conditions and ChatGPT on AI Investment

10:14Scaling AI Applications and the Growing Importance of Evaluation

Scaling AI Applications and the Growing Importance of Evaluation

13:34The C-Suite Alignment on AI Evaluation and Future Trends

The C-Suite Alignment on AI Evaluation and Future Trends

16:04Q&A: Domain Expertise and the Role of LLMs in Evaluation

Q&A: Domain Expertise and the Role of LLMs in Evaluation