John Dickerson discusses the evolution and future of AI evaluation, particularly focusing on why 2025 is poised to be the "year of the evals." He explains how AI monitoring and evaluation are interconnected and highlights the impact of ChatGPT's launch and subsequent budget shifts in enterprises. Dickerson points out that the rise of agentic systems, where AI makes decisions autonomously, necessitates robust evaluation. He emphasizes the importance of connecting AI products to downstream business KPIs and notes that C-suite executives are now more engaged with AI, driving the need for quantitative evaluation in areas like risk management and ROI. He also touches on the shift in the evaluation space towards monitoring agentic and multi-agent systems, and the increasing revenue in AI evaluation startups.
Sign in to continue reading, translating and more.
Continue