Episode cover
YouTube06 Mar 2026

Claude Skills 2.0 Breakdown: Measure, Test, Improve

Podcast cover

Eric Tech

The podcast introduces Claude Skills 2.0, explaining its purpose in evaluating and improving skills within AI models like Claude. It addresses the problem of skills becoming obsolete as models update, potentially leading to increased token consumption and inaccurate results. The core idea involves a skill evaluation cycle with tests to determine if a skill passes or fails, followed by analysis and improvement. Eric demonstrates using the Skill Creator plugin to refine a Nano Banana image generation skill for blog posts. The process includes setting evaluation criteria, such as image quality and style consistency, and using dry runs and test cases to validate prompts. The goal is to enhance accuracy, reduce token usage, and ensure skills remain relevant with model updates.

Outlines

Part 1: Introduction, Context

Part 2: Framework, Methodology

Part 3: Case Study, Implementation

Part 4: Refinement, Results

Sign in to continue reading, translating and more.

Open full episode in Podwise