13 Jun 2025

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective RL for LLM Reasoning

Xiaol.x

Xiaol.x - Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective RL for LLM Reasoning

Preview

How to Get Rich: Every EpisodeNaval