15 Mar 2025

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Xiaol.x

Xiaol.x - LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Preview

How to Get Rich: Every EpisodeNaval