YouTube19 May 2025
22m

J1: Incentivizing thinking LLM-as-a-judge via reinforcement learning #meta

Podcast cover

Srikanth Bhakthan

Open in Podwise to generate AI notes

Sign in to process this episode and unlock summaries, transcripts, highlights and translations.

Open in Podwise

Shownotes are not generated by Podwise.

J1: Incentivizing thinking LLM-as-a-judge via reinforcement learning #meta