11 Jan 2025
22m
Why Model Stops Learning: Grokking, Numerical Stability and Softmax Collapse #imperialcollegelondon
Srikanth Bhakthan
Open in Podwise to generate AI notes
Sign in to process this episode and unlock summaries, transcripts, highlights and translations.
Shownotes are not generated by Podwise.

