LW - instruction tuning and autoregressive distribution shift by nostalgebraist | The Nonlinear Library | Podwise