Provable Long-Range Benefits of Next-Token Prediction | Best AI papers explained | Podwise