Video models are zero-shot learners and reasoners | Xiaol.x | Podwise