arxiv preprint - LLARVA: Vision-Action Instruction Tuning Enhances Robot Learning | AI Breakdown | Podwise