Arxiv Papers - Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers
Sign in to continue reading, translating and more.