Microsoft Research - ALIGN: Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Sign in to continue reading, translating and more.