Xiaol.x - Janus: Decoupling visual encoding for unified multimodal understanding and generation
Sign in to continue reading, translating and more.