How To Transform VISION Tokens to a Language Vector Space? | code_your_own_AI | Podwise