arxiv preprint - 4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities | AI Breakdown | Podwise