arXiv preprint - Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs | AI Breakdown | Podwise