Enhancing the Reasoning Ability of Multimodal LLM via Mixed Preference Optimization | Xiaol.x | Podwise