[QA] Evaluating Numerical Reasoning in Text-to-Image Models | Arxiv Papers | Podwise