LLM-as-a-Judge Evals: Comparing Kimi, Qwen, and GLM | Together AI | Podwise