Proprietary Reward Models: Sustaining Advantage in Agentic AI | Best AI papers explained | Podwise