LLMs as Greedy Agents: RL Fine-tuning for Decision-Making | Best AI papers explained | Podwise