Dead-end Discovery: How offline reinforcement learning could assist healthcare decision-makers | Microsoft Research | Podwise