Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing | Microsoft Research | Podwise