End-to-end Reinforcement Learning for the Large-scale Traveling Salesman Problem | Microsoft Research | Podwise