Arxiv Papers - [short] Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Sign in to continue reading, translating and more.