On the Convergence and Stability of Upside-Down Reinforcement Learning, Goal-Conditioned Supervised | Xiaol.x | Podwise