Xiaol.x - Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs