Arxiv Papers - [QA] Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
Sign in to continue reading, translating and more.