arXiv preprint - Weight subcloning: direct initialization of transformers using larger pretrained ones | AI Breakdown | Podwise