Byte Latent Transformer: Patches Scale Better Than Tokens (Paper Explained) | Yannic Kilcher | Podwise