arXiv preprint - Retentive Network: A Successor to Transformer for Large Language Models | AI Breakdown | Podwise