LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding | Arxiv Papers | Podwise