Optimizing vLLM Performance through Quantization | Ray Summit 2024 | Anyscale | Podwise