02 Apr 2024
12m
Demo: Optimizing Gemma inference on NVIDIA GPUs with TensorRT-LLM
Google for Developers
Open in Podwise to generate AI notes
Sign in to process this episode and unlock summaries, transcripts, highlights and translations.
Shownotes are not generated by Podwise.

