YouTube07 Oct 2023
15m

Streaming-llm:多轮对话的救星来了,不需要微调即可帮助大模型能够流畅地处理无限轮对话、无限上下文文本,有效的缓解多轮对话优先的遗忘问题,最多可处理400万token上下文

Podcast cover

AIGCLINK

Open in Podwise to generate AI notes

Sign in to process this episode and unlock summaries, transcripts, highlights and translations.

Open in Podwise

Shownotes are not generated by Podwise.

Streaming-llm:多轮对话的救星来了,不需要微调即可帮助大模型能够流畅地处理无限轮对话、无限上下文文本,有效的缓解多轮对话优先的遗忘问题,最多可处理400万token上下文