YouTube28 May 2024
11m

【人工智能】大语言模型评估基准七宗罪 | Jason Wei | 思维链作CoT作者 | 成功与否的标准 | 评估基准的七个错误 | 面临的挑战 | 测试集污染

Podcast cover

最佳拍档

Open in Podwise to generate AI notes

Sign in to process this episode and unlock summaries, transcripts, highlights and translations.

Open in Podwise

Shownotes are not generated by Podwise.

【人工智能】大语言模型评估基准七宗罪 | Jason Wei | 思维链作CoT作者 | 成功与否的标准 | 评估基准的七个错误 | 面临的挑战 | 测试集污染