23 Feb 2025
8m

2025-02-23 | SWE-bench 数据集质疑:三成补丁存在答案泄露,编码评估基准待完善

Podcast cover

Hacker News

Open in Podwise to generate AI notes

Sign in to process this episode and unlock summaries, transcripts, highlights and translations.

Open in Podwise

Shownotes are not generated by Podwise.

2025-02-23 | SWE-bench 数据集质疑:三成补丁存在答案泄露,编码评估基准待完善