Arxiv Papers - [QA] Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress?