“Evaluating honesty and lie detection techniques on a diverse suite of dishonest models” by Sam Marks, Johannes Treutlein, evhub, Fabien Roger | LessWrong (30+ Karma) | Podwise