“Refusal mechanisms: initial experiments with Llama-2-7b-chat” by andyrdt, Oscar Balcells Obeso | LessWrong (30+ Karma) | Podwise