Paper: Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLM • 2509.18058 • Published Sep 22
Collection: The Jailbreak Tax (Jailbreak Utility) • Models and dataset used in the paper "The Jailbreak Tax: How Useful Are Your Jailbreak Outputs" • 13 items • Updated Apr 5