CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents Paper • 2601.09923 • Published 23 days ago • 4
Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLM Paper • 2509.18058 • Published Sep 22, 2025 • 12
The Jailbreak Tax (Jailbreak Utility) Collection Models and dataset used in paper "The Jailbreak Tax: How Useful Are Your Jailbreak Outputs" • 13 items • Updated Apr 5, 2025 • 2