Anthropic Disputes Fable 5 AI Jailbreak
Anthropic has disputed allegations of a prompt-based jailbreak affecting its recently launched Claude Fable 5 AI model, underscoring the robustness of the advanced classifier system and extensive red-teaming efforts that underpinned the model’s deployment.
Claude Fable 5 became generally available on Tuesday, when Anthropic introduced it as a powerful Mythos-class AI model with safeguards that restrict its use in high-risk domains such as cybersecurity, where Mythos has proved particularly potent.
In sensitive areas such as cybersecurity, where it could be abused to develop exploits, and biology, where it could be leveraged to develop bioweapons and chemical weapons, the model automatically falls back to the less capable Claude Opus 4.8.
Anthropic said it conducted extensive internal and external red-teaming to ensure that Fable 5 cannot be easily jailbroken.
However, shortly after its release, an individual with the online moniker Pliny the Liberator, who is known for AI jailbreaks, claimed to...
Copyright of this story solely belongs to securityweek.com. To see the full text click HERE