Feds freaked over Fable 5 after simple 'fix this code' prompt, not jailbreak, says researcher
According to the one person who actually read the research paper
The “jailbreak” that prompted the Trump administration to block Anthropic’s most advanced models was actually a simple three-word prompt: “Fix this code.”
That's according to Katie Moussouris, founder and CEO of Luta Security, and the fairy godmother of bug bounties. She says she was the only outside expert to read the third-party research paper on Fable 5 guardrail bypass techniques that prompted the ban.
On Friday, the US government, reportedly citing national security concerns, issued an export control directive to suspend access to Fable 5 and Mythos 5 by any foreign national, inside or outside the United States. In response, Anthropic disabled both models “for all our customers to ensure compliance.”
Anthropic shared the report privately with her, Moussouris wrote in a Monday blog post.
The outside researchers reportedly fed Anthropic’s Fable 5, Mythos, and...
Copyright of this story solely belongs to theregister.com. To see the full text click HERE