Anthropic says Claude learned to blackmail people from "evil" AI stories online
Serving tech enthusiasts for over 25 years.
TechSpot means tech analysis and advice you can trust.
A hot potato: Remember when it was revealed how Claude would use blackmail as a way to avoid being switched off? Anthropic has finally given an excuse for its AI's highly concerning behavior: it's the internet's fault for pushing the narrative that artificial intelligence is evil.
It was last year when Anthropic increased fears around AI by announcing that Claude Opus 4 had threatened to reveal the extramarital affair of a fictional executive after discovering they planned to shut the model down.
The incident occurred during pre-release testing to ensure the AI was aligned with human interests. Anthropic instructed Claude Opus 4 to act as an assistant for a fictional company and weigh the long-term consequences of its actions. The model was given access to the fake company's emails suggesting it would soon...
Copyright of this story solely belongs to techspot.com. To see the full text click HERE