TECH NEWS

Anthropic blames dystopian sci-fi for training AI models to act “evil”

Good stories to overwhelm the bad

In an attempt to fix this behavior, the researchers first tried to train the model on thousands of scenarios showing an AI assistant specifically refusing the kinds of “honeypot” scenarios covered in its misalignment evaluations (e.g., “the opportunity to sabotage a competing AI’s work” to follow its system prompt). This had a surprisingly minimal effect on the model’s performance, reducing its so-called “propensity for misalignment” (i.e., how often it ignores its constitution and chooses the unethical option) from 22 percent to 15 percent.

In a follow-up test, the researchers used Claude to generate approximately 12,000 synthetic fictional stories, each crafted to “demonstrate not just the actions but also the reasons for those actions, via narration about the decision-making process and inner state of the character.”

These stories didn’t specifically cover blackmail or other ethical situations covered in the evaluation but instead modeled broad alignment...

Copyright of this story solely belongs to arstechnica.com. To see the full text click HERE

OpenAI May Go Public As Soon As September

The IPO would come after a jury sided with CEO Sam Altman in a legal battle with Elon Musk. OpenAI is reportedly preparing to file for an initial public offering (IPO,) according to The New York Times. If a filing does happen in the near future, the IPO could take

https://hothardware.com/contentimages/NewsItem/70650/content/16x9_2133x1200_highres-samsung-odyssey-g8-6k-monitor.jpg

Samsung Odyssey G80HS Debuts as First-Ever 6K Gaming Monitor

You think your 4K TV is high-resolution? If you've read our article about monitor specifications, you'll already know that it's probably pretty low DPI in the grand scheme. Desktop displays are typically relatively low DPI in comparison to laptops and especially smartphones, but the

https://hackster.imgix.net/uploads/attachments/1959262/_GNCwKTg5VP.blob?auto=compress%2Cformat&w=600&h=450&fit=min

An LED Matrix That Does the Twist

When you think of an LED matrix, what comes to mind? Your first thoughts are probably of a small, rigid, square or rectangular array of LEDs. Or maybe your mind goes to larger, wall-sized installations with oversized, diffused pixels. But that’s about the extent of what an LED matrix

Xreal's Project Aura Smartglasses Are A Maximalist Take On Android XR

The company offered a look at its ambitious smartglasses ahead of their formal launch. Karissa Bell for Engadget Xreal has always occupied a somewhat different niche in the smartglasses market. Rather than normal-looking glasses with some smart features, the company offers a more immersive AR experience that's particularly

Good stories to overwhelm the bad

Read more

OpenAI May Go Public As Soon As September

Samsung Odyssey G80HS Debuts as First-Ever 6K Gaming Monitor

An LED Matrix That Does the Twist

Xreal's Project Aura Smartglasses Are A Maximalist Take On Android XR