AI Model Release Tracker: Opus 4.8's misalignment rates similar to Claude Mythos Preview
Follow ZDNET: Add us as a preferred source on Google.
AI labs are shipping new models nonstop. Besides being better and faster than their predecessors, however, every new model isn't guaranteed to be a major step change, despite how the company's PR may wax poetic about them. Model strengths really emerge in context: Where are competitor models lacking or excelling? Which models have outstanding specialties, and which are just catching up to industry standards?
Also: How we test AI at ZDNET
Our Model Release Tracker helps you make sense of where models stand relative to each other, and whether they're worth a deeper look. While we don't test every model or model update on this list, we'll always include the key elements you need to know, along with our hands-on expert test, where applicable. We also include an Expert Score for certain models. Curious about how we test AI? Check...
Copyright of this story solely belongs to zdnet.com. To see the full text click HERE