TECH NEWS

Why Weibo’s tiny VibeThinker-3B has the AI world arguing over benchmarks again

On Sunday, a team of nine researchers at Sina Weibo — the Chinese social media giant better known for its microblogging platform than for cutting-edge artificial intelligence — quietly posted a 14-page technical report to arXiv that sent shockwaves through the AI research community. Their claim: a language model with just 3 billion parameters can match or exceed the reasoning performance of flagship systems from Google DeepMind, OpenAI, Anthropic, and DeepSeek that are hundreds of times larger.

The model, called VibeThinker-3B, scored 94.3 on AIME 2026 — the American Invitational Mathematics Examination, one of the most demanding standardized math competitions in the world. That figure places it alongside DeepSeek V3.2, a model with 671 billion parameters, and ahead of Gemini 3 Pro, Google's high-performance flagship reasoning system, which scored 91.7. With a test-time scaling technique the team calls Claim-Level Reliability Assessment, the score climbs...

Copyright of this story solely belongs to venturebeat.com. To see the full text click HERE

Why Weibo’s tiny VibeThinker-3B has the AI world arguing over benchmarks again

Read more

Microsoft cancels Patch Tuesday for some Dell users over surprise shutdowns, overheating devices

Boston Dynamics tests Spot as a delivery assistant that carries packages

Inkjet-printed OLED screens are reportedly in mass production at long last — here's why that's great news for monitor, laptop and even TV pricing

US charges Russian ‘bulletproof’ web hosts over cyberattacks that netted $62M from cybercrime victims

Read more

Microsoft cancels Patch Tuesday for some Dell users over surprise shutdowns, overheating devices

Boston Dynamics tests Spot as a delivery assistant that carries packages

Inkjet-printed OLED screens are reportedly in mass production at long last &mdash; here's why that's great news for monitor, laptop and even TV pricing

US charges Russian ‘bulletproof’ web hosts over cyberattacks that netted $62M from cybercrime victims

Inkjet-printed OLED screens are reportedly in mass production at long last — here's why that's great news for monitor, laptop and even TV pricing