TECH NEWS

Google's latest DiffusionGemma open AI model comes with a 4x speed boost

Multiple paths to local efficiency

If diffusion is so much faster, why isn’t Google using it in big cloud-based Gemini models? Google has experimented with this, but there are a few drawbacks to text diffusion, including a higher error rate. In image diffusion models, a single badly predicted pixel doesn’t make the image useless, but language is discreet. An equivalent error in text can make a block of tokens meaningless and force you to start over to get a better output. Diffusion models also waste resources when the desired output is only a few tokens long. They have to do a lot more parallel work to whittle down to a few tokens that an autoregressive model does from beginning to end in just five steps.

The efficiency gain for local processing makes this an appealing avenue of experimentation, though. In the cloud, autoregressive models can batch large numbers of...

Copyright of this story solely belongs to arstechnica.com. To see the full text click HERE

How Inscribe uses Amazon Bedrock to stop document fraud in seconds | Amazon Web Services

Krafton settles with Subnautica 2 developer after drawn-out dispute over $250 million

Cloudflare’s new policy pushes AI companies to pay for publishers’ content

GenAI.mil records almost 1.7M users, plans new model additions

Multiple paths to local efficiency

Read more

How Inscribe uses Amazon Bedrock to stop document fraud in seconds | Amazon Web Services

Krafton settles with Subnautica 2 developer after drawn-out dispute over $250 million

Cloudflare’s new policy pushes AI companies to pay for publishers’ content

GenAI.mil records almost 1.7M users, plans new model additions