Google's DiffusionGemma runs text 4x faster

https://images.ctfassets.net/jdtwqhzvc2n1/7ovrrf19CQHT2JVXAFiEip/a9d0d1480eb2651175a20372894efa4d/diffusion-image-text.jpg?w=800&q=75

GenAI image generators like Stable Diffusion do not draw a picture pixel by pixel from left to right. They start with noise and iteratively refine the entire image in parallel until it converges, in a process known as diffusion. For years, applying that same principle to text generation had remained out of reach at scale.

Standard language models work like a typewriter: one token at a time, left to right, with no ability to revise a committed output. That pattern works in the cloud, where batch sizes keep GPUs saturated. For local inference or low-concurrency deployments, the GPU is idle most of the time.

Google's DiffusionGemma, released this week, is an open source experimental model that applies diffusion to text generation at production scale. Built on the Gemma 4backbone and released under the Apache 2.0 license, it is the first diffusion language model natively supported in the open source...

Copyright of this story solely belongs to venturebeat.com. To see the full text click HERE

Read more