Google unveils Gemini Omni 'any-to-any' AI model: what enterprises should know
Although intrepid AI power users discovered it weeks ahead of today's official unveiling at Google's annual I/O developer conference, the company's new Gemini Omni model marks a significant new paradigm in the wider AI and tech marketplace.
That's because, as its "omni" prefix (from the Latin omne, meaning "all") suggests, this is Google's first truly native multimodal model, that is, "a model that can create anything from any input — starting with video."
The model marks Google's bid to collapse the multimodal generative stack — text-to-image, image-to-video, video-to-video, audio generation — into a single foundation model with a single editing surface.
The big question for business leaders is: should you switch any of your own AI stack over to Gemini Omni now?
Unfortunately, the truth is, you may not be able to just yet — the model is only available to individual users through...
Copyright of this story solely belongs to venturebeat.com.