Google Gemini 1.5 Pro can now hear

Google's update for Gemini 1.5 Pro gives model ears. The model can now listen to uploaded audio files and produce information from things like earnings calls or audio from videos without having to refer to written text.

During the Google Next event, Google also announced that it will make the Gemini 1.5 Pro available to the public for the first time through its platform for building artificial intelligence applications, Vertex AI. The Gemini 1.5 Pro was first announced in February.

This new version of the Gemini Pro, which is supposed to be the middle-weight model of the Gemini family, actually outperforms the larger and more powerful model, the Gemini Ultra, in performance. Google claims that the Gemini 1.5 Pro can understand complex instructions and eliminates the need to fine-tune forms.

Gemini 1.5 Pro is not available to people who do not have access to Vertex AI. Right now, most people encounter Gemini language models through the Gemini chatbot. The Gemini Ultra runs the Gemini Advanced chat software, and while it's powerful and also capable of understanding long commands, it's not as fast as the Gemini 1.5 Pro.

The Gemini 1.5 Pro isn't the only big AI model from Google to get an update. Imagen 2, the text-to-image module that helps enhance Gemini's image generation capabilities, will also add in-draw and out-draw, allowing users to add or remove elements from images. Google has also made the SynthID digital watermark feature available on all images created through Imagen Forms. SynthID adds a watermark invisible to the viewer on images that identifies their source when viewed through the detector.

Google says it's also publicly previewing a way to base its AI responses using Google Search so they can answer with up-to-date information. This is not always a given with responses produced by large language models, sometimes even intentionally; Google intentionally blocked Gemini from answering questions related to the 2024 US elections.

Chase Elliott

“Infuriatingly humble music trailblazer. Gamer. Food enthusiast. Beeraholic. Zombie guru.”

Google Pixel 9 Pro official case leaks and promotional videos

There is no solution to the problem of Intel 13th and 14th Gen processors crashing — no permanent damage

Internal change in iPhone 16 models expected to reduce overheating

Google Pixel 9 Pro official case leaks and promotional videos

Italy’s famous ‘Love Path’ reopens after more than 12 years

Video of Francis Ford Coppola kissing extras from the movie “Megalopolis”

Leave a Reply Cancel reply

More Stories

Google Pixel 9 Pro official case leaks and promotional videos

There is no solution to the problem of Intel 13th and 14th Gen processors crashing — no permanent damage

Internal change in iPhone 16 models expected to reduce overheating

You may have missed

Google Pixel 9 Pro official case leaks and promotional videos

Italy’s famous ‘Love Path’ reopens after more than 12 years

Video of Francis Ford Coppola kissing extras from the movie “Megalopolis”