The top AI announcements from Google I/O

Topics

modish

Amazon

Image Credits:Google

Apps

Biotech & Health

clime

Sundar Pichai onstage at Google IO

Image Credits:Google

Cloud Computing

mercantilism

Crypto

Gemini

Image Credits:Google / Google

Enterprise

EVs

Fintech

Veo

Image Credits:Google

Fundraising

gismo

stake

Image Credits:TechCrunch

Google

Government & Policy

Hardware

Image Credits:TechCrunch

Instagram

Layoffs

Media & Entertainment

Image Credits:Google

More from TechCrunch

result

Startup Battlefield

StrictlyVC

Podcasts

Videos

Partner Content

TechCrunch Brand Studio

Crunchboard

Google ’s going all in on AI — and it want you to know it . During the company ’s tonic at its I / O developer group discussion on Tuesday , Google mentioned “ AI”more than 120 time . That ’s a lot !

But not all of Google ’s AI declaration were substantial per se . Some were incremental . Others were retrograde . So to serve sieve the pale yellow from the straw , we rounded up the top Modern AI production and feature unveiled at Google I / atomic number 8 2024 .

Generative AI in Search

Google plans to utilize generative AI toorganize entire Google Search results page .

What will AI - devise pages see like ? Well , it count on the hunting query . But they might show AI - return sum-up of limited review , discussion from social media sites like Reddit and AI - generate lists of suggestion , Google say .

For now , Google plan to show AI - enhanced resultant role page when it detects a user is looking for inspiration — for example , when they ’re trip planning . shortly , it ’ll also show these results when users search for dining options and recipes , with results for motion picture , books , hotel , tocopherol - mercantilism and more to come .

Project Astra and Gemini Live

Google isimproving its AI - powered chatbot Geminiso that it can well sympathize the world around it .

The company previewed a raw experience in Gemini called Gemini Live , which lets exploiter have “ in - depth ” vocalisation chats with Gemini on their smartphones . Users can disrupt Gemini while the chatbot ’s speaking to ask elucidate interrogative , and it ’ll adapt to their speech patterns in real clip . And Gemini can see and respond to user ’ surround , either via photos or video recording captured by their smartphones ’ television camera .

Join us at TechCrunch Sessions: AI

Exhibit at TechCrunch Sessions: AI

Gemini Live — which wo n’t launch until afterward this twelvemonth — can answer question about things within survey ( or lately within horizon ) of a smartphone ’s camera , like which neck of the woods a user might be in or the name of a part on a broken bicycle . The technical foundation motor Live stem in part from Project Astra , a new enterprisingness within DeepMind to make AI - power apps and “ agents ” for real - clip , multimodal understanding .

Google Veo

Google ’s gunning for OpenAI’sSorawithVeo , an AI poser that can create 1080p video clips around a minute long when return a text prompting .

Veo can capture unlike visual and cinematic styles , admit barb of landscapes and prison term reverting , and make edits and adjustments to already generated footage . The model understands camera move and VFX reasonably well from prompts ( think descriptors like “ pan , ” “ zoom ” and “ detonation ” ) . And Veo has fairly of a grasp on physics — thing like fluid dynamic and gravitational force — which contribute to the realness of the videos it generates .

Veo also supports masked redaction for change to specific area of a video and can generate TV from a still image , à la generative exemplar likeStability AI ’s static Video . Perhaps most intriguing , given a sequence of command prompt that together tell a story , Veo can generate longer videos — videos beyond a minute in distance .

Ask Photos

Google Photos is getting an AI extract with the launching of an data-based feature article calledAsk photograph , power by Google ’s Gemini family of reproductive AI models .

inquire Photos , which will roll out later this summer , will allow substance abuser to search across their Google Photos appeal using natural language interrogation that leverage Gemini ’s reason of their photo ’s subject — and other metadata .

For instance , instead of search for a specific thing in a photograph , such as “ One World Trade , ” users will be able to perform much more broad and complex search , like finding the “ ripe photo from each of the National Parks I visited . ” In that example , Gemini would use signals such as light , blurriness and lack of screen background distortion to determine what make a picture the “ good ” in a given set and mix that with an understanding of the geolocation info and dates to devolve the relevant image .

Gemini in Gmail

Gmail users will soon be able-bodied tosearch , summarize and conscription emails , courtesy of Gemini — as well as take action on emails for more complex tasks , like serve process returns .

In one demonstration at I / O , Google showed how a parent could catch up on what was going on at their nestling ’s shoal by ask Gemini to resume all the late e-mail from the school . In summation to the body of the emails , Gemini will also examine attachments , such as PDFs , and spit out a summary with key points and action item .

From a sidebar in Gmail , substance abuser can ask Gemini to aid them form revenue from their emails and even put them in a Google Drive folder , or extract entropy from the receipts and paste it into a spreadsheet . If that ’s something you do often — for instance , as a business organization traveler tracking disbursal — Gemini can also offer to automatize the workflow for enjoyment in the future .

Detecting scams during calls

Googlepreviewed an AI - powered featureto alert users to potential scams during a call .

The capableness , which will be built into a future version of Android , uses Gemini Nano , the small version of Google ’s procreative AI offering , which can be scarper wholly on - gadget , to listen for “ conversation practice usually associated with scams ” in actual fourth dimension .

No specific release day of the month has been set for the feature . Like many of these things , Google is previewing how much Gemini Nano will be able to do down the road . We do sleep with , however , that the characteristic will be opt - in — which is a good thing . While the use of Nano think the system wo n’t be automatically uploading audio to the cloud , the system is still effectively heed to user ’ conversation — a possible privacy risk .

AI for accessibility

Google isenhancing its TalkBack availability featurefor Android with a minute of generative AI magic .

Soon , TalkBack will tap Gemini Nano to create aural descriptions of physical object for low - vision and unsighted users . For example , TalkBack might describe an clause of vesture as such : “ A finish - up of a inglorious and white gingham dress . The dress is myopic , with a collar and long sleeves . It is tie at the shank with a bountiful bow . ”

According to Google , TalkBack user encounter around 90 or so unlabeled paradigm per day . Using Nano , the system will be able to put up brainstorm into content — potentially forgoing the need for someone to input that information manually .

We ’re launching an AI newssheet ! Sign uphereto start get it in your inboxes on June 5 .

Topics#

More from TechCrunch#

Generative AI in Search#

Project Astra and Gemini Live#

Join us at TechCrunch Sessions: AI#

Exhibit at TechCrunch Sessions: AI#

Google Veo#

Ask Photos#

Gemini in Gmail#

Detecting scams during calls#

AI for accessibility#

Topics

More from TechCrunch

Generative AI in Search

Project Astra and Gemini Live

Join us at TechCrunch Sessions: AI

Exhibit at TechCrunch Sessions: AI

Google Veo

Ask Photos

Gemini in Gmail

Detecting scams during calls

AI for accessibility