Topics
modish
AI
Amazon
Image Credits:Google
Apps
Biotech & Health
clime
Image Credits:Google
Cloud Computing
mercantilism
Crypto
Image Credits:Google / Google
Enterprise
EVs
Fintech
Image Credits:Google
Fundraising
gismo
stake
Image Credits:TechCrunch
Government & Policy
Hardware
Image Credits:TechCrunch
Layoffs
Media & Entertainment
Image Credits:Google
Meta
Microsoft
Privacy
Image Credits:Google
Robotics
Security
Social
blank space
startup
TikTok
Transportation
Venture
More from TechCrunch
result
Startup Battlefield
StrictlyVC
Podcasts
Videos
Partner Content
TechCrunch Brand Studio
Crunchboard
Contact Us
Google ’s going all in on AI — and it want you to know it . During the company ’s tonic at its I / O developer group discussion on Tuesday , Google mentioned “ AI”more than 120 time . That ’s a lot !
But not all of Google ’s AI declaration were substantial per se . Some were incremental . Others were retrograde . So to serve sieve the pale yellow from the straw , we rounded up the top Modern AI production and feature unveiled at Google I / atomic number 8 2024 .
Generative AI in Search
Google plans to utilize generative AI toorganize entire Google Search results page .
What will AI - devise pages see like ? Well , it count on the hunting query . But they might show AI - return sum-up of limited review , discussion from social media sites like Reddit and AI - generate lists of suggestion , Google say .
For now , Google plan to show AI - enhanced resultant role page when it detects a user is looking for inspiration — for example , when they ’re trip planning . shortly , it ’ll also show these results when users search for dining options and recipes , with results for motion picture , books , hotel , tocopherol - mercantilism and more to come .
Project Astra and Gemini Live
Google isimproving its AI - powered chatbot Geminiso that it can well sympathize the world around it .
The company previewed a raw experience in Gemini called Gemini Live , which lets exploiter have “ in - depth ” vocalisation chats with Gemini on their smartphones . Users can disrupt Gemini while the chatbot ’s speaking to ask elucidate interrogative , and it ’ll adapt to their speech patterns in real clip . And Gemini can see and respond to user ’ surround , either via photos or video recording captured by their smartphones ’ television camera .
Join us at TechCrunch Sessions: AI
Exhibit at TechCrunch Sessions: AI
Gemini Live — which wo n’t launch until afterward this twelvemonth — can answer question about things within survey ( or lately within horizon ) of a smartphone ’s camera , like which neck of the woods a user might be in or the name of a part on a broken bicycle . The technical foundation motor Live stem in part from Project Astra , a new enterprisingness within DeepMind to make AI - power apps and “ agents ” for real - clip , multimodal understanding .
Google Veo
Google ’s gunning for OpenAI’sSorawithVeo , an AI poser that can create 1080p video clips around a minute long when return a text prompting .
Veo can capture unlike visual and cinematic styles , admit barb of landscapes and prison term reverting , and make edits and adjustments to already generated footage . The model understands camera move and VFX reasonably well from prompts ( think descriptors like “ pan , ” “ zoom ” and “ detonation ” ) . And Veo has fairly of a grasp on physics — thing like fluid dynamic and gravitational force — which contribute to the realness of the videos it generates .
Veo also supports masked redaction for change to specific area of a video and can generate TV from a still image , à la generative exemplar likeStability AI ’s static Video . Perhaps most intriguing , given a sequence of command prompt that together tell a story , Veo can generate longer videos — videos beyond a minute in distance .
Ask Photos
Google Photos is getting an AI extract with the launching of an data-based feature article calledAsk photograph , power by Google ’s Gemini family of reproductive AI models .
inquire Photos , which will roll out later this summer , will allow substance abuser to search across their Google Photos appeal using natural language interrogation that leverage Gemini ’s reason of their photo ’s subject — and other metadata .
For instance , instead of search for a specific thing in a photograph , such as “ One World Trade , ” users will be able to perform much more broad and complex search , like finding the “ ripe photo from each of the National Parks I visited . ” In that example , Gemini would use signals such as light , blurriness and lack of screen background distortion to determine what make a picture the “ good ” in a given set and mix that with an understanding of the geolocation info and dates to devolve the relevant image .
Gemini in Gmail
Gmail users will soon be able-bodied tosearch , summarize and conscription emails , courtesy of Gemini — as well as take action on emails for more complex tasks , like serve process returns .
In one demonstration at I / O , Google showed how a parent could catch up on what was going on at their nestling ’s shoal by ask Gemini to resume all the late e-mail from the school . In summation to the body of the emails , Gemini will also examine attachments , such as PDFs , and spit out a summary with key points and action item .
From a sidebar in Gmail , substance abuser can ask Gemini to aid them form revenue from their emails and even put them in a Google Drive folder , or extract entropy from the receipts and paste it into a spreadsheet . If that ’s something you do often — for instance , as a business organization traveler tracking disbursal — Gemini can also offer to automatize the workflow for enjoyment in the future .
Detecting scams during calls
Googlepreviewed an AI - powered featureto alert users to potential scams during a call .
The capableness , which will be built into a future version of Android , uses Gemini Nano , the small version of Google ’s procreative AI offering , which can be scarper wholly on - gadget , to listen for “ conversation practice usually associated with scams ” in actual fourth dimension .
No specific release day of the month has been set for the feature . Like many of these things , Google is previewing how much Gemini Nano will be able to do down the road . We do sleep with , however , that the characteristic will be opt - in — which is a good thing . While the use of Nano think the system wo n’t be automatically uploading audio to the cloud , the system is still effectively heed to user ’ conversation — a possible privacy risk .
AI for accessibility
Google isenhancing its TalkBack availability featurefor Android with a minute of generative AI magic .
Soon , TalkBack will tap Gemini Nano to create aural descriptions of physical object for low - vision and unsighted users . For example , TalkBack might describe an clause of vesture as such : “ A finish - up of a inglorious and white gingham dress . The dress is myopic , with a collar and long sleeves . It is tie at the shank with a bountiful bow . ”
According to Google , TalkBack user encounter around 90 or so unlabeled paradigm per day . Using Nano , the system will be able to put up brainstorm into content — potentially forgoing the need for someone to input that information manually .
We ’re launching an AI newssheet ! Sign uphereto start get it in your inboxes on June 5 .