[Image: Google Veo presentation at Google I/O 2024. Image Credits: Google]


Google's gunning for OpenAI's Sora with Veo, an AI model that can create 1080p video clips around a minute long given a text prompt.

Unveiled on Tuesday at Google's I/O 2024 developer conference, Veo can capture different visual and cinematic styles, including shots of landscapes and time lapses, and make edits and adjustments to already-generated footage.

"We're exploring features like storyboarding and generating longer scenes to see what Veo can do," Demis Hassabis, head of Google's AI R&D lab DeepMind, told reporters during a virtual roundtable. "We've made incredible progress on video."

Veo builds on Google's preliminary commercial work in video generation, previewed in April, which tapped the company's Imagen 2 family of image-generating models to produce looping video clips.

But unlike the Imagen 2-based tool, which could only create low-resolution, few-seconds-long videos, Veo appears to be competitive with today's leading video generation models: not only Sora, but models from startups like Pika, Runway and Irreverent Labs.

In a briefing, Douglas Eck, who leads research efforts at DeepMind in generative media, showed me some cherry-picked examples of what Veo can do. One in particular, an aerial view of a bustling beach, demonstrated Veo's strengths over rival video models, he said.

"The detail of all the swimmers on the beach has proven to be hard for both image and video generation models, with that many moving characters," he said. "If you look closely, the surf looks pretty good. And the sense of the prompt word 'bustling,' I would argue, is captured with all the people, the lively beachfront filled with sunbathers."


Veo was trained on lots of footage. That's generally how it works with generative AI models: fed example after example of some form of data, the models pick up on patterns in the data that enable them to generate new data (videos, in Veo's case).
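As a toy illustration of that idea (this is not Google's training code; the transform, data and hyperparameters are invented for the example), a tiny numpy "model" can recover a hidden frame-to-frame pattern purely by being fed examples:

```python
import numpy as np

# Toy illustration only (not Google's actual training code): a "model" is
# fed example after example and picks up the pattern linking one frame to
# the next. Each "frame" here is a 4-pixel vector and the hidden pattern
# is a fixed linear transform; gradient descent recovers it from examples.
rng = np.random.default_rng(0)
true_transform = rng.normal(size=(4, 4))   # the pattern hidden in the data
frames = rng.normal(size=(256, 4))         # training "footage"
next_frames = frames @ true_transform.T    # each frame's true successor

W = np.zeros((4, 4))                       # the model's learned weights
for _ in range(500):                       # training loop
    pred = frames @ W.T                    # model's guess at the next frames
    grad = (pred - next_frames).T @ frames / len(frames)
    W -= 0.1 * grad                        # gradient descent step

# Having absorbed the pattern, the model can produce a plausible
# "next frame" for an input it has never seen.
new_frame = rng.normal(size=4)
generated = W @ new_frame
```

Real video models learn vastly richer, nonlinear patterns from enormous corpora, but the principle is the same: the structure in the training data ends up encoded in the weights.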

Where did the footage to train Veo come from? Eck wouldn't say exactly, but he did admit that some might've been sourced from Google's own YouTube.

"Google models may be trained on some YouTube content, but always in accordance with our agreement with YouTube creators," he said.

The "agreement" part may technically be true. But it's also true that, given YouTube's network effects, creators don't have much choice but to play by Google's rules if they want to reach the widest possible audience.

Reporting by The New York Times in April revealed that Google broadened its terms of service last year in part to allow the company to tap more data to train its AI models. Under the old ToS, it wasn't clear whether Google could use YouTube data to build products beyond the video platform. Not so under the new terms, which loosen the reins considerably.

Google's far from the only tech giant leveraging vast amounts of user data to develop in-house models. (See: Meta.) But what's sure to disappoint some creators is Eck's insistence that Google's setting the "gold standard" here, ethics-wise.

"The answer to this [training data] challenge will be found by getting all of the stakeholders together to figure out what the next steps are," he said. "Until we take those steps with the stakeholders (we're talking about the film industry, the music industry, artists themselves), we won't move fast."

Yet Google's already made Veo available to select creators, including Donald Glover (AKA Childish Gambino) and his creative agency Gilga. (Like OpenAI with Sora, Google's positioning Veo as a tool for creatives.)

Eck noted that Google provides tools that allow webmasters to prevent the company's bots from scraping training data from their websites. But the settings don't apply to YouTube. And Google, unlike some of its rivals, doesn't offer a mechanism to let creators remove their work from its training data sets after scraping.
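For ordinary websites, that opt-out works through robots.txt: Google's documented "Google-Extended" token controls whether a site's content is used to train its AI models, separately from Search indexing. A minimal sketch of the general mechanism Eck is referring to (not anything Veo-specific):

```
# robots.txt: opt this site's pages out of Google's AI-training crawls.
# "Google-Extended" governs use of content for training Google's AI models;
# it does not affect how the site appears in Google Search.
User-agent: Google-Extended
Disallow: /
```

As the article notes, no equivalent setting exists for videos uploaded to YouTube itself.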

I also asked Eck about regurgitation, which in the generative AI context refers to when a model generates a mirror copy of a training example. Tools like Midjourney have been found to spit out exact stills from movies including "Dune," "Avengers" and "Star Wars" when given a time stamp, posing a potential legal minefield for users. OpenAI has reportedly gone so far as to block trademarks and creators' names in prompts for Sora to try to deflect copyright challenges.
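To make the concept concrete, here's an illustrative and deliberately simplistic sketch of how a pipeline might flag a generated frame as a possible near-copy of a training still, using a perceptual "average hash." The hash size and bit threshold are arbitrary choices for the example, not anything Google has described:

```python
import numpy as np

def average_hash(frame: np.ndarray) -> np.ndarray:
    """Binary hash: each pixel compared against the frame's mean brightness."""
    return (frame > frame.mean()).ravel()

def looks_regurgitated(generated: np.ndarray, training_still: np.ndarray,
                       max_differing_bits: int = 5) -> bool:
    """Flag a generated frame whose hash nearly matches a training still's."""
    diff = np.count_nonzero(average_hash(generated) != average_hash(training_still))
    return diff <= max_differing_bits

rng = np.random.default_rng(1)
still = rng.random((8, 8))                                 # a "training still"
near_copy = still + rng.normal(scale=0.001, size=(8, 8))   # near-exact output
unrelated = rng.random((8, 8))                             # genuinely novel output

print(looks_regurgitated(near_copy, still))    # near-identical: flagged
print(looks_regurgitated(unrelated, still))    # different image: not flagged
```

Production systems would compare against billions of training examples with far more robust fingerprints, but the underlying question is the same: is this output too close to something the model was trained on?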

So what steps did Google take to mitigate the risk of regurgitation with Veo? Eck didn't have an answer, short of saying that the research team implemented filters for violent and explicit content (so no porn) and is using DeepMind's SynthID tech to mark videos from Veo as AI-generated.

"We're going to make a point of, for something as big as the Veo model, gradually releasing it to a small set of stakeholders that we can work with very closely to understand the implications of the model, and only then fan out to a larger group," he said.

Eck did have more to share on the model's technical details.

Eck described Veo as "quite controllable" in the sense that the model understands camera movements and VFX reasonably well from prompts (think descriptors like "pan," "zoom" and "explosion"). And, like Sora, Veo has somewhat of a grasp on physics, things like fluid dynamics and gravity, which contribute to the realism of the videos it generates.

Veo also supports masked editing for changes to specific areas of a video and can generate videos from a still image, a la generative models like Stability AI's Stable Video. Perhaps most intriguing, given a sequence of prompts that together tell a story, Veo can generate longer videos, beyond a minute in length.

That's not to suggest Veo's perfect. Reflecting the limitations of today's generative AI, objects in Veo's videos disappear and reappear without much explanation or consistency. And Veo gets its physics wrong often; for example, cars will inexplicably, impossibly reverse on a dime.

That's why Veo will remain behind a waitlist on Google Labs, the company's portal for experimental tech, for the foreseeable future, inside a new front end for generative AI video creation and editing called VideoFX. As it improves, Google aims to bring some of the model's capabilities to YouTube Shorts and other products.

"This is very much a work in progress, very much experimental … there's much more left undone than done here," Eck said. "But I think this is sort of the raw material for doing something really great in the filmmaking space."

We're launching an AI newsletter! Sign up here to start receiving it in your inboxes on June 5.