Who wants ‘Her’-like AI that gets stuff wrong?

Topics

Latest

Amazon

Image Credits:Getty Images (Image has been modified)

Apps

Biotech & Health

Climate

Annoyed frustrated woman having problem with not working mobile phone

Image Credits:Getty Images (Image has been modified)

Cloud Computing

Commerce

Crypto

Anthropic Clio

Image Credits:Anthropic

initiative

EVs

Fintech

Future of Life Institute AI Safety Index

Image Credits:Future of Life Institute

fundraise

contraption

Gaming

Google

Government & Policy

Hardware

Instagram

Layoffs

Media & Entertainment

More from TechCrunch

Events

Startup Battlefield

StrictlyVC

Podcasts

Videos

Partner Content

TechCrunch Brand Studio

Crunchboard

Last week , OpenAI launchedAdvanced Voice Mode with Vision , which feeds substantial - meter telecasting toChatGPT , grant the chatbot to “ see ” beyond the confines of its app bed . The premise is that by giving ChatGPT greater contextual awareness , that bot can answer in a more natural and intuitive way .

But the first time I hear it , it lied to me .

“ That sofa see comfortable ! ” ChatGPT said as I deem up my phone and asked the bot to describe our animation way . It had mistaken the ottoman for a couch .

“ My fault ! ” ChatGPT said when I correct it . “ Well , it still looks like a comfortable space . ”

It ’s been intimately a year since OpenAI firstdemoedAdvanced Voice Mode with Vision , which the companypitchedas a gradation toward AI as depicted in the Spike Jonze movie “ Her . ” The mode OpenAI sold it , Advanced Voice Mode with Vision would grant ChatGPT superpowers — enabling the bot to resolve sketched - out mathematics problems , interpret emotion , and respond toaffectionate letters .

Has it achieve all that ? More or less . But Advanced Voice Mode with Vision has n’t solved ChatGPT ’s biggest outcome : reliableness . If anything , the feature make the bot’shallucinationsmore obvious .

Join us at TechCrunch Sessions: AI

Exhibit at TechCrunch Sessions: AI

At one point , curious to see if Advanced Voice Mode with Vision could assist ChatGPT offer fashion pointers , I enable it and need ChatGPT to rate an getup of mine . It jubilantly did so . But while the bot would give opinion on my jeans and olive - colored - shirt combo , it systematically missed the chocolate-brown jacket I was wear out .

I ’m not the only one who has happen slipups .

When OpenAI prexy Greg Brockman show off Advanced Voice Mode with Vision on “ 60 Minutes ” earlier this month , ChatGPT made a misunderstanding on a geometry problem . When calculating the area of a Triangulum , itmisidentifiedthe triangle ’s altitude .

So my question is , what good is “ Her”-like AI if you ca n’t trust it ?

With each ChatGPT misfire , I feel myself becoming less and less inclined to reach into my pocket , unlock my speech sound , launch ChatGPT , assailable Advanced Voice Mode , and enable Vision — a cumbersome series of tone in the good of circumstance . With its lustrous and gay behaviour , Advanced Voice Mode is clearly design to engender trust . When it does n’t deliver on that inexplicit promise , it ’s jarring — and disappointing .

Perhaps OpenAI can work out the hallucination problem once and for all someday . Until then , we ’re stuck with a bot that views the world through criss - crossed wiring . And honestly , I ’m not sure who might want that .

News

OpenAI ’s 12 days of “ shipmas ” continues : OpenAI is releasing new products every day up until December 20.Here’sa roundup of all the announcements , which we ’re update regularly .

YouTube let creators opt out : YouTube is give Maker more selection over how third party can utilize their content to prepare their AI model . Lord and right holders will be able to flag for YouTube if they ’re countenance specific company to train models on their clips .

Meta ’s smart glasses get rise : Meta ’s Ray - Ban Meta smart glasses have gottenseveral new AI - powered update , include the power to have an ongoing conversation with Meta ’s AI and read between spoken language .

DeepMind ’s answer to Sora : Google DeepMind , Google ’s flagship AI research lab , want to nonplus OpenAI at the video - generation game . On Monday , DeepMind announce Veo 2 , a next - gen video recording - generating AI that can create two - minute - plus clips in resolution up to 4k ( 4,096 x 2,160 pel ) .

OpenAI whistle-blower found bushed : A former OpenAI employee , Suchir Balaji , was recently find utter in his San Francisco flat , according to the San Francisco Office of the Chief Medical Examiner . In October , the 26 - year - old AI researcher raised concern about OpenAI bring out right of first publication law when he was interviewed by The New York Times .

Grammarly acquires Coda : Grammarly , best known for its elan and spell - check tools , has acquired productiveness inauguration Coda for an unrevealed amount . As part of the deal , Coda ’s CEO and co - founder , Shishir Mehrotra , will become the young CEO of Grammarly .

Cohere is working with Palantir : TechCrunch exclusively reported that Cohere , the enterprise - focus AI startup assess at $ 5.5 billion , has a partnership with information analytics house Palantir . Palantir is vocal about its close — and at timescontroversial — work with U.S. defense and intelligence service authority .

Research paper of the week

Anthropic has pulled back the pall on Clio ( “ Claudeinsights andobservations ” ) , a organization that the company uses to understand how customers are employing its various AI models . Clio , which Anthropic compare to analytics tools such as Google Trends , is ply “ worthful insights ” for improving the safety of Anthropic ’s AI , take the company .

Anthropic tapped Clio to compose anonymized usage datum , some of which the company made public last week . So what are customers using Anthropic ’s AI for ? A range of a function of tasks — but web and nomadic app developing , capacity innovation , and donnish research top the list . Predictably , the use cases vary across spoken language ; for example , Japanese speakers are more probable to ask Anthropic ’s AI to analyse gum anime than Spanish utterer .

Model of the week

AI startupPikareleased its next - gen television generation model , Pika 2 , which can create a clip from a lineament , target , and location that users supply . Via Pika ’s program , users can upload multiple references ( for example , simulacrum of a council chamber and office doer ) and Pika 2 will “ intuit ” the use of each reference before combining them into a single scene .

Now , no manakin ’s perfect , of course . See the “ Zanzibar copal ” below created by Pika 2 , which has telling consistency but suffers from the aesthetic bizarreness present in all reproductive AI footage .

pic.twitter.com/3jWCy4659oLike I pronounce , Animes will be the first musical genre that s 100 % AI father . Its amazing to see what ’s already possible with Pika 2.0

— Chubby ♨ ️ ( @kimmonismus)December 16 , 2024

Still , the tools are very chop-chop improve in the video domain — and in equal component pique the interest and raising the anger of creatives .

Grab bag

The Future of Life Institute ( FLI ) , the nonprofit organization co - found by MIT cosmologist Max Tegmark , released an “ AI Safety Index ” designed to evaluate the base hit drill of leading AI companies across five fundamental areas : current hurt , safety framework , experiential base hit scheme , governance and accountability , and transparency and communication .

Meta was the bad of the cluster evaluated on the Index , with an overall F grade . ( The Index apply a numerical and GPA - based scoring system . ) Anthropic was the best but failed to manage better than a C — suggesting that there ’s room for improvement .

Topics#

More from TechCrunch#

Join us at TechCrunch Sessions: AI#

Exhibit at TechCrunch Sessions: AI#

News#

Research paper of the week#

Model of the week#

Grab bag#