DeepSeek: Everything you need to know about the AI chatbot app

Image Credits: GREG BAKER/AFP / Getty Images

DeepSeek has gone viral.

Chinese AI research lab DeepSeek broke into the mainstream consciousness this week after its chatbot app rose to the top of the Apple App Store charts (and Google Play, as well). DeepSeek’s AI models, which were trained using compute-efficient techniques, have led Wall Street analysts, and technologists, to question whether the U.S. can maintain its lead in the AI race and whether the demand for AI chips will sustain.

But where did DeepSeek come from, and how did it rise to international fame so quickly?

DeepSeek’s trader origins

DeepSeek is backed by High-Flyer Capital Management, a Chinese quantitative hedge fund that uses AI to inform its trading decisions.

AI enthusiast Liang Wenfeng co-founded High-Flyer in 2015. Wenfeng, who reportedly began dabbling in trading while a student at Zhejiang University, launched High-Flyer Capital Management as a hedge fund in 2019 focused on developing and deploying AI algorithms.

In 2023, High-Flyer started DeepSeek as a lab dedicated to researching AI tools separate from its financial business. With High-Flyer as one of its investors, the lab spun off into its own company, also called DeepSeek.

From day one, DeepSeek built its own data center clusters for model training. But like other AI companies in China, DeepSeek has been affected by U.S. export bans on hardware. To train one of its more recent models, the company was forced to use Nvidia H800 chips, a less-powerful version of a chip, the H100, available to U.S. companies.

DeepSeek’s technical team is said to skew young. The company reportedly aggressively recruits doctorate AI researchers from top Chinese universities. DeepSeek also hires people without any computer science background to help its tech better understand a wide range of subjects, per The New York Times.

DeepSeek’s strong models

DeepSeek unveiled its first set of models (DeepSeek Coder, DeepSeek LLM, and DeepSeek Chat) in November 2023. But it wasn’t until last spring, when the startup released its next-gen DeepSeek-V2 family of models, that the AI industry started to take notice.

DeepSeek-V2, a general-purpose text- and image-analyzing system, performed well in various AI benchmarks and was far cheaper to run than comparable models at the time. It forced DeepSeek’s domestic competition, including ByteDance and Alibaba, to cut the usage prices for some of their models, and make others completely free.

DeepSeek-V3, launched in December 2024, only added to DeepSeek’s notoriety.

According to DeepSeek’s internal benchmark testing, DeepSeek V3 outperforms both downloadable, openly available models like Meta’s Llama and “closed” models that can only be accessed through an API, like OpenAI’s GPT-4o.

Equally impressive is DeepSeek’s R1 “reasoning” model. DeepSeek claims that R1, released in January, performs as well as OpenAI’s o1 model on key benchmarks.

Being a reasoning model, R1 effectively fact-checks itself, which helps it to avoid some of the pitfalls that normally trip up models. Reasoning models take a little longer (usually seconds to minutes longer) to arrive at solutions compared to a typical non-reasoning model. The upside is that they tend to be more reliable in domains such as physics, science, and math.

There is a downside to R1, DeepSeek V3, and DeepSeek’s other models, however. Being Chinese-developed AI, they’re subject to benchmarking by China’s internet regulator to ensure that their responses “embody core socialist values.” In DeepSeek’s chatbot app, for example, R1 won’t answer questions about Tiananmen Square or Taiwan’s autonomy.

In March, DeepSeek surpassed 16.5 million visits. “[F]or March, DeepSeek is in second place, despite seeing traffic drop 25% from where it was in February, based on daily visits,” David Carr, editor at Similarweb, told TechCrunch. It still pales in comparison to ChatGPT, which surged past 500 million weekly active users in March.

A disruptive approach

If DeepSeek has a business model, it’s not clear what that model is, exactly. The company prices its products and services well below market value, and gives others away for free. It’s also not taking investor money, despite a ton of VC interest.

The way DeepSeek tells it, efficiency breakthroughs have enabled it to maintain extreme cost competitiveness. Some experts dispute the figures the company has supplied, however.

Whatever the case may be, developers have taken to DeepSeek’s models, which aren’t open source as the phrase is commonly understood but are available under permissive licenses that allow for commercial use. According to Clem Delangue, the CEO of Hugging Face, one of the platforms hosting DeepSeek’s models, developers on Hugging Face have created over 500 “derivative” models of R1 that have racked up 2.5 million downloads combined.

DeepSeek’s success against larger and more established rivals has been described as “upending AI” and “over-hyped.” The company’s success was at least in part responsible for causing Nvidia’s stock price to drop by 18% in January, and for eliciting a public response from OpenAI CEO Sam Altman. In March, U.S. Commerce Department bureaus told staffers that DeepSeek will be banned on their government devices, according to Reuters.

Microsoft announced that DeepSeek is available on its Azure AI Foundry service, Microsoft’s platform that brings together AI services for enterprises under a single banner. When asked about DeepSeek’s impact on Meta’s AI spending during its first-quarter earnings call, CEO Mark Zuckerberg said spending on AI infrastructure will continue to be a “strategic advantage” for Meta. In March, OpenAI called DeepSeek “state-subsidized” and “state-controlled,” and recommended that the U.S. government consider banning models from DeepSeek.

During Nvidia’s fourth-quarter earnings call, CEO Jensen Huang emphasized DeepSeek’s “excellent innovation,” saying that it and other “reasoning” models are great for Nvidia because they need so much more compute.

At the same time, some companies are banning DeepSeek, and so are entire countries and governments, including South Korea. New York state also banned DeepSeek from being used on government devices.

In May, Microsoft Vice Chairman and President Brad Smith said in a Senate hearing that Microsoft employees aren’t allowed to use DeepSeek due to data security and propaganda concerns.

As for what DeepSeek’s future might hold, it’s not clear. Improved models are a given. But the U.S. government appears to be growing wary of what it perceives as harmful foreign influence. In March, The Wall Street Journal reported that the U.S. will likely ban DeepSeek on government devices.

This story was originally published January 28, 2025, and will be updated regularly.