Like every Big Tech company these days, Meta has its own flagship generative AI model, called Llama. Llama is fairly unique among major models in that it's "open," meaning developers can download and use it however they please (with certain restrictions). That's in contrast to models like Anthropic's Claude, OpenAI's GPT-4o (which powers ChatGPT) and Google's Gemini, which can only be accessed via APIs.
In the interest of giving developers choice, however, Meta has also partnered with vendors, including AWS, Google Cloud and Microsoft Azure, to make cloud-hosted versions of Llama available. In addition, the company has released tools designed to make it easier to fine-tune and customize the model.
Here's everything you need to know about Llama, from its capabilities and editions to where you can use it. We'll keep this post updated as Meta releases upgrades and introduces new dev tools to support the model's use.
What is Llama?
Llama is a family of models, not just one:
The latest versions are Llama 3.1 8B, Llama 3.1 70B and Llama 3.1 405B, which were released in July 2024. They're trained on web pages in a variety of languages, public code and files on the web, and synthetic data (i.e., data generated by other AI models).
Llama 3.1 8B and Llama 3.1 70B are small, compact models meant to run on devices ranging from laptops to servers. Llama 3.1 405B, on the other hand, is a large-scale model requiring data center hardware (absent some modifications). Llama 3.1 8B and Llama 3.1 70B are less capable than Llama 3.1 405B, but faster; they're "distilled" versions of 405B, in effect, optimized for low storage overhead and latency.
All the Llama models have 128,000-token context windows. (In data science, tokens are subdivided bits of raw data, like the syllables "fan," "tas" and "tic" in the word "fantastic.") A model's context, or context window, refers to the input data (for example, text) that the model considers before generating output (for instance, additional text). Long context can prevent models from "forgetting" the content of recent documents and data, and from veering off topic and extrapolating wrongly.
Those 128,000 tokens translate to around 100,000 words or 300 pages, which for reference is around the length of "Wuthering Heights," "Gulliver's Travels" and "Harry Potter and the Prisoner of Azkaban."
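To make that limit concrete, here's a minimal sketch of how a developer might count a prompt's tokens before sending it to the model. It assumes access to the Llama 3.1 tokenizer hosted on Hugging Face; the model ID and gated-access step are our assumptions, not an official Meta recipe.

```python
# A rough sketch of checking that a prompt fits within Llama 3.1's
# 128,000-token context window, using the Hugging Face tokenizer.
# Assumes you have accepted the Llama license on Hugging Face and are logged in.
from transformers import AutoTokenizer

CONTEXT_WINDOW = 128_000  # tokens, per Meta's Llama 3.1 announcement

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.1-8B")  # assumed model ID

def fits_in_context(text: str) -> bool:
    """Return True if `text` tokenizes to fewer tokens than the context window."""
    num_tokens = len(tokenizer.encode(text))
    print(f"{num_tokens} tokens out of {CONTEXT_WINDOW}")
    return num_tokens < CONTEXT_WINDOW

fits_in_context("The word 'fantastic' splits into pieces like 'fan', 'tas' and 'tic'.")
```

In practice, an app that hits the limit would chunk or summarize its inputs rather than truncating them blindly.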
What can Llama do?
Like other generative AI models, Llama can perform a range of different assistive tasks, like coding and answering basic math questions, as well as summarizing documents in eight languages (English, German, French, Italian, Portuguese, Hindi, Spanish and Thai). Most text-based workloads (think analyzing files like PDFs and spreadsheets) are within its purview; none of the Llama models can process or generate images, although that may change in the near future.
All the latest Llama models can be configured to leverage third-party apps, tools and APIs to complete tasks. They're trained out of the box to use Brave Search to answer questions about recent events, the Wolfram Alpha API for math- and science-related queries, and a Python interpreter for validating code. In addition, Meta says the Llama 3.1 models can use certain tools they haven't seen before (but whether they can reliably use those tools is another matter).
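That tool use follows a familiar loop: the model proposes a tool call, the application executes it, and the result is fed back so the model can write a grounded answer. Here's a simplified, hypothetical sketch of that loop; the function names and the JSON shape of the tool call are illustrative stand-ins, not Meta's actual prompt format.

```python
# Hypothetical sketch of the tool-use loop: the model asks for a search,
# the app runs it and hands the result back for a final answer.
import json

def call_llama(messages: list[dict]) -> str:
    """Stand-in for a real Llama 3.1 inference call (local or hosted)."""
    # A real deployment would return either plain text or a tool-call payload.
    return json.dumps({"tool": "brave_search", "query": "latest Llama release"})

def brave_search(query: str) -> str:
    """Stand-in for a real Brave Search API request."""
    return f"Top results for: {query}"

messages = [{"role": "user", "content": "What did Meta release most recently?"}]
reply = call_llama(messages)

maybe_tool_call = json.loads(reply)
if maybe_tool_call.get("tool") == "brave_search":
    # Execute the tool on the model's behalf and append the result.
    result = brave_search(maybe_tool_call["query"])
    messages.append({"role": "tool", "content": result})
    print(call_llama(messages))
```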
Where can I use Llama?
If you're looking to simply chat with Llama, it's powering the Meta AI chatbot experience on Facebook Messenger, WhatsApp, Instagram, Oculus and Meta.ai.
Developers building with Llama can download, use or fine-tune the model across most of the popular cloud platforms. Meta claims it has over 25 partners hosting Llama, including Nvidia, Databricks, Groq, Dell and Snowflake.
Some of these partners have built additional tools and services on top of Llama, including tools that let the models reference proprietary data and enable them to run at lower latencies.
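For developers going the self-hosted route, here's a minimal sketch of running a downloaded Llama 3.1 8B checkpoint locally with the Hugging Face transformers library. The model ID and hardware settings are assumptions; the hosted partners expose the same models behind their own APIs instead.

```python
# A minimal sketch of local inference with a downloaded Llama 3.1 8B checkpoint.
# Assumes license acceptance on Hugging Face and a GPU with enough memory.
import torch
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="meta-llama/Llama-3.1-8B-Instruct",  # assumed model ID
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

messages = [{"role": "user", "content": "Summarize what a context window is in two sentences."}]
output = generator(messages, max_new_tokens=120)
print(output[0]["generated_text"][-1]["content"])  # the assistant's reply
```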
Meta suggests using its smaller models, Llama 8B and Llama 70B, for general-purpose applications like powering chatbots and generating code. Llama 405B, the company says, is better reserved for model distillation (the process of transferring knowledge from a large model to a smaller, more efficient model) and generating synthetic data to train (or fine-tune) alternative models.
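For a rough idea of that synthetic-data workflow, here's a hedged sketch: use the large model to generate training examples, then save them for fine-tuning a smaller model. The `ask_405b` helper is a hypothetical stand-in for whatever hosted Llama 3.1 405B endpoint you use, and the prompt and file format are purely illustrative.

```python
# Hedged sketch of generating synthetic training data with a large model,
# to be used later for fine-tuning a smaller one.
import json

def ask_405b(prompt: str) -> str:
    """Stand-in for a call to a hosted Llama 3.1 405B endpoint."""
    return f"Synthetic answer to: {prompt}"

seed_questions = [
    "Explain model distillation in one paragraph.",
    "What is a context window?",
]

with open("synthetic_train.jsonl", "w") as f:
    for question in seed_questions:
        record = {"prompt": question, "completion": ask_405b(question)}
        f.write(json.dumps(record) + "\n")
# The resulting JSONL could then be used to fine-tune Llama 3.1 8B or 70B.
```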
Importantly, the Llama license constrains how developers can deploy the model: App developers with more than 700 million monthly users must request a special license from Meta that the company will grant at its discretion.
What tools does Meta offer for Llama?
Alongside Llama, Meta provides tools intended to make the model "safer" to use:
Llama Guard tries to detect potentially problematic content either fed into or generated by a Llama model, including content relating to criminal activity, child exploitation, copyright violations, hate, self-harm and sexual abuse. Developers can customize the categories of blocked content and apply the blocks to all the languages Llama supports out of the box (a usage sketch appears at the end of this section).
Like Llama Guard, Prompt Guard can block text intended for Llama, but only text meant to "attack" the model and get it to behave in undesirable ways. Meta claims that Prompt Guard can defend against explicitly malicious prompts (i.e., jailbreaks that attempt to get around Llama's built-in safety filters) in addition to prompts that contain "injected inputs."
As for CyberSecEval, it's less a tool than a collection of benchmarks to measure model security. CyberSecEval can assess the risk a Llama model poses (at least according to Meta's criteria) to app developers and end users in areas like "automated social engineering" and "scaling offensive cyber operations."
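To illustrate how Llama Guard slots in, here's a minimal sketch of screening a user message before it reaches a chatbot. The model ID and chat-template usage reflect the Hugging Face hosting of Llama Guard 3 and are assumptions on our part, not Meta's only supported path.

```python
# A minimal sketch of moderating a user message with Llama Guard 3.
# Assumes gated access to the checkpoint has been granted on Hugging Face.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-Guard-3-8B"  # assumed model ID
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

conversation = [{"role": "user", "content": "How do I pick a lock?"}]

# Llama Guard's chat template wraps the conversation in a moderation prompt.
inputs = tokenizer.apply_chat_template(conversation, return_tensors="pt").to(model.device)
output = model.generate(inputs, max_new_tokens=30)

# The model replies with a verdict such as "safe" or "unsafe" plus category codes.
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```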
Llama’s limitations
Llama comes with certain risks and limitations, like all generative AI models.
For instance, it's unclear whether Meta trained Llama on copyrighted content. If it did, users might be liable for infringement if they end up unwittingly using a copyrighted snippet that the model regurgitated.
Meta at one point used copyrighted e-books for AI training despite its own lawyers' warnings, according to recent reporting by Reuters. The company controversially trains its AI on Instagram and Facebook posts, photos and captions, and makes it difficult for users to opt out. What's more, Meta, along with OpenAI, is the subject of an ongoing lawsuit brought by authors, including comedian Sarah Silverman, over the companies' alleged unauthorized use of copyrighted data for model training.
Programming is another area where it's wise to tread lightly when using Llama. That's because Llama might, like its generative AI counterparts, produce buggy or insecure code.
As always, it's best to have a human expert review any AI-generated code before incorporating it into a service or software.