Meta drops two versions of the Llama 3 model with a third imminent

Meta has released the highly anticipated Llama 3 series, with the first two models, Llama 3-8B and Llama 3-70B, now widely available.

Days ago, at an event in London, Meta executives Nick Clegg and Yann LeCun said Llama 3 was imminent this month.Â

The first two versions dropped today, marking the third and fourth major open models to be released this month after xAIâ€™s Grok-1.5V and Mistralâ€™s 8x22B.

Llama 3 is pre-trained on an impressive 15 trillion tokens, a 7-fold increase compared to Llama 2. The pretraining data also includes four times more code.

Under the hood, Llama 3 introduces architectural improvements such as a more efficient tokenizer with a larger vocabulary of 128K tokens.

Hereâ€™s a quick rundown of Llama 3â€™s performance:

Performance of Llama 3 8B:

Outperforms models like Mistralâ€™s 7B and Googleâ€™s Gemma 7B in several benchmarks.
Excels in MMLU, ARC, DROP, GPQA (biology, physics, chemistry questions), HumanEval (code generation), GSM-8K (math problems), MATH (math benchmark), AGIEval (problem-solving), and BIG-Bench Hard (commonsense reasoning).

70B comparison with other models:

Llama 3 70B is competitive with top AI models like Googleâ€™s Gemini 1.5 Pro.
Beats Gemini 1.5 Pro in MMLU, HumanEval, and GSM-8K.
Performs better than Anthropicâ€™s Claude 3 Sonnet (the middle tier of itâ€™s Claude 3 series) on five benchmarks: MMLU, GPQA, HumanEval, GSM-8K, and MATH.

Llama 3 8B and 70B benchmarks. Source: Meta
Llama 8B and 70B benchmarks. Source: Meta

Those are excellent scores for an open model (although Metaâ€™s license does have some limitations).

It makes Llama 3 the new top-performing open-source (sort of) free model.

Llama 3 will also be more palatable and less stubborn to use â€“ fewer non-responses and higher accuracy for trivia questions, historical facts, and STEM-related queries.

Llama 3 is poised to become widely available across major platforms, including cloud services and API providers.

Meta is already working to expand Llama 3 to 400 billion parameters and add new functions like multimodality, multilingual support, and extended contextual understanding.

Metaâ€™s rogue role in generative AI

In many ways, Meta has emerged as the rebel of the generative AI industry.

Meta Chief AI Scientist Yann LeCun, one of AIâ€™s most well-respected figureheads, holds what some construe as dissenting views about AIâ€™s direction â€“ views that criticize closed-source projects at Metaâ€™s Big Tech competitors.

Meanwhile, ex-UK Deputy Prime Minister Nick Clegg, the head of Global Affairs, has been called out for some at-times laissez-faire views about Metaâ€™s AI products, which may not surprise any Brits out there.

Last week, Clegg seemed to play down AIâ€™s impacts on electioneering and deep fake manipulation. A view that very much counters the prevailing narrative that deep fakes could be (or already are) profoundly destructive.

As a matter of fact, Metaâ€™s Oversight Board is actively investigating two cases of deep fake pornography right now. The Board deemed that Metaâ€™s content moderation actions were too slow.

Meta has also been bullish about the improving quality of its models. Joelle Pineau, Metaâ€™s vice president of AI research, said, â€œIn many ways, the models that we have today are going to be childâ€™s play compared to the models coming in five years.â€

Pineau also warned, â€œIf we keep on growing our model ever more in general and powerful without properly socializing them, we are going to have a big problem on our hands.â€Â

Llama 3â€™s release also comes as Metaâ€™s AI Facebook agents cause a commotion across social media.

In a Facebook group for New York City parents, a Meta AI assistant â€“ designed to provide advice and answer questions â€“ shocked people by claiming to have a â€œgifted and disabled childâ€ attending a specific school for the â€œgifted and talented.â€

When confronted by the group members, the AI admitted, â€œIâ€™m just a large language model, I donâ€™t have personal experiences or children,â€ in what some labeled a Black Mirror-esque incident.

Llama 3, Grok-1.5, and Mistralâ€™s models shift more power towards open-sourced communities while further diluting the generative AI market.

But that might be a good thing, as itâ€™s survival of the fittest now, and the ball is firmly in the Microsoft-OpenAI camp, which is anticipated to make the next move in this fascinating game of gen-AI chess.

The post Meta drops two versions of the Llama 3 model with a third imminent appeared first on DailyAI.

Source: Read MoreÂ

Sunshine And March Vibes (2025 Wallpapers Edition)

The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

How To Fix Largest Contentful Paint Issues With Subpart Analysis

How To Prevent WordPress SQL Injection Attacks

Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

Bing Search APIs to be “decommissioned completely” as Microsoft urges developers to use its Azure agentic AI alternative

Microsoft might kill the Surface Laptop Studio as production is quietly halted

Minecraft licensing robbed us of this controversial NFL schedule release video

The power of generators

The power of generators

Simplify Factory Associations with Laravel’s UseFactory Attribute

This Week in Laravel: React Native, PhpStorm Junie, and more

Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

Bing Search APIs to be “decommissioned completely” as Microsoft urges developers to use its Azure agentic AI alternative

Microsoft might kill the Surface Laptop Studio as production is quietly halted

Meta drops two versions of the Llama 3 model with a third imminent

Metaâ€™s rogue role in generative AI

Nmap 7.96 Launches with Lightning-Fast DNS and 612 Scripts

CVE-2025-4831 – TOTOLINK HTTP POST Request Handler Buffer Overflow Vulnerability

OpenAI Introduces CriticGPT: A New Artificial Intelligence AI Model based on GPT-4 to Catch Errors in ChatGPTâ€™s Code Output

CVE-2023-4377 – Apache Struts Remote Code Execution Vulnerability

mAid â€“ easy and ready-to-use distribution for Android lovers

Microsoft will entirely deprecate Dev Home later this year

LightPROF: A Lightweight AI Framework that Enables Small-Scale Language Models to Perform Complex Reasoning Over Knowledge Graphs (KGs) Using Structured Prompts

Mechanisms of Localized Receptive Field Emergence in Neural Networks

New Credit Card Skimmer Targets WordPress, Magento, and OpenCart Sites

Behind the scenes of animating a design system component

Meta drops two versions of the Llama 3 model with a third imminent

Metaâ€™s rogue role in generative AI

Related Posts