Meta drops two versions of the Llama 3 model with a third imminent

Meta has released the highly anticipated Llama 3 series, with the first two models, Llama 3-8B and Llama 3-70B, now widely available.

Days ago, at an event in London, Meta executives Nick Clegg and Yann LeCun said Llama 3 would arrive within the month.

The first two versions dropped today, marking the third and fourth major open models to be released this month after xAI’s Grok-1.5V and Mistral’s 8x22B.

Llama 3 is pre-trained on an impressive 15 trillion tokens, a 7-fold increase compared to Llama 2. The pretraining data also includes four times more code.

Under the hood, Llama 3 introduces architectural improvements such as a more efficient tokenizer with a larger vocabulary of 128K tokens.

Here’s a quick rundown of Llama 3’s performance:

Performance of Llama 3 8B:

Outperforms models like Mistral’s 7B and Google’s Gemma 7B in several benchmarks.
Excels in MMLU, ARC, DROP, GPQA (biology, physics, chemistry questions), HumanEval (code generation), GSM-8K (math problems), MATH (math benchmark), AGIEval (problem-solving), and BIG-Bench Hard (commonsense reasoning).

70B comparison with other models:

Llama 3 70B is competitive with top AI models like Google’s Gemini 1.5 Pro.
Beats Gemini 1.5 Pro in MMLU, HumanEval, and GSM-8K.
Performs better than Anthropic’s Claude 3 Sonnet (the middle tier of its Claude 3 series) on five benchmarks: MMLU, GPQA, HumanEval, GSM-8K, and MATH.

Llama 3 8B and 70B benchmarks. Source: Meta

Those are excellent scores for an open model (although Meta’s license does have some limitations).

That makes Llama 3 the new top-performing free, open-source (sort of) model.

Llama 3 will also be more palatable and less stubborn to use – fewer non-responses and higher accuracy for trivia questions, historical facts, and STEM-related queries.

Llama 3 is poised to become widely available across major platforms, including cloud services and API providers.

Meta is already working to expand Llama 3 to 400 billion parameters and add new functions like multimodality, multilingual support, and extended contextual understanding.

Meta’s rogue role in generative AI

In many ways, Meta has emerged as the rebel of the generative AI industry.

Meta Chief AI Scientist Yann LeCun, one of AI’s most well-respected figureheads, holds what some construe as dissenting views about AI’s direction – views that criticize closed-source projects at Meta’s Big Tech competitors.

Meanwhile, ex-UK Deputy Prime Minister Nick Clegg, the head of Global Affairs, has been called out for some at-times laissez-faire views about Meta’s AI products, which may not surprise any Brits out there.

Last week, Clegg seemed to play down AI’s impacts on electioneering and deep fake manipulation, a view that very much counters the prevailing narrative that deep fakes could be (or already are) profoundly destructive.

As a matter of fact, Meta’s Oversight Board is actively investigating two cases of deep fake pornography right now. The Board deemed that Meta’s content moderation actions were too slow.

Meta has also been bullish about the improving quality of its models. Joelle Pineau, Meta’s vice president of AI research, said, “In many ways, the models that we have today are going to be child’s play compared to the models coming in five years.”

Pineau also warned, “If we keep on growing our model ever more in general and powerful without properly socializing them, we are going to have a big problem on our hands.”

Llama 3’s release also comes as Meta’s AI Facebook agents cause a commotion across social media.

In a Facebook group for New York City parents, a Meta AI assistant – designed to provide advice and answer questions – shocked people by claiming to have a “gifted and disabled child” attending a specific school for the “gifted and talented.”

When confronted by the group members, the AI admitted, “I’m just a large language model, I don’t have personal experiences or children,” in what some labeled a Black Mirror-esque incident.

Llama 3, Grok-1.5, and Mistral’s models shift more power towards open-source communities while further diluting the generative AI market.

But that might be a good thing, as it’s survival of the fittest now, and the ball is firmly in the Microsoft-OpenAI camp, which is anticipated to make the next move in this fascinating game of gen-AI chess.

The post Meta drops two versions of the Llama 3 model with a third imminent appeared first on DailyAI.
