Meta releases Llama 3.1 models, sticks with open strategy

Meta has released its upgraded Llama 3.1 models in 8B, 70B, and 405B versions and committed to Mark Zuckerberg’s open source vision for the future of AI.

The new additions to Meta’s Llama family of models come with an expanded context length of 128k and support across eight languages.

Meta says its highly anticipated 405B model demonstrates “unmatched flexibility, control, and state-of-the-art capabilities that rival the best closed source models.” It also claims that Llama 3.1 405B is “the world’s largest and most capable openly available foundation model.”

With eye-watering sums being spent on the compute needed to train ever-larger models, there was a lot of speculation that Meta’s flagship 405B model could be its first paid model.

Llama 3.1 405B was trained on over 15 trillion tokens using 16,000 NVIDIA H100s, likely costing hundreds of millions of dollars.

In a blog post, Meta CEO Mark Zuckerberg reaffirmed the company’s view that open source AI is the way forward and that the release of Llama 3.1 is the next step “towards open source AI becoming the industry standard.”

The Llama 3.1 models are free to download and modify or fine-tune with a suite of services from Amazon, Databricks, and NVIDIA.

The models are also available on cloud service providers including AWS, Azure, Google, and Oracle.

Starting today, open source is leading the way. Introducing Llama 3.1: Our most capable models yet.

Today we’re releasing a collection of new Llama 3.1 models including our long awaited 405B. These models deliver improved reasoning capabilities, a larger 128K token context… pic.twitter.com/1iKpBJuReD

- AI at Meta (@AIatMeta), July 23, 2024

Performance

Meta says it tested its models on over 150 benchmark datasets and released results for the more common benchmarks to show how its new models stack up against other leading models.

There’s not a lot separating Llama 3.1 405B from GPT-4o and Claude 3.5 Sonnet. Here are the figures for the 405B model and then the smaller 8B and 70B versions.

Llama 3.1 405B benchmark comparison with other leading models. Source: Meta

Meta also performed “extensive human evaluations that compare Llama 3.1 with competing models in real-world scenarios.”

These figures rely on users to decide whether they prefer the response from one model or another.

The human evaluation of Llama 3.1 405B shows the same near-parity that the benchmark figures reveal.

Llama 3.1 405B human evaluation results compared with GPT-4, GPT-4o, and Claude 3.5 Sonnet. Source: Meta

Meta says its model is truly open as Llama 3.1 model weights are also available to download, although the training data has not been shared. The company also amended its license to allow Llama models to be used to improve other AI models.

The freedom to fine-tune, modify, and use Llama models without restrictions will have critics of open source AI ringing alarm bells.

Zuckerberg argues that an open source approach is the best way to avoid unintended harm. If an AI model is open to scrutiny, he says it’s less likely to develop dangerous emergent behavior that we would otherwise miss in closed models.

When it comes to the potential for intentional harm, Zuckerberg says, “As long as everyone has access to similar generations of models – which open source promotes – then governments and institutions with more compute resources will be able to check bad actors with less compute.”

Addressing the risk of state adversaries like China accessing Meta’s models, Zuckerberg says that efforts to keep these out of Chinese hands aren’t going to work.

“Our adversaries are great at espionage, stealing models that fit on a thumb drive is relatively easy, and most tech companies are far from operating in a way that would make this more difficult,” he explained.

The excitement over an open source AI model like Llama 3.1 405B taking on the big closed models is justified.

But with whispers of GPT-5 and Claude 3.5 Opus waiting in the wings, these benchmark results might not age very well.

The post Meta releases Llama 3.1 models, sticks with open strategy appeared first on DailyAI.
