    Meta releases Llama 3.1 models, sticks with open strategy

    July 27, 2024

    Meta has released its upgraded Llama 3.1 models in 8B, 70B, and 405B versions and committed to Mark Zuckerberg’s open source vision for the future of AI.

    The new additions to Meta’s Llama family of models come with an expanded context length of 128K tokens and support for eight languages.

    Meta says its highly anticipated 405B model demonstrates “unmatched flexibility, control, and state-of-the-art capabilities that rival the best closed source models.” It also claims that Llama 3.1 405B is “the world’s largest and most capable openly available foundation model.”

    With eye-watering sums being spent on the compute needed to train ever-larger models, there was a lot of speculation that Meta’s flagship 405B model could be its first paid model.

    Llama 3.1 405B was trained on over 15 trillion tokens using 16,000 NVIDIA H100s, likely costing hundreds of millions of dollars.
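    To put those numbers in perspective, here is a rough back-of-the-envelope sketch of the training compute. It relies on assumptions that are not in Meta's announcement: the common rule of thumb of about 6 floating-point operations per parameter per training token, an assumed peak of roughly 989 TFLOPS of dense BF16 throughput per H100, and a hypothetical 40% hardware utilization. Treat the result as an order-of-magnitude estimate, not a reported figure.

```python
# Back-of-the-envelope training compute estimate.
# Assumptions (not from the article): the ~6 * N * D FLOPs rule of thumb,
# an assumed H100 peak throughput, and a hypothetical utilization rate.

SECONDS_PER_DAY = 86_400


def training_flops(params: float, tokens: float) -> float:
    """Approximate total training FLOPs as 6 * parameters * tokens."""
    return 6.0 * params * tokens


def training_days(total_flops: float, num_gpus: int,
                  peak_flops_per_gpu: float, utilization: float) -> float:
    """Wall-clock days needed at a given sustained cluster throughput."""
    sustained = num_gpus * peak_flops_per_gpu * utilization
    return total_flops / sustained / SECONDS_PER_DAY


PARAMS = 405e9            # 405B parameters (from the article)
TOKENS = 15e12            # "over 15 trillion tokens" (from the article)
NUM_GPUS = 16_000         # from the article
H100_PEAK_FLOPS = 989e12  # assumed dense BF16 peak per GPU
UTILIZATION = 0.40        # hypothetical utilization, chosen for illustration

flops = training_flops(PARAMS, TOKENS)
days = training_days(flops, NUM_GPUS, H100_PEAK_FLOPS, UTILIZATION)

if __name__ == "__main__":
    print(f"Estimated training compute: {flops:.2e} FLOPs")
    print(f"Estimated wall-clock time:  {days:.0f} days on {NUM_GPUS} GPUs")
```

    Under these assumptions the estimate comes out in the range of 10^25 FLOPs and a couple of months of cluster time. Changing the assumed utilization or peak throughput shifts the duration proportionally.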

    In a blog post, Meta CEO Mark Zuckerberg reaffirmed the company’s view that open source AI is the way forward and that the release of Llama 3.1 is the next step “towards open source AI becoming the industry standard.”

    The Llama 3.1 models are free to download and modify or fine-tune with a suite of services from Amazon, Databricks, and NVIDIA.

    The models are also available from cloud service providers including AWS, Azure, Google, and Oracle.

    Starting today, open source is leading the way. Introducing Llama 3.1: Our most capable models yet.

    Today we’re releasing a collection of new Llama 3.1 models including our long awaited 405B. These models deliver improved reasoning capabilities, a larger 128K token context… pic.twitter.com/1iKpBJuReD

    — AI at Meta (@AIatMeta) July 23, 2024

    Performance

    Meta says it tested its models on over 150 benchmark datasets and released results for the more common benchmarks to show how its new models stack up against other leading models.

    There’s not a lot separating Llama 3.1 405B from GPT-4o and Claude 3.5 Sonnet. Here are the figures for the 405B model and then the smaller 8B and 70B versions.

    Llama 3.1 405B benchmark comparison with other leading models. Source: Meta

    Meta also performed “extensive human evaluations that compare Llama 3.1 with competing models in real-world scenarios.”

    These figures rely on users to decide whether they prefer the response from one model or another.

    The human evaluation of Llama 3.1 405B shows the same rough parity as the benchmark figures.

    Llama 3.1 405B human evaluation results compared with GPT-4, GPT-4o, and Claude 3.5 Sonnet. Source: Meta

    Meta says its model is truly open as Llama 3.1 model weights are also available to download, although the training data has not been shared. The company also amended its license to allow Llama models to be used to improve other AI models.

    The freedom to fine-tune, modify, and use Llama models with few restrictions will have critics of open source AI ringing alarm bells.

    Zuckerberg argues that an open source approach is the best way to avoid unintended harm. If an AI model is open to scrutiny, he says it’s less likely to develop dangerous emergent behavior that we would otherwise miss in closed models.

    When it comes to the potential for intentional harm, Zuckerberg says, “As long as everyone has access to similar generations of models – which open source promotes – then governments and institutions with more compute resources will be able to check bad actors with less compute.”

    Addressing the risk of state adversaries like China accessing Meta’s models, Zuckerberg says that efforts to keep them out of Chinese hands aren’t going to work.

    “Our adversaries are great at espionage, stealing models that fit on a thumb drive is relatively easy, and most tech companies are far from operating in a way that would make this more difficult,” he explained.

    The excitement over an open source AI model like Llama 3.1 405B taking on the big closed models is justified.

    But with whispers of GPT-5 and Claude 3.5 Opus waiting in the wings, these benchmark results might not age very well.

    The post Meta releases Llama 3.1 models, sticks with open strategy appeared first on DailyAI.
