    This AI Paper from Apple Introduces the Foundation Language Models that Power Apple Intelligence Features: AFM-on-Device and AFM-Server

    July 31, 2024

    Building language models that perform diverse tasks efficiently and accurately while protecting user privacy remains a significant challenge in AI. Such models must handle varied data types and applications without compromising performance or security, and they must operate within ethical frameworks that maintain user trust.

    Traditional AI models often rely heavily on massive server-based computations, leading to challenges in efficiency and latency. Current methods include various forms of transformer architectures, which are neural networks designed for processing data sequences. Combined with sophisticated training processes and data preprocessing techniques, these architectures aim to improve model performance and reliability. However, these methods often fall short in balancing efficiency, accuracy, and ethical considerations, especially in real-time applications on personal devices.

    Researchers from Apple have introduced two foundation language models: AFM-on-device, a model of roughly 3 billion parameters optimized for on-device use, and AFM-server, a larger server-based model designed for Apple’s Private Cloud Compute. Both are built to balance efficiency, accuracy, and responsible AI principles, with the goal of improving user experiences without compromising privacy or ethical standards.

    The on-device model employs pre-normalization with RMSNorm, grouped-query attention with eight key-value heads, and SwiGLU activation for efficiency, while RoPE positional embeddings support long-context processing. Training used a diverse data mixture that includes licensed data from publishers, open-source datasets, and publicly available web data. The server model was pre-trained on 6.3 trillion tokens, while the on-device model was obtained through distillation. The server model then underwent continued pre-training at a sequence length of 8192 on a mixture that upweights math and code data, followed by a context-lengthening stage that used sequences of 32768 tokens together with synthetic long-context Q&A data. Post-training combined supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) to improve instruction-following and conversational capabilities.
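
    The architectural components named above can be illustrated with a minimal pure-Python sketch. This is not Apple's implementation: the function names, the epsilon, the RoPE base of 10000, and the query-head count used in the note below are all assumptions chosen for illustration. Only the use of RMSNorm, SwiGLU, RoPE, and eight key-value heads comes from the article.

```python
import math

def rms_norm(x, gain, eps=1e-6):
    """RMSNorm: divide x by its root mean square, then apply a learned gain."""
    rms = math.sqrt(sum(v * v for v in x) / len(x) + eps)
    return [g * v / rms for v, g in zip(x, gain)]

def silu(v):
    """SiLU (swish) activation used inside SwiGLU."""
    return v / (1.0 + math.exp(-v))

def matvec(w, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]

def swiglu_ffn(x, w_gate, w_up, w_down):
    """SwiGLU feed-forward block: w_down @ (silu(w_gate @ x) * (w_up @ x))."""
    gate = [silu(v) for v in matvec(w_gate, x)]
    up = matvec(w_up, x)
    hidden = [g * u for g, u in zip(gate, up)]
    return matvec(w_down, hidden)

def kv_head_for_query_head(q_head, n_q_heads, n_kv_heads):
    """Grouped-query attention: consecutive query heads share one key-value head."""
    if n_q_heads % n_kv_heads != 0:
        raise ValueError("n_q_heads must be a multiple of n_kv_heads")
    group_size = n_q_heads // n_kv_heads
    return q_head // group_size

def rope_rotate(vec, position, base=10000.0):
    """RoPE: rotate each (even, odd) dimension pair by a position-dependent angle."""
    d = len(vec)
    out = []
    for i in range(0, d, 2):
        # The rotation frequency decreases for higher dimension pairs.
        theta = position / (base ** (i / d))
        c, s = math.cos(theta), math.sin(theta)
        out += [vec[i] * c - vec[i + 1] * s, vec[i] * s + vec[i + 1] * c]
    return out
```

    With a hypothetical 16 query heads and the eight key-value heads mentioned above, kv_head_for_query_head(h, 16, 8) maps query heads 0 and 1 to key-value head 0, heads 2 and 3 to head 1, and so on. Sharing key-value heads in this way shrinks the key-value cache relative to standard multi-head attention, which is the efficiency motivation for grouped-query attention.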

    The models were evaluated across a range of benchmarks. On the HELM MMLU 5-shot benchmark, the on-device model scored 61.4 and the server model scored 75.4. The server model also scored 72.4 on GSM8K, 69.7 on ARC-c, 86.9 on HellaSwag, and 79.2 on Winogrande. The research reports improvements in instruction following, reasoning, and writing tasks. It also highlights a commitment to ethical AI, describing extensive measures taken to prevent the perpetuation of stereotypes and biases.

    In conclusion, the paper provides a comprehensive overview of how Apple developed and deployed these language models. It addresses the problem of balancing efficiency, accuracy, and ethical considerations, and the proposed methods improve model performance while keeping the focus on user privacy and responsible AI principles. The work shows how advanced AI can be implemented in user-friendly and responsible ways and offers a framework that future AI development can build on.

    Check out the Paper. All credit for this research goes to the researchers of this project.

    The post This AI Paper from Apple Introduces the Foundation Language Models that Power Apple Intelligence Features: AFM-on-Device and AFM-Server appeared first on MarkTechPost.
