Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      June 1, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      June 1, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      June 1, 2025

      How To Prevent WordPress SQL Injection Attacks

      June 1, 2025

      7 MagSafe accessories that I recommend every iPhone user should have

      June 1, 2025

      I replaced my Kindle with an iPad Mini as my ebook reader – 8 reasons why I don’t regret it

      June 1, 2025

      Windows 11 version 25H2: Everything you need to know about Microsoft’s next OS release

      May 31, 2025

      Elden Ring Nightreign already has a duos Seamless Co-op mod from the creator of the beloved original, and it’ll be “expanded on in the future”

      May 31, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      Student Record Android App using SQLite

      June 1, 2025
      Recent

      Student Record Android App using SQLite

      June 1, 2025

      When Array uses less memory than Uint8Array (in V8)

      June 1, 2025

      Laravel 12 Starter Kits: Definite Guide Which to Choose

      June 1, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      Photobooth is photobooth software for the Raspberry Pi and PC

      June 1, 2025
      Recent

      Photobooth is photobooth software for the Raspberry Pi and PC

      June 1, 2025

      Le notizie minori del mondo GNU/Linux e dintorni della settimana nr 22/2025

      June 1, 2025

      Rilasciata PorteuX 2.1: Novità e Approfondimenti sulla Distribuzione GNU/Linux Portatile Basata su Slackware

      June 1, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»Machine Learning»Falcon 3 models now available in Amazon SageMaker JumpStart

    Falcon 3 models now available in Amazon SageMaker JumpStart

    February 11, 2025

    Today, we are excited to announce that the Falcon 3 family of models from TII are available in Amazon SageMaker JumpStart. In this post, we explore how to deploy this model efficiently on Amazon SageMaker AI.

    Overview of the Falcon 3 family of models

    The Falcon 3 family, developed by Technology Innovation Institute (TII) in Abu Dhabi, represents a significant advancement in open source language models. This collection includes five base models ranging from 1 billion to 10 billion parameters, with a focus on enhancing science, math, and coding capabilities. The family consists of Falcon3-1B-Base, Falcon3-3B-Base, Falcon3-Mamba-7B-Base, Falcon3-7B-Base, and Falcon3-10B-Base along with their instruct variants.

    These models showcase innovations such as efficient pre-training techniques, scaling for improved reasoning, and knowledge distillation for better performance in smaller models. Notably, the Falcon3-10B-Base model achieves state-of-the-art performance for models under 13 billion parameters in zero-shot and few-shot tasks. The Falcon 3 family also includes various fine-tuned versions like Instruct models and supports different quantization formats, making them versatile for a wide range of applications.

    Currently, SageMaker JumpStart offers the base versions of Falcon3-3B, Falcon3-7B, and Falcon3-10B, along with their corresponding instruct variants, as well as Falcon3-1B-Instruct.

    Get started with SageMaker JumpStart

    SageMaker JumpStart is a machine learning (ML) hub that can help accelerate your ML journey. With SageMaker JumpStart, you can evaluate, compare, and select pre-trained foundation models (FMs), including Falcon 3 models. These models are fully customizable for your use case with your data.

    Deploying a Falcon 3 model through SageMaker JumpStart offers two convenient approaches: using the intuitive SageMaker JumpStart UI or implementing programmatically through the SageMaker Python SDK. Let’s explore both methods to help you choose the approach that best suits your needs.

    Deploy Falcon 3 using the SageMaker JumpStart UI

    Complete the following steps to deploy Falcon 3 through the JumpStart UI:

    1. To access SageMaker JumpStart, use one of the following methods:
      1. In Amazon SageMaker Unified Studio, on the Build menu, choose JumpStart models under Model development.
      2. Alternatively, in Amazon SageMaker Studio, choose JumpStart in the navigation pane.
    1. Search for Falcon3-10B-Base in the model browser.
    2. Choose the model and choose Deploy.
    3. For Instance type, either use the default instance or choose a different instance.
    4. Choose Deploy.
      After some time, the endpoint status will show as InService and you will be able to run inference against it.

    Deploy Falcon 3 programmatically using the SageMaker Python SDK

    For teams looking to automate deployment or integrate with existing MLOps pipelines, you can use the SageMaker Python SDK:

    from sagemaker.serve.builder.model_builder import ModelBuilder
    from sagemaker.serve.builder.schema_builder import SchemaBuilder
    from sagemaker.jumpstart.model import ModelAccessConfig
    from sagemaker.session import Session
    import logging
    
    sagemaker_session = Session()
    
    artifacts_bucket_name = sagemaker_session.default_bucket()
    execution_role_arn = sagemaker_session.get_caller_identity_arn()
    
    
    js_model_id = "huggingface-llm-falcon-3-10B-base"
    
    gpu_instance_type = "ml.g5.12xlarge"  
    
    response = "Hello, I'm a language model, and I'm here to help you with your English."
    
    sample_input = {
        "inputs": "Hello, I'm a language model,",
        "parameters": {"max_new_tokens": 128, "top_p": 0.9, "temperature": 0.6},
    }
    
    sample_output = [{"generated_text": response}]
    
    schema_builder = SchemaBuilder(sample_input, sample_output)
    
    model_builder = ModelBuilder(
        model=js_model_id,
        schema_builder=schema_builder,
        sagemaker_session=sagemaker_session,
        role_arn=execution_role_arn,
        log_level=logging.ERROR
    )
    
    model= model_builder.build()
    
    predictor = model.deploy(model_access_configs={js_model_id:ModelAccessConfig(accept_eula=True)}, accept_eula=True)

    Run inference on the predictor:

    predictor.predict(sample_input)

    If you want to set up the ability to scale down to zero after deployment, refer to Unlock cost savings with the new scale down to zero feature in SageMaker Inference.

    Clean up

    To clean up the model and endpoint, use the following code:

    predictor.delete_model()
    predictor.delete_endpoint()

    Conclusion

    In this post, we explored how SageMaker JumpStart empowers data scientists and ML engineers to discover, access, and run a wide range of pre-trained FMs for inference, including the Falcon 3 family of models. Visit SageMaker JumpStart in SageMaker Studio now to get started. For more information, refer to SageMaker JumpStart pretrained models, Amazon SageMaker JumpStart Foundation Models, and Getting started with Amazon SageMaker JumpStart.


    About the authors

    Niithiyn Vijeaswaran is a Generative AI Specialist Solutions Architect with the Third-Party Model Science team at AWS. His area of focus is generative AI and AWS AI Accelerators. He holds a Bachelor’s degree in Computer Science and Bioinformatics.

    Marc KarpMarc Karp is an ML Architect with the Amazon SageMaker Service team. He focuses on helping customers design, deploy, and manage ML workloads at scale. In his spare time, he enjoys traveling and exploring new places.

    RaghuRaghu Ramesha is a Senior ML Solutions Architect with the Amazon SageMaker Service team. He focuses on helping customers build, deploy, and migrate ML production workloads to SageMaker at scale. He specializes in machine learning, AI, and computer vision domains, and holds a master’s degree in Computer Science from UT Dallas. In his free time, he enjoys traveling and photography.

    Banu Nagasundaram leads product, engineering, and strategic partnerships for SageMaker JumpStart, SageMaker’s machine learning and GenAI hub. She is passionate about building solutions that help customers accelerate their AI journey and unlock business value.

    Source: Read More 

    Facebook Twitter Reddit Email Copy Link
    Previous ArticleMeta SAM 2.1 is now available in Amazon SageMaker JumpStart
    Next Article Building a virtual meteorologist using Amazon Bedrock Agents

    Related Posts

    Machine Learning

    How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark

    June 1, 2025
    Machine Learning

    BOND 2025 AI Trends Report Shows AI Ecosystem Growing Faster than Ever with Explosive User and Developer Adoption

    June 1, 2025
    Leave A Reply Cancel Reply

    Continue Reading

    CVE-2025-46585 – Linux Kernel Out-of-bounds Array Read/Write Vulnerability

    Common Vulnerabilities and Exposures (CVEs)

    NVIDIA’s CEO envisions a future where “multi-shot” agentic AIs — armed with reasoning superpowers outperform humans in the job market

    Development

    From Deep Knowledge Tracing to DKT2: A Leap Forward in Educational AI

    Machine Learning

    Fahmi Fadzil Proposes Major Updates to Malaysia’s Cyber Laws

    Development

    Highlights

    Development

    Microsoft AI Introduces LazyGraphRAG: A New AI Approach to Graph-Enabled RAG that Needs No Prior Summarization of Source Data

    November 27, 2024

    In AI, a key challenge lies in improving the efficiency of systems that process unstructured…

    Exclusive: Microsoft Bing tests “AI search” that replaces blue links, like Google AI mode

    February 25, 2025

    List of Activities and Their Corresponding Suitable LLMs in the Artificial Intelligence AI World Right Now: A Comprehensive Guide

    June 8, 2024

    Exploring the Role of Machine Learning in Enhancing User Experiences with React Native🧠

    April 22, 2025
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.