Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      May 16, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      May 16, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      May 16, 2025

      How To Prevent WordPress SQL Injection Attacks

      May 16, 2025

      Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

      May 16, 2025

      Bing Search APIs to be “decommissioned completely” as Microsoft urges developers to use its Azure agentic AI alternative

      May 16, 2025

      Microsoft might kill the Surface Laptop Studio as production is quietly halted

      May 16, 2025

      Minecraft licensing robbed us of this controversial NFL schedule release video

      May 16, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      The power of generators

      May 16, 2025
      Recent

      The power of generators

      May 16, 2025

      Simplify Factory Associations with Laravel’s UseFactory Attribute

      May 16, 2025

      This Week in Laravel: React Native, PhpStorm Junie, and more

      May 16, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

      May 16, 2025
      Recent

      Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

      May 16, 2025

      Bing Search APIs to be “decommissioned completely” as Microsoft urges developers to use its Azure agentic AI alternative

      May 16, 2025

      Microsoft might kill the Surface Laptop Studio as production is quietly halted

      May 16, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»Fireworks AI Releases Firefunction-v2: An Open Weights Function Calling Model with Function Calling Capability on Par with GPT4o at 2.5x the Speed and 10% of the Cost

    Fireworks AI Releases Firefunction-v2: An Open Weights Function Calling Model with Function Calling Capability on Par with GPT4o at 2.5x the Speed and 10% of the Cost

    June 20, 2024

    Fireworks AI releases Firefunction-v2, an open-source function-calling model designed to excel in real-world applications. It integrates with multi-turn conversations, instruction following, and parallel function calling. Firefunction-v2 offers a robust and efficient solution that rivals high-end models like GPT-4o but at a fraction of the cost and with superior speed and functionality.

    Introduction to Firefunction-v2

    LLMs’ capabilities have improved substantially in recent years, particularly with releases like Llama 3. These advancements have underscored the importance of function calling, allowing models to interact with external APIs and enhancing their utility beyond static data handling. Firefunction-v2 builds on these advancements, offering a model for real-world scenarios involving multi-turn conversations, instruction following, and parallel function calling.

    Image Source

    Firefunction-v2 retains Llama 3’s multi-turn instruction capability while significantly outperforming it in function-calling tasks. It scores 0.81 on a medley of public benchmarks compared to GPT-4o’s 0.80, all while being far more cost-effective and faster. Specifically, Firefunction-v2 costs $0.9 per output token, compared to GPT-4o’s $15, and operates at 180 tokens per second versus GPT-4o’s 69 tokens per second.

    The Creation Process

    The development of Firefunction-v2 was driven by user feedback and the need for a model that excels in both function calling and general tasks. Unlike other open-source function calling models, which often sacrifice general reasoning abilities for specialized performance, Firefunction-v2 maintains a balance. It was fine-tuned from the Llama3-70b-instruct base model using a curated dataset that included function calling and general conversation data. This approach ensured the preservation of the model’s broad capabilities while enhancing its function-calling performance.

    Evaluation and Performance

    The evaluation of Firefunction-v2 involved a mix of publicly available datasets and benchmarks such as Gorilla and Nexus. The results showed that Firefunction-v2 outperformed its predecessor, Firefunction-v1, and other models like Llama3-70b-instruct and GPT-4o in various function-calling tasks. For example, Firefunction-v2 achieved higher scores in parallel function calling and multi-turn instruction following, demonstrating its adaptability and intelligence in handling complex tasks.

    Image Source

    Highlighted Capabilities

    Firefunction-v2’s capabilities are best illustrated through practical applications. The model reliably supports up to 30 function specifications, significantly improving over Firefunction-v1, which struggled with more than five functions. This capability is crucial for real-world applications, as it allows the model to handle multiple API calls efficiently, providing a seamless user experience. Firefunction-v2 excels in instruction-following, making intelligent decisions about when to call functions, and executing them accurately.

    Image Source

    Getting Started with Firefunction-v2

    Firefunction-v2 is accessible through Fireworks AI’s platform, which offers a speed-optimized setup with an OpenAI-compatible API. This compatibility allows users to integrate Firefunction-v2 into their existing systems with minimal changes. The model can also be explored through a demo app and UI playground, where users can experiment with various functions and configurations.

    Conclusion

    Firefunction-v2 is a testament to Fireworks AI’s commitment to advancing the capabilities of large language models in function calling. Firefunction-v2 sets a new standard for real-world AI applications by balancing speed, cost, and performance. The positive feedback from the developer community and the impressive benchmark results underscore its potential to revolutionize how function calls are integrated into AI systems. Fireworks AI continues to iterate on its models, driven by user feedback and a dedication to providing practical solutions for developers.

    Check out the Docs, model playground, demo UI app, and Hugging Face model page. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter. 

    Join our Telegram Channel and LinkedIn Group.

    If you like our work, you will love our newsletter..

    Don’t Forget to join our 44k+ ML SubReddit

    The post Fireworks AI Releases Firefunction-v2: An Open Weights Function Calling Model with Function Calling Capability on Par with GPT4o at 2.5x the Speed and 10% of the Cost appeared first on MarkTechPost.

    Source: Read More 

    Facebook Twitter Reddit Email Copy Link
    Previous ArticleFirecrawl: A Powerful Web Scraping Tool for Turning Websites into Large Language Model (LLM) Ready Markdown or Structured Data
    Next Article Conformer-Based Speech Recognition on Extreme Edge-Computing Devices

    Related Posts

    Security

    Nmap 7.96 Launches with Lightning-Fast DNS and 612 Scripts

    May 16, 2025
    Common Vulnerabilities and Exposures (CVEs)

    CVE-2025-47916 – Invision Community Themeeditor Remote Code Execution

    May 16, 2025
    Leave A Reply Cancel Reply

    Continue Reading

    This $80 tablet makes a great travel companion – and at this price I might get two

    News & Updates

    Google Sheets just got a lot faster, but there’s a small catch. Here’s what you need to know

    Development

    What are some approaches to testing a major software update?

    Development

    Robot Framework – Best keyword to tab off an element

    Development

    Highlights

    How CSS Container Style Queries Enhance Web Design

    June 16, 2024

    What are these CSS Container Style Queries, and why should you use them? Juan Diego…

    The best weekly deals and sales for Steam

    June 21, 2024

    Ninja Gaiden 4 on Xbox started talks “six or seven years ago,” says Microsoft Gaming CEO Phil Spencer

    January 24, 2025

    Remix vs. Next.js: A Comprehensive Look at Modern React Frameworks

    February 13, 2025
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.