    Salesforce AI Research Unveils APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets

    July 5, 2024

    Function-calling agent models, a significant advancement within large language models (LLMs), depend on high-quality, diverse, and verifiable training datasets. These models interpret natural language instructions and translate them into API calls, which are critical for real-time interactions with digital services. However, existing datasets often lack thorough verification and diversity, which leads to inaccurate or inefficient models. Overcoming these limitations is crucial for deploying function-calling agents reliably in real-world applications, such as retrieving stock market data or managing social media interactions.

    Current methods for training function-calling agents rely on static datasets that are not thoroughly verified. Models trained on such data often fail to generalize to new or unseen APIs, which severely limits their adaptability and performance. For example, a model trained primarily on restaurant booking APIs may struggle with stock market data retrieval because it has seen no relevant training data, highlighting the need for more robust datasets.

    Researchers from Salesforce AI Research propose APIGen, an automated pipeline designed to generate diverse and verifiable function-calling datasets. APIGen addresses the limitations of existing methods with a multi-stage verification process that checks data reliability and correctness. The process has three hierarchical stages: format checking, actual function execution, and semantic verification. By rigorously verifying each data point, APIGen produces high-quality datasets that significantly improve the training and performance of function-calling models.

    APIGen’s data generation process starts by sampling APIs and example query-answer pairs from a library and converting them into a standardized JSON format. The pipeline then applies a multi-stage verification process. Stage 1 is a format checker that ensures correct JSON structure. Stage 2 executes the function calls to verify their operational correctness. Stage 3 uses a semantic checker to ensure alignment between the function calls, execution results, and query objectives. The result is a dataset of 60,000 high-quality entries, covering 3,673 APIs across 21 categories, available on Hugging Face.
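
The three stages described above can be sketched as a chain of filters in Python. This is only an illustration under stated assumptions, not APIGen's actual implementation: the field names (`query`, `answers`, `name`, `arguments`), the toy `get_stock_price` API, and the keyword-based semantic check are all hypothetical stand-ins. In particular, the article does not say how the semantic checker works, so Stage 3 here is a deliberately crude placeholder.

```python
import json
from typing import Any, Callable, Dict, Optional, Tuple


def get_stock_price(symbol: str) -> float:
    """Hypothetical toy API; raises KeyError for unknown symbols."""
    prices = {"CRM": 250.0, "MSFT": 420.0}
    return prices[symbol]


# Hypothetical API library mapping function names to callables.
API_LIBRARY: Dict[str, Callable[..., Any]] = {"get_stock_price": get_stock_price}


def check_format(raw: str) -> Optional[dict]:
    """Stage 1: parse the JSON and confirm the assumed fields are present."""
    try:
        entry = json.loads(raw)
    except json.JSONDecodeError:
        return None
    if not isinstance(entry, dict):
        return None
    query, calls = entry.get("query"), entry.get("answers")
    if not isinstance(query, str) or not isinstance(calls, list) or not calls:
        return None
    for call in calls:
        if not isinstance(call, dict):
            return None
        name, args = call.get("name"), call.get("arguments")
        if not isinstance(name, str) or name not in API_LIBRARY:
            return None
        if not isinstance(args, dict):
            return None
    return entry


def execute_calls(entry: dict) -> Optional[list]:
    """Stage 2: actually run each call; any exception rejects the entry."""
    results = []
    for call in entry["answers"]:
        try:
            results.append(API_LIBRARY[call["name"]](**call["arguments"]))
        except Exception:
            return None
    return results


def check_semantics(entry: dict, results: list) -> bool:
    """Stage 3 placeholder: require that each call has at least one
    argument value that appears in the query text. A real semantic
    checker would also inspect `results`; this heuristic ignores them."""
    query = entry["query"].lower()
    return all(
        any(str(value).lower() in query for value in call["arguments"].values())
        for call in entry["answers"]
    )


def verify(raw: str) -> Tuple[bool, str]:
    """Run the stages in order and report where an entry was rejected."""
    entry = check_format(raw)
    if entry is None:
        return False, "format"
    results = execute_calls(entry)
    if results is None:
        return False, "execution"
    if not check_semantics(entry, results):
        return False, "semantic"
    return True, "passed"
```

Because the stages run in order of increasing cost, malformed entries are discarded before any function is executed, and only entries that execute cleanly reach the semantic check. Calling `verify` on a JSON string returns a pair such as `(False, "execution")`, naming the stage at which the entry was rejected.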

    Models trained on APIGen’s datasets achieved state-of-the-art results on the Berkeley Function-Calling Benchmark. Notably, these models outperformed multiple GPT-4 models. For instance, a model with only 7B parameters achieved an accuracy of 87.5%, surpassing previous state-of-the-art models by a significant margin. These results underscore the value of APIGen-generated datasets for improving the capabilities of function-calling agents.

    In conclusion, the researchers present APIGen, a novel framework for generating high-quality and diverse function-calling datasets, addressing a critical challenge in AI research. The proposed multi-stage verification process ensures data reliability and correctness, significantly enhancing model performance. The APIGen-generated datasets enable even small models to achieve competitive results, advancing the field of function-calling agents. This approach opens new possibilities for developing efficient and powerful language models, highlighting the importance of high-quality data in AI research.

    Check out the Paper and Project. All credit for this research goes to the researchers of this project.

    The post Salesforce AI Research Unveils APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets appeared first on MarkTechPost.
