Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      June 1, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      June 1, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      June 1, 2025

      How To Prevent WordPress SQL Injection Attacks

      June 1, 2025

      My top 5 must-play PC games for the second half of 2025 — Will they live up to the hype?

      June 1, 2025

      A week of hell with my Windows 11 PC really makes me appreciate the simplicity of Google’s Chromebook laptops

      June 1, 2025

      Elden Ring Nightreign Night Aspect: How to beat Heolstor the Nightlord, the final boss

      June 1, 2025

      New Xbox games launching this week, from June 2 through June 8 — Zenless Zone Zero finally comes to Xbox

      June 1, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      Student Record Android App using SQLite

      June 1, 2025
      Recent

      Student Record Android App using SQLite

      June 1, 2025

      When Array uses less memory than Uint8Array (in V8)

      June 1, 2025

      Laravel 12 Starter Kits: Definite Guide Which to Choose

      June 1, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      My top 5 must-play PC games for the second half of 2025 — Will they live up to the hype?

      June 1, 2025
      Recent

      My top 5 must-play PC games for the second half of 2025 — Will they live up to the hype?

      June 1, 2025

      A week of hell with my Windows 11 PC really makes me appreciate the simplicity of Google’s Chromebook laptops

      June 1, 2025

      Elden Ring Nightreign Night Aspect: How to beat Heolstor the Nightlord, the final boss

      June 1, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»Machine Learning»Salesforce AI Research Proposes PerfCodeGen: A Training-Free Framework that Enhances the Performance of LLM-Generated Code with Execution Feedback

    Salesforce AI Research Proposes PerfCodeGen: A Training-Free Framework that Enhances the Performance of LLM-Generated Code with Execution Feedback

    January 18, 2025

    Large Language Models (LLMs) have become essential tools in software development, offering capabilities such as generating code snippets, automating unit tests, and debugging. However, these models often fall short in producing code that is not only functionally correct but also efficient in runtime. Overlooking runtime efficiency can lead to software that performs poorly, increases operational costs, and impacts user experience. This issue is particularly pronounced for less experienced developers, who may rely on AI-suggested code without fully understanding its implications. Salesforce Research addresses these challenges with PerfCodeGen, a framework that aims to improve both the correctness and performance of LLM-generated code.

    Salesforce AI’s PerfCodeGen is a training-free framework designed to enhance the runtime efficiency of LLM-generated code. It achieves this by using execution feedback in an iterative self-refinement process. Unlike approaches requiring fine-tuning with extensive training data, PerfCodeGen employs a feedback loop that evaluates and refines code based on runtime metrics during test execution. The framework operates in two key phases: refining correctness and optimizing performance. Initially, it ensures the generated code meets functional requirements by addressing issues identified in unit tests. Once correctness is established, the framework focuses on runtime efficiency, optimizing the code by targeting and refining the most resource-intensive test cases. This iterative process results in solutions that are both correct and efficient.

    Technical Insights and Benefits

    PerfCodeGen integrates with existing LLM workflows and begins by generating multiple candidate solutions using nucleus sampling. In the first phase, these candidates are assessed for correctness through unit tests. Feedback from failed tests is used to refine the solutions. Once functional correctness is ensured, the framework moves to the second phase, analyzing runtime metrics to identify bottlenecks. This information is then used to optimize the code further, focusing on the most time-consuming test cases.

    This two-phase process increases the likelihood of producing optimally efficient programs. PerfCodeGen’s methodology mirrors human debugging and optimization practices, making it both effective and intuitive. Additionally, the framework’s reliance on feedback rather than retraining allows it to scale across various LLMs and application domains. It has shown consistent improvements in runtime efficiency and correctness across models such as Phi-3-mini, Llama 3, and GPT-4.

    PerfCodeGen has been tested on benchmarks such as HumanEval, MBPP, and APPS, demonstrating its effectiveness:

    1. Runtime Efficiency: On HumanEval, GPT-4’s optimization rate (%Opt) increased from 24.54% to 28.83% with PERFCODEGEN, with similar improvements observed across other models.
    2. Correctness Improvement: On MBPP, GPT-3.5’s correctness rate (%Correct) rose from 66.38% to 73.36% with a single sample (Best@1).
    3. Outperforming Ground Truth: PERFCODEGEN enabled LLMs to generate more efficient solutions than ground truth in approximately 55% of HumanEval tasks and 67% of MBPP tasks.
    4. Scalability: Open models such as Phi-3-mini and Mixtral achieved performance comparable to closed models like GPT-3.5 and GPT-4.

    These results highlight PERFCODEGEN’s ability to balance correctness and runtime efficiency effectively, making it a valuable addition to LLM-driven code generation workflows.

    Hostinger

    Conclusion:

    PerfCodeGen offers a practical solution to a key limitation of current LLMs: their focus on correctness at the expense of runtime efficiency. By incorporating execution feedback into an iterative refinement process, PerfCodeGen enables the generation of code that is both correct and efficient. This approach enhances the usability of LLMs in software development, providing developers with tools to produce higher-quality code without extensive retraining. The framework’s success across diverse benchmarks demonstrates its potential as a step forward in creating efficient, reliable, and accessible AI-driven programming solutions.


    Check out the Paper and GitHub Page. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. Don’t Forget to join our 65k+ ML SubReddit.

    🚨 Recommend Open-Source Platform: Parlant is a framework that transforms how AI agents make decisions in customer-facing scenarios. (Promoted)

    The post Salesforce AI Research Proposes PerfCodeGen: A Training-Free Framework that Enhances the Performance of LLM-Generated Code with Execution Feedback appeared first on MarkTechPost.

    Source: Read More 

    Hostinger
    Facebook Twitter Reddit Email Copy Link
    Previous ArticleRsync ha un problema di Remote Code Execution, ma fortunatamente le fix sono già disponibili
    Next Article Researchers from Meta AI and UT Austin Explored Scaling in Auto-Encoders and Introduced ViTok: A ViT-Style Auto-Encoder to Perform Exploration

    Related Posts

    Machine Learning

    How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark

    June 1, 2025
    Machine Learning

    Enigmata’s Multi-Stage and Mix-Training Reinforcement Learning Recipe Drives Breakthrough Performance in LLM Puzzle Reasoning

    June 1, 2025
    Leave A Reply Cancel Reply

    Continue Reading

    cameractrls – camera controls for Linux

    Linux

    20+ Seasonal Lightroom Presets & LUTs for Photographers

    Development

    StereoAnything: A Highly Practical AI Solution for Robust Stereo Matching

    Development

    Copilot will be enhanced with ability to keep track of Teams group chats more efficiently

    Operating Systems
    Hostinger

    Highlights

    Artificial Intelligence

    Introducing Gemini 2.0: our new AI model for the agentic era

    May 27, 2025

    Today, we’re announcing Gemini 2.0, our most capable multimodal AI model yet. Source: Read More 

    Best Hotels In Ranthambore

    August 21, 2024

    CVE-2025-4145 – Netgear EX6200 Remote Buffer Overflow

    May 1, 2025

    Three ways to create the right data culture in your business

    June 30, 2024
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.