Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      May 16, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      May 16, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      May 16, 2025

      How To Prevent WordPress SQL Injection Attacks

      May 16, 2025

      Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

      May 16, 2025

      Bing Search APIs to be “decommissioned completely” as Microsoft urges developers to use its Azure agentic AI alternative

      May 16, 2025

      Microsoft might kill the Surface Laptop Studio as production is quietly halted

      May 16, 2025

      Minecraft licensing robbed us of this controversial NFL schedule release video

      May 16, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      The power of generators

      May 16, 2025
      Recent

      The power of generators

      May 16, 2025

      Simplify Factory Associations with Laravel’s UseFactory Attribute

      May 16, 2025

      This Week in Laravel: React Native, PhpStorm Junie, and more

      May 16, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

      May 16, 2025
      Recent

      Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

      May 16, 2025

      Bing Search APIs to be “decommissioned completely” as Microsoft urges developers to use its Azure agentic AI alternative

      May 16, 2025

      Microsoft might kill the Surface Laptop Studio as production is quietly halted

      May 16, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»Transforming Software Development with Multi-Agent Collaboration: CodeStory’s Aide Framework Sets State-of-the-Art on SWE-Bench-Lite with 40.3% Accepted Solutions

    Transforming Software Development with Multi-Agent Collaboration: CodeStory’s Aide Framework Sets State-of-the-Art on SWE-Bench-Lite with 40.3% Accepted Solutions

    July 2, 2024

    Recent developments in the field of software engineering have raised the bar for productivity and teamwork. A team of researchers from Codestory has recently developed a multi-agent coding framework called Aide that achieved a remarkable 40.3% accepted solutions on the SWE-Bench-Lite benchmark, establishing a new state-of-the-art. With its smooth integration into development environments and increased productivity, this framework promises to completely transform the way developers work with code.

    https://aide.dev/blog/sota-on-swe-bench-lite

    The idea of numerous agents, each in charge of a particular code symbol like a class, function, enum, or type, lies at the core of this architecture. This atomic level of granularity enables natural language communication amongst bots, enabling each to concentrate on a particular unit of task. The Language Server Protocol (LSP) facilitates the agents’ communication using protocols that guarantee accurate and effective information transmission.

    Practically, this means that up to 30 agents can be active at once during a single run, collaborating to make decisions and sharing information. The framework’s capabilities have been demonstrated by its remarkable performance on the SWE-Bench-Lite benchmark. ClaudeSonnet3.5 and GPT-4o were utilized in the creation of an editor environment for the agents through the use of Pyright and Jedi. GPT-4o was exceptional at code editing, while Sonnet3.5—which is renowned for its robust agentic behaviors—was helpful in organizing and navigating the codebase.

    The agentic aspect of Sonnet 3.5 was very significant. It was the first paradigm to propose separating functions instead of making already complex ones more complex, exhibiting a sophisticated knowledge of maintainability and code structure. This behavior, along with GPT-4o’s excellent code editing abilities, made the framework perform noticeably better than earlier versions.

    The SWE-Bench-Lite benchmark was selected because it can replicate real-world coding difficulties, giving agents a reliable testing environment. The benchmark configuration comprised a mock editor harness with Pyright for diagnostics and Jinja for LSP features, enabling agents to obtain information and perform tests quickly without taxing system resources.

    The benchmarking process yielded important lessons, one of which was the significance of agent collaboration. Together, agents who were each in charge of a different code symbol were able to do tasks quickly and often corrected unrelated problems like lint errors or TODOs as they went. This cooperative method not only enhanced the quality of the code but also demonstrated the ability of agentic systems to manage complicated coding jobs on their own.

    The team has shared that there are still a few obstacles to overcome before fully including this multi-agent framework in development environments. Research is currently underway to ensure smooth communication between human developers and agents, handle concurrent code modifications, and preserve code stability. Furthermore, the team is studying to optimize the framework’s performance better, specifically with inference speeds and intelligence costs.

    The team’s ultimate objective is to increase the capabilities of human developers rather than to replace them. The goal is to improve software development process accuracy and efficiency by supplying a swarm of specialized agents, freeing up developers to work on more complex problems while the agents take care of more detailed duties.

    The post Transforming Software Development with Multi-Agent Collaboration: CodeStory’s Aide Framework Sets State-of-the-Art on SWE-Bench-Lite with 40.3% Accepted Solutions appeared first on MarkTechPost.

    Source: Read More 

    Hostinger
    Facebook Twitter Reddit Email Copy Link
    Previous ArticleResearchers from UC Berkeley and Anyscale Introduce RouteLLM: An Open-Source Framework for Cost-Effective LLM Routing
    Next Article What Defines Open Source Success?

    Related Posts

    Common Vulnerabilities and Exposures (CVEs)

    CVE-2025-40906 – MongoDB BSON Serialization BSON::XS Multiple Vulnerabilities

    May 17, 2025
    Common Vulnerabilities and Exposures (CVEs)

    CVE-2025-4818 – SourceCodester Doctor’s Appointment System SQL Injection

    May 17, 2025
    Leave A Reply Cancel Reply

    Hostinger

    Continue Reading

    CVE-2025-2811 – “GL.iNet Router Regular Expression Complexity Inefficient Vulnerability”

    Common Vulnerabilities and Exposures (CVEs)

    Learn Laravel and Vite : Processing Static Assets

    Development

    Register Now for a Laravel Debugging Workshop by Sentry

    Development
    How iFood built a platform to run hundreds of machine learning models with Amazon SageMaker Inference

    How iFood built a platform to run hundreds of machine learning models with Amazon SageMaker Inference

    Machine Learning

    Highlights

    Linux

    (non) recensione AnduinOS

    May 12, 2025

    In questa nuova (non) recensione andremo a dare uno sguardo a AnduinOS, una nuova derivata di…

    Xbox’s South of Midnight weaves a dark yet empathetic tale while showing why “that kind of representation matters”

    February 11, 2025

    As the Elden Ring DLC beats the snot out of players, Hidetaka Miyazaki says toning difficulty down would “break the game itself”

    June 21, 2024

    Certifications | A rocket fuel for growth

    April 28, 2025
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.