Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      May 16, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      May 16, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      May 16, 2025

      How To Prevent WordPress SQL Injection Attacks

      May 16, 2025

      Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

      May 16, 2025

      Bing Search APIs to be “decommissioned completely” as Microsoft urges developers to use its Azure agentic AI alternative

      May 16, 2025

      Microsoft might kill the Surface Laptop Studio as production is quietly halted

      May 16, 2025

      Minecraft licensing robbed us of this controversial NFL schedule release video

      May 16, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      The power of generators

      May 16, 2025
      Recent

      The power of generators

      May 16, 2025

      Simplify Factory Associations with Laravel’s UseFactory Attribute

      May 16, 2025

      This Week in Laravel: React Native, PhpStorm Junie, and more

      May 16, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

      May 16, 2025
      Recent

      Microsoft has closed its “Experience Center” store in Sydney, Australia — as it ramps up a continued digital growth campaign

      May 16, 2025

      Bing Search APIs to be “decommissioned completely” as Microsoft urges developers to use its Azure agentic AI alternative

      May 16, 2025

      Microsoft might kill the Surface Laptop Studio as production is quietly halted

      May 16, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»All Hands AI Open Sources OpenHands CodeAct 2.1: A New Software Development Agent to Solve Over 50% of Real Github Issues in SWE-Bench

    All Hands AI Open Sources OpenHands CodeAct 2.1: A New Software Development Agent to Solve Over 50% of Real Github Issues in SWE-Bench

    November 1, 2024

    The world of software development has seen an explosion in the use of AI agents over the last few years, promising to enhance productivity, automate complex tasks, and make the lives of developers easier. However, one problem that remains prevalent is the significant gap between these promising AI agents and their ability to address real-world issues effectively. Most AI Agents struggle to understand the complexity and contextual nuances of software development challenges—especially when it comes to solving real GitHub issues that developers face every day. These AI agents often fall short, requiring extensive oversight or manual correction from developers, which defeats their purpose. Addressing this challenge requires a solution that is not just smarter but is able to keep up with the dynamic demands of software engineering, a space full of unique challenges and fast-moving projects.

    All Hands AI Open Sources OpenHands CodeAct 2.1: a new software development agent, the first to solve over 50% of real GitHub issues in SWE-Bench, the standard benchmark for evaluating AI-assisted software engineering tools. OpenHands CodeAct 2.1 represents a significant leap forward, boasting a 53% resolution rate on SWE-Bench and a 41.7% success rate on SWE-Bench Lite. What makes OpenHands CodeAct 2.1 particularly revolutionary is that it has gone beyond experimentation in controlled environments and is now making a substantial impact on actual projects by solving real GitHub issues autonomously. Unlike other tools that are either too closed off for contribution or too niche to be useful to the broader community, OpenHands is an open-source agent that developers can freely use, improve, and adapt. With the perfect combination of openness and competitiveness, it has become the top choice for developers seeking an effective AI solution.

    OpenHands CodeAct 2.1’s performance improvements are primarily rooted in three major updates. First, it switched to Anthropic’s new Claude-3.5 model, which significantly improves natural language understanding, allowing CodeAct to better interpret issues raised by developers. Second, the agent’s actions have been modified to use function calling, which brings more precision in task execution. This ensures that the agent can call specific pieces of code without misinterpretation, effectively addressing developer issues more accurately. Lastly, the developers behind CodeAct 2.1 made significant improvements regarding directory traversal, reducing instances of the agent getting stuck in repetitive or circular tasks—a common problem that plagued earlier iterations. By refining the agent’s capabilities to navigate directories intelligently, larger and more complicated issues are resolved smoothly, and efficiency is markedly increased.

    The importance of these updates cannot be overstated. Having a 53% resolve rate on SWE-Bench means that over half of the issues in this benchmark were solved without any human intervention. Considering that SWE-Bench is specifically designed to be representative of real-world GitHub issues faced by software developers, this milestone demonstrates that OpenHands CodeAct 2.1 can directly impact software engineering workflows by solving a substantial number of issues autonomously. In the broader scope of automated development assistance, this is significant because it saves developers time and allows them to focus on higher-level challenges rather than getting bogged down by tedious issue resolution. Moreover, the open-source nature of OpenHands invites developers from around the globe to contribute and further improve the agent—a feature that the development community holds in high regard. The data from SWE-Bench Lite, where OpenHands CodeAct 2.1 achieved a 41.7% resolve rate, also supports its versatility and capability in handling less complex issues, which can be equally disruptive when left unchecked in a development pipeline.

    In conclusion, OpenHands CodeAct 2.1 is a breakthrough in AI-driven software development, moving us a step closer to fully autonomous coding assistants that genuinely enhance productivity. Its ability to solve over 50% of real GitHub issues in SWE-Bench demonstrates not only technological advancement but also practical usability that developers can rely on day-to-day. The open-source nature of OpenHands ensures that it remains a community-driven effort with the promise of continued improvements. Whether developers are looking to run OpenHands locally, integrate it through GitHub actions, or sign up for the soon-to-be-released online version, it offers flexibility and an open invitation to all developers to join in its evolution. With major improvements in the agent’s capabilities—such as adopting Anthropic’s Claude-3.5, implementing function calling, and improving directory traversal—OpenHands CodeAct 2.1 is setting the standard for what an AI development agent should be: effective, accessible, and continuously evolving.


    Check out the Details and GitHub here. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter.. Don’t Forget to join our 55k+ ML SubReddit.

    [Trending] LLMWare Introduces Model Depot: An Extensive Collection of Small Language Models (SLMs) for Intel PCs

    The post All Hands AI Open Sources OpenHands CodeAct 2.1: A New Software Development Agent to Solve Over 50% of Real Github Issues in SWE-Bench appeared first on MarkTechPost.

    Source: Read More 

    Facebook Twitter Reddit Email Copy Link
    Previous ArticleHow Druva used Amazon Bedrock to address foundation model complexity when building Dru, Druva’s backup AI copilot
    Next Article On Device Llama 3.1 with Core ML

    Related Posts

    Machine Learning

    Salesforce AI Releases BLIP3-o: A Fully Open-Source Unified Multimodal Model Built with CLIP Embeddings and Flow Matching for Image Understanding and Generation

    May 16, 2025
    Security

    Nmap 7.96 Launches with Lightning-Fast DNS and 612 Scripts

    May 16, 2025
    Leave A Reply Cancel Reply

    Continue Reading

    Unleash AI innovation with Amazon SageMaker HyperPod

    Machine Learning

    How to install Ubuntu Server in under 30 minutes

    News & Updates

    Google AI Released TxGemma: A Series of 2B, 9B, and 27B LLM for Multiple Therapeutic Tasks for Drug Development Fine-Tunable with Transformers

    Machine Learning

    CVE-2025-30330 – Adobe Illustrator Heap-based Buffer Overflow Vulnerability

    Common Vulnerabilities and Exposures (CVEs)

    Highlights

    I replaced my M1 MacBook Pro with a base model M4 – and it blew my $3,000 system away

    November 7, 2024

    Apple’s flagship laptop line won’t wow you with flashy features or fresh designs, but it’s…

    Microsoft Edge Tests Bottom Address Bar Swipe Gesture for Tab Switching on Android

    April 14, 2025

    Java Selenium: Custom Assert Message for Multiple Checkbox

    July 26, 2024

    Critical Unpatched Flaws Disclosed in Popular Gogs Open-Source Git Service

    July 8, 2024
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.