Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      May 13, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      May 13, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      May 13, 2025

      How To Prevent WordPress SQL Injection Attacks

      May 13, 2025

      This $4 Steam Deck game includes the most-played classics from my childhood — and it will save you paper

      May 13, 2025

      Microsoft shares rare look at radical Windows 11 Start menu designs it explored before settling on the least interesting one of the bunch

      May 13, 2025

      NVIDIA’s new GPU driver adds DOOM: The Dark Ages support and improves DLSS in Microsoft Flight Simulator 2024

      May 13, 2025

      How to install and use Ollama to run AI LLMs on your Windows 11 PC

      May 13, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      Community News: Latest PECL Releases (05.13.2025)

      May 13, 2025
      Recent

      Community News: Latest PECL Releases (05.13.2025)

      May 13, 2025

      How We Use Epic Branches. Without Breaking Our Flow.

      May 13, 2025

      I think the ergonomics of generators is growing on me.

      May 13, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      This $4 Steam Deck game includes the most-played classics from my childhood — and it will save you paper

      May 13, 2025
      Recent

      This $4 Steam Deck game includes the most-played classics from my childhood — and it will save you paper

      May 13, 2025

      Microsoft shares rare look at radical Windows 11 Start menu designs it explored before settling on the least interesting one of the bunch

      May 13, 2025

      NVIDIA’s new GPU driver adds DOOM: The Dark Ages support and improves DLSS in Microsoft Flight Simulator 2024

      May 13, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»Researchers at the University of Illinois have developed AI Agents that can Autonomously Hack Websites and Find Zero-Day Vulnerabilities

    Researchers at the University of Illinois have developed AI Agents that can Autonomously Hack Websites and Find Zero-Day Vulnerabilities

    June 11, 2024

    We all know AI is getting smarter every day, but you’ll never guess what these researchers just accomplished. A team from the University of Illinois has unleashed AI agents that can autonomously hack websites and exploit real-world zero-day vulnerabilities – security holes that even the developers don’t know about yet.

    That’s right, the age of AI hacking is here.

    The problem? Current AI hacking agents like the ones using ReAct are basically stumbling around blindly when it comes to complex, multi-stage attacks.

    Here’s how it works: These ReAct-style agents iteratively take an action, observe the result, and repeat. Simple enough for basic tasks. But when it comes to the long game of high-level hacking, this approach crumbles for two huge reasons:

    The context required balloons out of control for cybersecurity exploits. We’re talking pages upon pages of code, HTTP requests, and more to keep track of.

    The agent gets trapped going down one vulnerability rabbit hole. If it tries exploiting some XSS vulnerability for example, it struggles to backtrack and pivot to attempt a completely different type of attack like SQL injection.

    And yes, researchers have already confirmed this critical shortcoming empirically. If an AI agent starts down one path, it really struggles to change course and try other vulnerability types.

    Using an advanced system called HPTSA (Hierarchical Planning and Task-Specific Agents), these AI agents work together like a well-oiled machine to probe websites, identify vulnerabilities, and execute hacks. One “planning agent” acts as the mastermind, exploring the target and delegating tasks to specialized “expert agents” trained to exploit different types of vulnerabilities like cross-site scripting (XSS), SQL injection (SQLi), and more.

    But here’s the real kicker – these agents don’t even need to be told about the specific vulnerability ahead of time. They can sniff out brand new, never-before-seen zero-days all on their own. The researchers put them to the test on 15 recent real-world vulnerabilities from major platforms like WordPress, PrestaShop, and more – all unknown to the AI agents. And the results were chilling.

    HPTSA managed to successfully exploit a whopping 53% of the vulnerabilities when given just 5 attempts. Even more alarming, it performed nearly as well as an AI agent that had been explicitly briefed on the specific vulnerability details. The open-source security scanners we all rely on? They failed miserably, unable to crack a single one.

    So how much would hiring this elite team of AI hackers cost? Probably less than you’d expect. The researchers estimate each successful exploit runs about $24 for the LLM API costs ( GPT4 Turbo) not counting the other costs. Autonomous AI hacking is already a very affordable threat.

    Of course, the researchers didn’t create this just for fun – they want to help defend against the inevitable wave of AI-powered attacks. By understanding how these agents operate, we can develop better preventative security measures. The cybersecurity battle is already being waged by AIs. We’d better pick a side – offense or defense – because the hacking paradigm has definitively shifted.

    Check out the Paper and Author’s Blog. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter. 

    Feel Free to join our Telegram Channel and LinkedIn Group.

    If you like our work, you will love our newsletter..

    Don’t Forget to join our 44k+ ML SubReddit

    The post Researchers at the University of Illinois have developed AI Agents that can Autonomously Hack Websites and Find Zero-Day Vulnerabilities appeared first on MarkTechPost.

    Source: Read More 

    Facebook Twitter Reddit Email Copy Link
    Previous ArticleCan Machines Plan Like Us? NATURAL PLAN Sheds Light on the Limits and Potential of Large Language Models
    Next Article The Evolution of Chinese Large Language Models (LLMs)

    Related Posts

    Security

    Nmap 7.96 Launches with Lightning-Fast DNS and 612 Scripts

    May 14, 2025
    Common Vulnerabilities and Exposures (CVEs)

    CVE-2024-52290 – LF Edge eKuiper Cross-Site Scripting (XSS)

    May 14, 2025
    Leave A Reply Cancel Reply

    Continue Reading

    Connect the Amazon Q Business generative AI coding companion to your GitHub repositories with Amazon Q GitHub (Cloud) connector

    Development

    New Microsoft Teams calendar adds “latest innovations from both Copilot and Places,” aligns experience with Outlook

    News & Updates

    Microsoft flags macOS bug — remotely bypassing Apple’s sophisticated System Integrity Protection (SIP) security solution and allowing unauthorized third-party rootkit installs

    News & Updates

    13,000 MikroTik Routers Hijacked by Botnet for Malspam and Cyberattacks

    Development

    Highlights

    From pixels to personalization: The future of design

    February 7, 2025

    When it comes to technological shifts, we often overestimate their short-term impact while underestimating their…

    CVE-2025-43568 – Substance3D Use After Free Vulnerability

    May 13, 2025

    ZDNET Editors’ Choice: What it is, and how we’re awarding the best products we review

    April 29, 2025

    The Soldier’s Burden

    May 22, 2024
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.