Researchers at the University of Illinois have developed AI Agents that can Autonomously Hack Websites and Find Zero-Day Vulnerabilities

We all know AI is getting smarter every day, but youâ€™ll never guess what these researchers just accomplished. A team from the University of Illinois has unleashed AI agents that can autonomously hack websites and exploit real-world zero-day vulnerabilities â€“ security holes that even the developers donâ€™t know about yet.

Thatâ€™s right, the age of AI hacking is here.

The problem? Current AI hacking agents like the ones using ReAct are basically stumbling around blindly when it comes to complex, multi-stage attacks.

Hereâ€™s how it works: These ReAct-style agents iteratively take an action, observe the result, and repeat. Simple enough for basic tasks. But when it comes to the long game of high-level hacking, this approach crumbles for two huge reasons:

The context required balloons out of control for cybersecurity exploits. Weâ€™re talking pages upon pages of code, HTTP requests, and more to keep track of.

The agent gets trapped going down one vulnerability rabbit hole. If it tries exploiting some XSS vulnerability for example, it struggles to backtrack and pivot to attempt a completely different type of attack like SQL injection.

And yes, researchers have already confirmed this critical shortcoming empirically. If an AI agent starts down one path, it really struggles to change course and try other vulnerability types.

Using an advanced system called HPTSA (Hierarchical Planning and Task-Specific Agents), these AI agents work together like a well-oiled machine to probe websites, identify vulnerabilities, and execute hacks. One â€œplanning agentâ€ acts as the mastermind, exploring the target and delegating tasks to specialized â€œexpert agentsâ€ trained to exploit different types of vulnerabilities like cross-site scripting (XSS), SQL injection (SQLi), and more.

But hereâ€™s the real kicker â€“ these agents donâ€™t even need to be told about the specific vulnerability ahead of time. They can sniff out brand new, never-before-seen zero-days all on their own. The researchers put them to the test on 15 recent real-world vulnerabilities from major platforms like WordPress, PrestaShop, and more â€“ all unknown to the AI agents. And the results were chilling.

HPTSA managed to successfully exploit a whopping 53% of the vulnerabilities when given just 5 attempts. Even more alarming, it performed nearly as well as an AI agent that had been explicitly briefed on the specific vulnerability details. The open-source security scanners we all rely on? They failed miserably, unable to crack a single one.

So how much would hiring this elite team of AI hackers cost? Probably less than youâ€™d expect. The researchers estimate each successful exploit runs about $24 for the LLM API costs ( GPT4 Turbo) not counting the other costs. Autonomous AI hacking is already a very affordable threat.

Of course, the researchers didnâ€™t create this just for fun â€“ they want to help defend against the inevitable wave of AI-powered attacks. By understanding how these agents operate, we can develop better preventative security measures. The cybersecurity battle is already being waged by AIs. Weâ€™d better pick a side â€“ offense or defense â€“ because the hacking paradigm has definitively shifted.

Check out theÂ Paper and Authorâ€™s Blog. All credit for this research goes to the researchers of this project. Also,Â donâ€™t forget to follow us onÂ Twitter.Â

Feel Free to join ourÂ Telegram Channel andÂ LinkedIn Group.

If you like our work, you will love ourÂ newsletter..

Donâ€™t Forget to join ourÂ 44k+ ML SubReddit

The post Researchers at the University of Illinois have developed AI Agents that can Autonomously Hack Websites and Find Zero-Day Vulnerabilities appeared first on MarkTechPost.

Source: Read MoreÂ

Sunshine And March Vibes (2025 Wallpapers Edition)

The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

How To Fix Largest Contentful Paint Issues With Subpart Analysis

How To Build Confidence In Your UX Work

How to fix Atomfall’s annoying Xbox audio bug

Do this first in Atomfall before freeing Dr. Garrow — you can thank me later for making it so much easier

GPT 4o’s image update unlocked a huge opportunity most people are ignoring

5 secrets to achieving your goals, according to business leaders

Community News: Latest PEAR Releases (03.10.2025)

Community News: Latest PEAR Releases (03.10.2025)

Community News: Latest PECL Releases (03.11.2025)

How to Sell Products to PHP Developers Using Sponsorships

How to fix Atomfall’s annoying Xbox audio bug

How to fix Atomfall’s annoying Xbox audio bug

Do this first in Atomfall before freeing Dr. Garrow — you can thank me later for making it so much easier

Google code confirms Gemini in Chrome copies Edge’s Copilot sidebar idea on Windows 11

Researchers at the University of Illinois have developed AI Agents that can Autonomously Hack Websites and Find Zero-Day Vulnerabilities

ruby-align is Baseline Newly available

February 2025 Baseline monthly digest

NoName Ransomware Claims Yet Another Attack on Germany after Ukraine Presidentâ€™s Visit

â€˜Commando Catâ€™ Cryptojacking Campaign Exploits Remote Docker API Servers

CPU-GPU I/O-Aware LLM Inference Reduces Latency in GPUs by Optimizing CPU-GPU Interactions

Building Azure DevOps CI Pipelines for SPFx

Romanian Charged for Fraud Carried Out Through Card Skimming

This AI Paper Introduces BitNet a4.8: A Highly Efficient and Accurate 4-bit LLM

The best smart pens of 2025: Expert recommended

Google Warns of CVE-2024-7965 Chrome Security Flaw Under Active Exploitation

Researchers at the University of Illinois have developed AI Agents that can Autonomously Hack Websites and Find Zero-Day Vulnerabilities

Related Posts