AI agents are transforming the software development life cycle

Earlier this year, the analyst firm Forrester revealed its list of the top 10 emerging technologies of 2024, and several of the technologies on the list related to AI agents â€“ models that donâ€™t just generate information but can perform complex tasks, make decisions and act autonomously.Â

â€œEarlier AIs that could go do things were narrow and constrained to a particular environment, using things like reinforcement learning. What weâ€™re seeing today is taking the capabilities of large language models to break those instructions into specific steps and then go execute those steps with different tools,â€ Brian Hopkins, VP of the Emerging Tech Portfolio at Forrester, said during an episode of our podcast, â€œWhat the Dev?â€Â

When it comes to software development, generative AI has commonly been used to help generate code or assist in code completions, saving developers time. Agentic AI will help developers even further by assisting them with more tasks throughout the software development life cycle, such as brainstorming, planning, building, testing, running code, and implementing fixes, explained Shuyin Zhao, VP of product at GitHub.

â€œAgents serve as an additional partner for developers, taking care of mundane and repetitive tasks and freeing developers to focus on higher-level thinking. At GitHub, we think of AI agents as being a lot like LEGOs â€“ the building blocks that help develop more advanced systems and change the software development process for the better,â€ Zhao explained.Â

An example of an AI agent for software development is IBMâ€™s recently released series of agents that can automatically resolve GitHub issues, freeing up developers to work on other things instead of getting stuck fixing their backlog of bugs. The IBM SWE-Agent suite includes a localization agent that finds the file and line of code causing the issue, an agent that edits lines of code based on developer requests, and an agent that can develop and execute tests.Â

Other examples of AI agents in software development include Devin and GitHub Copilot agents, and itâ€™s been reported that OpenAI and Google are both working on developing their own agents too.Â Â

While this technology is still relatively new, Gartner recently predicted that 33% of enterprise software will contain agentic AI capabilities by 2028 (compared to under 1% in 2024), and these capabilities will allow 15% of day-to-day decisions to be made autonomously.Â

â€œBy giving artificial intelligence agency, organizations can increase the number of automatable tasks and workflows. Software developers are likely to be some of the first affected, as existing AI coding assistants gain maturity,â€ Gartner wrote in its prediction.Â

Specialization and multi-agent architectures

Current LLMs like GPT-4o or Claude are â€œjacks-of-all-trades, masters of none,â€ meaning that they do a wide range of tasks satisfactorily, from writing poetry to generating code to solving math problems, explained Ruchir Puri, chief scientist at IBM. AI agents, on the other hand, need to be trained to do a particular task, using a particular tool. â€œThis tool is certified for doing that manual process today, and if Iâ€™m going to introduce an agent, it should use that tool,â€ he said.

Given that each agent is highly specialized, the question then becomes, how do you get many of them to work together to tackle complex problems? According to Zhao, the answer is a multi-agent architecture, which is a network of many of these specialized agents that interact with each other and collaborate on a larger goal. Because each agent is highly specialized to a particular task, together they are collectively able to solve more complex problems, she said.Â

â€œAt GitHub, our Copilot Workspace platform uses a multi-agent architecture to help developers go from idea to code entirely in natural language. In simple terms, theyâ€™re a combination of specialized agents that, when combined, can help developers solve complex problems more efficiently and effectively,â€ Zhao explained as an example.

Puri believes that implementing a multi-agent system is not very different from how a human team comes together to solve complex problems.Â

â€œYou have somebody who is a software engineer, somebody whoâ€™s an SRE, somebody who does something else,â€ Puri explained. â€œThat is the way we humans have learned to do complex tasks, with a mixture of skills and people who are experts in different areas. That is how I foresee these agents evolving as well, as we continue forward with multi-agent coordination and multi-agent complex behavior.â€

One might think that given the reputation of generative AI to hallucinate, increasing the number of agents working together might possibly increase the impact of hallucinations because as the number of decisions being made goes up, the potential for a wrong decision to be made at some point in the chain also goes up. However, there are ways to mitigate this, according to Loris Degionnai, CTO and founder of Sysdig, a security company that has developed its own AI agents for security.

â€œThere are structures and layers that we can put together to increase accuracy and decrease mistakes, especially when these mistakes are important and critical,â€ he said. â€œAgentic AI can be structured so that thereâ€™s different layers of LLMs, and some of these layers are there, essentially, to provide validation.â€

He also explained that, again, the safeguards for multi-agent architectures might mimic the safeguards a team of humans has. For instance, in a security operations center, there are entry-level workers who are less skilled, but who can surface suspicious things to a second tier of more experienced workers who can make the distinction between things that need to be investigated further and those that can be safely disregarded.

â€œIn software development, or even in cybersecurity, there are tiers, there are layers of redundancy when you have people doing this kind of stuff, so that one person can check what the prior person has done,â€Â Degionnai said.

AI agents are still building trust with developers

Just as there was skepticism into how well generative AI could write code, there will also likely be a period where AI agents will need to earn trust before they are sent off to make decisions on their own, without human input. According to Puri, people will probably need to see a very consistent output from agents for a long period of time before theyâ€™re entirely comfortable with this.

He likened it to the trust you place in your car every day. You get in every morning and it takes you from point A to point B, and even though the average person doesnâ€™t know how the internal combustion engine works, they do trust it to work and to get them to their destination safely. And, if it doesnâ€™t work, they know who to take it to to get it to work again.Â

â€œYou put your life or your familyâ€™s life in that car, and you say it should work,â€ Puri said. â€œAnd that, to me, is the level of trust you need to get in these technologies, and that is the journey you are on. But you are at the beginning of the journey.â€

Challenges that need to be solved before implementation

In addition to building trust, there are still a number of other challenges that need to be addressed. One is that AI agents need to be augmented with enterprise data, and that data needs to be up-to-date and accurate, explained Ronan Schwartz, CEO of the data company K2view.Â Â

â€œAccess to this information, the critical backbone of the organization, is really at the core of making any AI work,â€ said Schwartz.

Cost is another issue, as every query is an expense, and the costs can get even higher when working on a large dataset because of the compute and processing required.Â

Similarly, the speed and interactivity of an agent is important. Itâ€™s not really acceptable to be waiting two hours for a query to be answered, so lower latency is needed, Schwartz explained.

Data privacy and security also need to be considered, especially when a system contains multiple agents interacting with each other. Itâ€™s important to ensure that one agent isnâ€™t sharing information that another isnâ€™t supposed to have access to, he said.Â

â€œBe very, very thoughtful when evaluating tools and only deploy tools from vendors that are clearly prioritizing privacy and security,â€ said GitHubâ€™s Zhao. â€œThere should be clear documentation explaining exactly how a vendor is processing your companyâ€™s data in order to provide the service, what security measures they have in placeâ€“including filters for known vulnerabilities, harmful content, etc. If you canâ€™t find this information clearly documented, thatâ€™s a red flag.â€

And finally, AI agents need to be reliable since they are acting on someone elseâ€™s behalf. If the data they are operating on isnâ€™t reliable, then â€œthat can create a whole chain of action that is not necessary, or the wrong set of actions,â€ Schwartz explained.

Predictions for whatâ€™s to come

Jamil Valliani, head of AI product at Atlassian, believes that 2025 will be the year of the AI agent. â€œAgents are already quite good at augmenting and accelerating our work â€” in the next year, they will get even better at performing highly specific tasks, taking specialized actions, and integrating across products, all with humans in the loop,â€ he said. â€œIâ€™m most excited to see agents becoming exponentially more sophisticated in how they can collaborate with teams to handle complex tasks.â€

He believes that AI agents are benefiting from the fact that foundation models are evolving and are now able to reason over increasingly rich datasets. These advancements will not only improve the accuracy of agents, but also allow them to continuously learn from experiences, much like a human teammate might.Â

â€œOur relationship with them will evolve, and weâ€™ll see new forms of collaboration and communication on teams develop,â€ he said.Â

Steve Lucas, the CEO of Boomi, predicts that within the next three years, AI agents will outnumber humans. This doesnâ€™t mean that agents will necessarily eliminate human jobs, because as the number of agents increases, so does the need for human oversight and maintenance.Â

â€œIn this evolution, transparent protocols and governance are important for AI success and will become more significant as agents become embedded in the future of work,â€ he said.Â

K2viewâ€™s Schwartz agrees that the future workplace is not one in which agents do everything, but rather a place where humans and agents work alongside each other.Â

â€œI think sometimes people make a mistake in thinking that the humans will trigger the agent and the agent will do the work. I think the world will be more of a balanced one where agents also trigger humans to do certain work,â€ he said.Â

The post AI agents are transforming the software development life cycle appeared first on SD Times.

Source: Read MoreÂ

IBM’s next generation Granite models are now available

The Human Element: Using Research And Psychology To Elevate Data Storytelling

Google to offer free version of Gemini Code Assist

MongoDB acquires Voyage AI for its embedding and reranking models

AI-generated content in games is here to stay — the bigger issue is the outright deception and what the future may look like

Razer and Minecraft just announced a limited-edition collection, and I’m surprised it took so long

Panos Panay’s Amazon AI move: A bold bet or another Surface Duo?

OpenAI expands ‘Deep Reseach’ to those paying $20 a month or more, a day after Microsoft made OpenAI’s ‘Think Deeper’ free for all Copilot users with no usage caps

Rethink State💡 Why You Should Model Your Frontend Around Events

Rethink State💡 Why You Should Model Your Frontend Around Events

What To Expect When Migrating Your Site To A New Platform

Kotlin Multiplatform vs. React Native vs. Flutter: Building Your First App

AI-generated content in games is here to stay — the bigger issue is the outright deception and what the future may look like

AI-generated content in games is here to stay — the bigger issue is the outright deception and what the future may look like

Razer and Minecraft just announced a limited-edition collection, and I’m surprised it took so long

Panos Panay’s Amazon AI move: A bold bet or another Surface Duo?

AI agents are transforming the software development life cycle

Specialization and multi-agent architectures

AI agents are still building trust with developers

Challenges that need to be solved before implementation

Predictions for whatâ€™s to come

ANDI Accessibility Testing Tool Tutorial

How Data Analytics in Insurance is Driving Smarter Decisions

KuaiFormer: A Transformer-Based Architecture for Large-Scale Short-Video Recommendation Systems

I graduated college last year. These are the 5 essentials you actually need

Sales Cloud to Data Cloud with No Code!

The Haunted Python Algorithm

I tried Motorola’s Razr Plus (2024) and it beats the Samsung Galaxy Z Flip in 3 ways

Outlook will let go of its basic authentication for a stronger, more secure method

Russian Hackers Using Fake Brand Sites to Spread DanaBot and StealC Malware

New HTTP/2 Vulnerability Exposes Web Servers to DoS Attacks

AI agents are transforming the software development life cycle

Specialization and multi-agent architectures

AI agents are still building trust with developers

Challenges that need to be solved before implementation

Predictions for whatâ€™s to come

Related Posts