Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      May 9, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      May 9, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      May 9, 2025

      How To Prevent WordPress SQL Injection Attacks

      May 9, 2025

      This Motorola Razr deal at Best Buy is the top offer I’ve seen on the flip phone

      May 9, 2025

      Google Maps can identify and save places in your screenshots – here’s how

      May 9, 2025

      T-Mobile is giving loyal users a free line right now – how to see if you qualify

      May 9, 2025

      CTA warns of tariff-fueled price hikes on consumer tech – but it’s not all bad news

      May 9, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      Big Node, VS Code, and Mantine updates

      May 9, 2025
      Recent

      Big Node, VS Code, and Mantine updates

      May 9, 2025

      Prepare for Contact Center Week with Colleen Eager

      May 9, 2025

      Preparing for the Unthinkable: Safeguarding People and Productivity During India-Pakistan Conflicts

      May 9, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      Microsoft confirms Offline Calendar for New Outlook on Windows 11

      May 9, 2025
      Recent

      Microsoft confirms Offline Calendar for New Outlook on Windows 11

      May 9, 2025

      Windows 11 Microsoft Store tests Copilot integration to increase app downloads

      May 9, 2025

      Beyond APT: Software Management with Flatpak on Ubuntu

      May 9, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»Machine Learning»Meet Amazon Nova Act: An AI Agent that can Automate Web Tasks

    Meet Amazon Nova Act: An AI Agent that can Automate Web Tasks

    April 2, 2025

    Amazon has revealed a new artificial intelligence (AI) model called Amazon Nova Act. This AI agent is designed to operate and take actions within a web browser, automating tasks like filling out forms, navigating interfaces, and handling popups. Think of it as an assistant working directly on websites. Amazon has also released Nova Act SDK, which lets developers experiment with the technology. Developers can create agents to handle simple online tasks.

    Current Status of AI Agents

    AI agents mostly talk or find information, responding in natural language or searching knowledge bases. According to Amazon, they envision AI agents being able to complete tasks in digital environments for users.

    However, agentic AI technology is still developing, meaning most AI agents rely heavily on existing application programming interfaces (APIs). Most real-world tasks lack comprehensive APIs, limiting what current agents can achieve reliably.

    Amazon hopes agents will eventually manage complex, multi-step jobs, such as planning large events or handling IT support tasks. Currently, AI agents still need constant human guidance and checking, making them less practical for truly independent work.

    What is Amazon Nova Act? Key Features and Functions

    Amazon Nova Act is an AI agent that can control and perform tasks within a web browser. This new AI model is trained to complete tasks in a web browser using simple commands. It is available as a research preview through the Nova Act SDK. The tool allows agents to handle tasks like scheduling and email management. It is designed to complete real-world tasks without human intervention at every step.

    Here are some features and functions:

    • Web Action Focus: Amazon Nova Act is trained specifically to operate and interact with web browser elements.
    • Developer SDK: A research preview SDK allows developers to build and test AI agent prototypes.
    • Task Automation: The goal is to automate simple browser tasks. This includes filling out forms or managing calendar entries. It can also handle tasks like ordering items online.
    • Atomic Commands: The SDK helps break down complex processes. It uses reliable basic commands like ‘search’ or ‘checkout.’
    • Detailed Instructions: Developers can add specific guidance to commands. For example, instructing the agent to decline optional add-ons.
    • API and Code Integration: The system allows calling external APIs, meaning developers can also insert Python code for checks or custom logic.
    • Reliability Emphasis: Amazon focused on high accuracy for tricky web elements. These include date pickers, dropdown menus, and pop-up windows. Internal tests show strong performance here.
    • Background Operation: AI agents can run without direct observation once set up using Amazon Nova Act. They can operate headlessly or on a schedule.
    • Cross-Environment Potential: Early tests suggest Nova Act can apply its interface understanding to new areas. Surprisingly, this includes environments like web-based games.

    Amazon stresses that Nova Act prioritizes reliability for foundational actions. Amazon is focused on targeting over 90% success on internal tests for specific web interactions. This focus means that built agents should work consistently once configured.

    Amazon Nova Act AI agent has claimed strong results on benchmarks measuring direct web control ability. The browser-based AI agent performs well against competitors in specific interaction tests. However, it hasn’t been compared using all common AI agent evaluations yet.

    Challenges to Autonomous AI Agent Workflow

    The main challenge for all AI agents is consistency. Early AI systems often prove slow or error-prone, and they struggle with tasks humans find simple. Amazon hopes its focus on reliable building blocks will offer an advantage. The true test will be how Nova Act performs in real-world developer applications.

    Conclusion

    Amazon Nova Act clearly shows Amazon’s step and move into the AI agent domain. Its emphasis on reliable task components addresses a key weakness in current agent technology. Amazon hopes to encourage practical applications by providing developers with tools to create AI agents to automate browser tasks. This release from Amazon intensified competition in agentic AI workflow automation and its potential impact on productivity. A truly autonomous AI agent needs to sustain consistent performance; only then will true workflow automation be achieved.


    Check out the Technical details and Try it here. All credit for this research goes to the researchers of this project. Also, feel free to follow us on Twitter and don’t forget to join our 85k+ ML SubReddit.

    🔥 [Register Now] miniCON Virtual Conference on OPEN SOURCE AI: FREE REGISTRATION + Certificate of Attendance + 3 Hour Short Event (April 12, 9 am- 12 pm PST) + Hands on Workshop [Sponsored]

    The post Meet Amazon Nova Act: An AI Agent that can Automate Web Tasks appeared first on MarkTechPost.

    Source: Read More 

    Facebook Twitter Reddit Email Copy Link
    Previous ArticleA Comprehensive Guide to LLM Routing: Tools and Frameworks
    Next Article DeltaProduct: An AI Method that Balances Expressivity and Efficiency of the Recurrence Computation, Improving State-Tracking in Linear Recurrent Neural Networks

    Related Posts

    Machine Learning

    Enterprise AI Without GPU Burn: Salesforce’s xGen-small Optimizes for Context, Cost, and Privacy

    May 10, 2025
    Machine Learning

    ByteDance Open-Sources DeerFlow: A Modular Multi-Agent Framework for Deep Research Automation

    May 10, 2025
    Leave A Reply Cancel Reply

    Continue Reading

    Week in review: LLM package hallucinations harm supply chains, Nagios Log Server flaws fixed

    Security

    Gamescom 2024 is all about Xbox, with no PlayStation or Nintendo in sight

    Development

    NodeStealer Malware Targets Facebook Ad Accounts, Harvesting Credit Card Data

    Development

    I’ve had a ton of fun playing Skin Deep, but I hope the developers fix the game’s crashing problems

    News & Updates

    Highlights

    Streamline Your Cluster Deployments and Monitoring with DeClustor

    July 27, 2024

    Comments Source: Read More 

    Skywork AI Advances Multimodal Reasoning: Introducing Skywork R1V2 with Hybrid Reinforcement Learning

    April 25, 2025

    Rilasciata T2 Linux SDE 25.4: la distribuzione versatile con supporto AMD ROCm per RISC-V e ARM64

    April 15, 2025

    New Malware Campaign Exposes Gaps in Manufacturing Cybersecurity Defenses

    December 7, 2024
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.