Browser Automation Agents

AI agents that autonomously browse the web, interact with websites, fill forms, extract data, and complete multi-step browser tasks. The category ranges from headless automation to visual browsing agents.

7 tools in this category

browser-use helps you build agents that interact with websites, fill forms, and complete web tasks. It is available both as a self-hosted open-source library and as a cloud option for faster setup and scaling.

Open Source
Web Browsing
B2B
+3

Yutori provides APIs for browser automation, research, and recurring scouting tasks. It is built for teams and developers who want agents that can take web actions, monitor changes, and return results through APIs and logs.

Free Tier
Paid
iOS
+4

Gumloop is an AI automation framework for teams that want to turn repetitive workflows into agents and connected flows. It includes a free tier, usage-based credits, and enterprise controls for larger deployments.

Paid
Enterprise
B2B
+3

AutoGPT is a platform for building AI assistants that can run tasks continuously on your behalf. It’s aimed at developers and teams that want to automate multi-step digital work, especially around operations, sales, and agent building.

Open Source
iOS
Code Execution
+5

Stagehand is an open-source browser automation SDK built for developers and LLM-powered agents. It combines code-based browser control with natural-language actions so you can build web workflows that are more resilient to page changes.

Open Source
Web
API
+4

UiPath AI Computer Vision helps RPA robots recognize and interact with on-screen elements when selectors are brittle or unavailable. It is aimed at teams building automations for virtual desktops, remote apps, and other dynamic interfaces.

iOS
Vision
B2B
+4

UiPath AI Computer Vision helps UiPath Robots recognize on-screen elements when selectors break. It’s aimed at teams automating VDIs, legacy apps, PDFs, images, and other hard-to-target interfaces.

iOS
Vision
B2B
+4