Skip to main content
UA

Make RPA bots read dynamic screens like a human

UiPath AI Computer Vision helps UiPath Robots recognize on-screen elements when selectors break. It’s aimed at teams automating VDIs, legacy apps, PDFs, images, and other hard-to-target interfaces.

iOS
Vision
B2B
Computer Use
Cloud Hosted
Hybrid
For Teams
Visit UiPath Automation Hub

Is this your tool? Claim this listing to manage your content and analytics.

Recent activity

What's happened with UiPath Automation Hub lately

  • Score change
    Rubric upgrade v3_0 → v3.1: score 5/32 → 9/3659/36(+4)

    Rubric upgrade: agenticness v3.0 (8 dims, /32) → v3.1 (9 dims, /36). Adds Dim 9 (Operator Sovereignty), splits Dim 6 into 6a/6b lenses, tightens Dim 4 autonomous-retry distinction. Not a product change — score shift reflects new dimension + recalibrated rubric, not a change in the tool. Fanout suppressed.

    See the news that prompted this

News mentions sourced from our news feed; score changes from periodic re-evaluations.

Ask about UiPath Automation Hub

Get answers based on UiPath Automation Hub's actual documentation

Try asking:

About

What It Is

UiPath AI Computer Vision is a computer vision capability for UiPath’s RPA platform. It is designed for automation teams and RPA developers who need robots to interact with dynamic interfaces, virtual desktops, and applications where traditional selectors are unreliable.

What to Know

This looks strongest when you need vision-based automation on interfaces that are difficult to handle with standard RPA techniques. UiPath says it uses a neural network with custom Screen OCR, text matching, and a multi-anchoring system to identify...

Key Features
Recognizes on-screen UI elements using computer vision
Supports VDI environments such as Citrix, VMware, Microsoft RDP, and VNC
Works on desktop and web applications
Handles interfaces where selectors are unreliable
Supports Flash, Silverlight, PDFs, images, and other non-standard elements
Use Cases
Automating legacy desktop applications with unstable selectors
Running RPA workflows inside virtual desktop environments
Interacting with PDFs, images, and other non-traditional UI elements
Agenticness: Guided Assistant

Executes tasks you assign, one step at a time, within narrow domains.

High evidence
Last evaluated: May 23, 2026

Dimension Breakdown

Action Capability
Autonomy
Adaptation
State & Memory
Safety

Categories

Pricing
  • Pricing not publicly available: The page promotes a free trial, but does not list public product pricing.
Details
AddedJanuary 16, 2026
RefreshedMarch 28, 2026
Agenticness
Quick Facts
DeploymentHybrid (cloud + self-hosted)
AutonomySemi-autonomous
Model supportSingle model
Open sourceNo
Team supportEnterprise
Pricing modelSubscription
Interfacegui, api, desktop
Sources
Similar tools

Related Tools

Yutori provides APIs for browser automation, research, and recurring scouting tasks. It is built for teams and developers who want agents that can take web actions, monitor changes, and return results through APIs and logs.

Free Tier
Paid
iOS
+4

Stagehand is an open-source browser automation SDK built for developers and LLM-powered agents. It combines code-based browser control with natural-language actions so you can build web workflows that are more resilient to page changes.

Open Source
Web
API
+4

Fireflies.ai connects to Webex to record, transcribe, and summarize meetings automatically. It also extracts action items and makes notes searchable and shareable across team tools.

Paid
Web
Voice
+4

browser-use helps you build agents that interact with websites, fill forms, and complete web tasks. It supports both a self-hosted open-source library and a cloud option for faster setup and scaling.

Open Source
Web Browsing
B2B
+3
Stay in the loop

Get the weekly agentic AI briefing

New tools, top picks, and trends — delivered every Thursday.

I use AI for: