Top AI Computer Use Agents (CUA) in the UK in 2026

Find and compare the best AI Computer Use Agents (CUA) in the UK in 2026

Sort:

UK AI Computer Use Agents (CUA) Reset Filters

Use the comparison tool below to compare the top AI Computer Use Agents (CUA) in the UK on the market. You can filter results by user reviews, pricing, features, platform, region, support options, integrations, and more.

1

ChatGPT

OpenAI
Free

9 Ratings

See Software

ChatGPT is a powerful AI-driven platform designed to help users work smarter by providing instant answers, creative ideas, and task automation. It supports a wide range of functions, including writing, editing, coding, research, and brainstorming. Users can interact with the platform through text or voice, making it accessible across different devices and workflows. ChatGPT can summarize meetings, analyze data, and generate insights to improve productivity and decision-making. It also offers creative support for tasks such as content creation, planning, and strategy development. A key feature is workspace agents, which allow users to automate entire workflows and repetitive tasks within their organization. These agents can run independently, integrate with tools, and handle actions like updating records, sending messages, or generating reports. Teams can build and share agents across their workspace to standardize processes and improve efficiency. Built-in controls ensure that automation remains secure and manageable with permissions and monitoring. ChatGPT helps reduce manual work while enabling teams to focus on higher-value activities. Overall, it enhances productivity by combining intelligent assistance with scalable automation.
2

OpenAI Codex

OpenAI
$20/month

1 Rating

See Software

Codex is an advanced AI coding assistant from OpenAI that helps developers streamline the entire software development process from start to finish. It functions as a powerful pair programmer capable of understanding repositories, writing code, and generating production-ready pull requests. The platform supports complex workflows, including debugging, refactoring, testing, and code reviews, all within a unified environment. One of its standout features is computer use, which allows Codex to operate your computer directly by seeing the screen, clicking, and typing within applications. This capability enables it to interact with tools and software that lack direct integrations or APIs. Codex also includes an in-app browser, allowing developers to iterate on web applications and provide precise instructions directly on live pages. It integrates with a wide range of tools and plugins, enhancing its ability to gather context and take action across workflows. The platform supports multi-agent collaboration, enabling parallel work across projects to accelerate development timelines. Codex also offers automation features that allow it to schedule and complete recurring tasks without manual input. With memory capabilities, it can remember preferences and past actions to improve future performance. Overall, Codex delivers a comprehensive AI-powered solution that combines coding, automation, and real-world computer interaction to boost developer efficiency.
3

BLACKBOX AI

BLACKBOX AI
Free

1 Rating

See Software

BLACKBOX AI is a powerful AI-driven platform that revolutionizes software development by providing a fully integrated AI Coding Agent with unique features such as voice interaction, direct GPU access, and remote parallel task processing. It simplifies complex coding tasks by converting Figma designs into production-ready code and transforming images into web apps with minimal manual effort. The platform supports seamless screen sharing within popular IDEs like VSCode, enhancing developer collaboration. Users can manage GitHub repositories remotely, running coding tasks entirely in the cloud for scalability and efficiency. BLACKBOX AI also enables app development with embedded PDF context, allowing the AI agent to understand and build around complex document data. Its image generation and editing tools offer creative flexibility alongside development features. The platform supports mobile device access, ensuring developers can work from anywhere. BLACKBOX AI aims to speed up the entire development lifecycle with automation and AI-enhanced workflows.
4

Manus AI

Manus AI
$20/month

1 Rating

See Software

Manus is a multifaceted general AI agent that effectively connects ideas with actions, allowing it to carry out various tasks in both work and personal environments. Whether it's handling data analysis, organizing travel itineraries, developing educational resources, or providing stock market insights, Manus empowers users to accomplish their goals while attending to other important matters. Its capabilities extend to conducting intricate research, crafting engaging presentations, and interpreting market dynamics, all aimed at enhancing productivity and streamlining efficiency. Furthermore, Manus produces precise, actionable insights, establishing itself as a vital resource for both professionals and everyday users aiming to simplify their workflows and achieve a greater understanding of their tasks. By integrating advanced technology with user-friendly functionality, Manus becomes an indispensable companion in navigating the complexities of modern life. Manus Desktop with the “My Computer” capability allows an AI agent to work directly on a user’s local device, extending its functionality beyond cloud-based environments. It uses command line access to read, modify, and organize files, as well as launch and control local applications and tools. This enables users to automate time-consuming tasks such as sorting files, batch renaming documents, and managing workflows with minimal effort. The platform also supports advanced development capabilities, allowing the AI to build, debug, and deploy applications using local programming environments like Python, Node.js, and Swift. By bridging cloud intelligence with local system resources, it enhances productivity and unlocks new automation possibilities.
5

WorkBeaver

WorkBeaver
$14.99 per month

1 Rating

See Software

WorkBeaver is an innovative automation platform powered by AI, designed to learn repetitive tasks by observing your actions once and then seamlessly replicating them across both desktop and web applications. With its unique "show & tell" method, there is no need for coding, integrating systems, or dragging and dropping workflows; simply perform the task you want automated, and WorkBeaver will create a robust digital model that adapts to changes in user interface elements. This versatile system is capable of managing tasks like data entry, CRM updates, invoicing, scheduling, form completion, and follow-ups, all without needing any prior API connections. Emphasizing security, it employs zero-knowledge protocols and end-to-end encryption to ensure that your workflow data remains accessible only to you. Since it functions at the visual level, WorkBeaver can interact with nearly any software displayed on your screen, including custom or proprietary applications, which significantly reduces the risk of disruptions due to interface updates. Moreover, its adaptability makes it a valuable tool for businesses looking to streamline processes across diverse platforms.
6

Browser Use

Browser Use

1 Rating

See Software

Browser Use is an open-source Python library designed to allow AI agents to interact fluidly with web browsers. By merging sophisticated AI functionalities with effective browser automation, it empowers agents to execute various tasks such as job applications, browsing websites, gathering data, and responding to messages on services like WhatsApp. This library is compatible with several large language models, including GPT-4, Claude 3, and Llama 2, making it easier to carry out intricate web activities through an intuitive interface. Among its notable features are visual recognition paired with HTML structure extraction for thorough web engagement, automated management of multiple tabs to streamline complex processes, and element tracking that leverages the extraction of XPaths from clicked elements to replicate specific actions performed by LLMs. Users can also implement custom functionalities, such as saving data to files, executing database queries, sending notifications, or incorporating human input. Furthermore, Browser Use is equipped with smart error handling and automatic recovery mechanisms, ensuring that automation workflows remain resilient and efficient. This combination of features makes Browser Use a powerful tool for developers looking to enhance web automation with AI capabilities.
7

ChatGPT Agent

OpenAI

1 Rating

See Software

ChatGPT Agents is a team-focused AI workspace that enables organizations to create, manage, and share custom agents for ongoing work. It helps teams keep projects and tasks moving continuously by giving users access to specialized AI assistants. Users can build agents tailored to specific roles, workflows, departments, or business processes. The platform includes options to invite team members, making collaboration easier across the organization. A shared team directory allows employees to browse agents created by others in the workspace. Users can also access a personal section for agents they have built themselves. The recently used area makes it simple to return to agents that support frequent tasks. ChatGPT Agents helps reduce repetitive manual work by making AI-powered assistance available whenever teams need it. It provides a centralized place for employees to find useful agents instead of starting from scratch each time. The feature is especially helpful for companies that want to standardize AI workflows across teams. By combining agent creation, team sharing, and workspace organization, ChatGPT Agents helps improve efficiency and collaboration.
8

Bytebot

Bytebot
Free

See Software

Bytebot is a cloud-based desktop agent system designed to bridge the gap between AI and real-world work. Instead of relying on APIs, Bytebot operates like a human by interacting directly with software through the UI. Each task runs on a clean, sandboxed computer environment for security and reliability. Bytebot can automate workflows across multiple applications in a single session. Users can pause, take control of the desktop, and resume the agent seamlessly. Every action is logged with before-and-after screenshots for auditing and debugging. The platform scales effortlessly from one agent to hundreds working in parallel. Bytebot supports secure logins, development workflows, and deep research tasks. It is open source and portable across local and cloud environments. Bytebot makes automation universally compatible with any software.
9

Accomplish

Accomplish AI
Free

See Software

Accomplish is an open-source AI desktop agent that helps users automate repetitive tasks and manage their digital workflows efficiently. It includes a built-in AI model, allowing users to start using the platform instantly without requiring an API key or account setup. The tool can perform a wide range of tasks, including reading files, generating documents, organizing folders, and executing browser-based actions. It runs entirely on the user’s local machine, ensuring that sensitive data stays private and secure. Users have full control over which files and folders the agent can access, and all actions require approval before execution. Accomplish can also connect to external AI services such as OpenAI, Google, or Anthropic for enhanced functionality. The platform is designed to act as a productivity tool rather than just a conversational assistant. It supports tasks like summarizing content, preparing reports, and automating file management workflows. Being open source, it allows users to customize, modify, and extend its capabilities. The system requires no subscription and offers a cost-free solution for AI-powered automation. By combining ease of use, privacy, and flexibility, Accomplish provides a practical tool for everyday productivity.
10

OWL

CAMEL-AI
Free

See Software

OWL (Optimized Workforce Learning) represents a cutting-edge system tailored for collaborative efforts among multiple agents in the automation of real-world tasks. Developed on the CAMEL-AI platform, OWL seeks to transform the way AI agents interact, leading to enhanced efficiency, natural communication, and greater resilience in task automation across diverse sectors. It stands out for its exceptional performance, achieving the top position among open-source frameworks on the GAIA benchmark with an impressive score of 58.18. Key features of OWL include real-time sharing of information, flexible task management, and seamless integration with a variety of tools and platforms, which collectively empower collaborative AI agents to tackle intricate tasks effectively. This innovative framework not only optimizes workflows but also paves the way for future advancements in AI-driven automation solutions.
11

Genspark

Genspark
Free

See Software

Genspark offers a powerful AI platform designed to assist in creating content and automating complex tasks, such as generating videos and images or conducting in-depth research. The Genspark Super Agent elevates the platform’s capabilities by handling a variety of personal and professional tasks, such as gift selection, travel planning, and restaurant reservations. Users can leverage the platform’s AI tools to produce creative content, analyze data, and automate daily processes with minimal effort, all powered by the versatile Super Agent.
12

Open Computer Agent

Hugging Face
Free

See Software

The Open Computer Agent is an AI assistant that operates within a web browser, created by Hugging Face, designed to automate tasks like web browsing, filling out forms, and retrieving information. Utilizing advanced vision-language models such as Qwen-VL, it mimics mouse and keyboard actions, allowing it to perform a variety of functions, from booking tickets to checking operating hours and navigating to locations. The agent can effectively identify and engage with various elements on web pages by analyzing their image coordinates. As part of the smolagents initiative by Hugging Face, it prioritizes both flexibility and transparency, providing an open-source framework for developers to explore, alter, and expand for specialized uses. Although still in the developmental phase and encountering certain obstacles, this agent signifies a pioneering shift toward AI functioning as a proactive digital assistant, adept at executing online tasks independently without requiring direct user involvement. Furthermore, its ongoing evolution may lead to even greater possibilities in automating complex web interactions in the future.
13

Simular

Simular
$19.99/month

See Software

Simular is a powerful macOS-native application, designed for users with macOS 15+ and Silicon chips, that streamlines digital tasks by automating actions on behalf of the user. The personal AI within Simular can reason and perform tasks across various websites, allowing users to quickly get results from a variety of sources. Security is a top priority, ensuring that all personal data remains private while still providing seamless interaction with your computer. With a simple interface and user-friendly design, Simular provides users with an efficient, automated way to interact with their computer, saving valuable time and effort.
14

Cua

Cua
$10/month

See Software

Cua is a unified infrastructure for building and deploying computer-use AI agents that interact directly with operating systems and applications. Instead of automating through integrations, Cua agents work visually—understanding interfaces, clicking UI elements, typing text, and navigating software naturally. The platform supports Linux, Windows, and macOS sandboxes with cloud-based scaling. Developers can run agents via a managed UI or integrate them programmatically using the Python Agent SDK. Cua also provides dataset generation, trajectory recording, and benchmarking tools to train and evaluate agents. With pay-as-you-go pricing and smart model routing, Cua balances performance and cost efficiently. It is fully open source and designed for production-grade automation.
15

OpenAdapt

OpenAdapt
Free

See Software

OpenAdapt is a free desktop automation software that learns to streamline your desktop and online tasks by observing your actions. It captures your screen, keyboard, mouse movements, and, if desired, audio from your microphone, all stored locally on your device. The tool then processes this recorded information using various algorithms to create instructions and prompts suitable for AI language models. Before any data is uploaded, it is thoroughly cleansed of Personally Identifiable Information (PII) and Protected Health Information (PHI), and you will have the opportunity to review the sanitized data to ensure it is free of sensitive details. We prioritize your privacy by not storing or collecting any personal data, files, or recordings of your processes. OpenAdapt also integrates robust security protocols in its architecture to effectively protect API keys and payment details, providing users with peace of mind while using the software. This commitment to security and privacy ensures that you can automate your workflows without compromising your personal information.
16

Gemini Computer Use

Google
Free

See Software

Gemini Computer Use is an agentic computer interaction capability built into Gemini 3.5 Flash. It enables developers and enterprises to create AI agents that can work across browser, desktop, and mobile environments by seeing interfaces, reasoning through tasks, and taking action. The capability was previously offered through a standalone Gemini 2.5 computer use model, but is now natively integrated into Gemini 3.5 Flash. This gives developers access to stronger performance for agentic computer use tasks while also combining with Gemini’s existing strengths in function calling, Search grounding, Maps grounding, and built-in tools. Gemini Computer Use is designed for long-horizon automation, continuous software testing, enterprise knowledge work, and workflows that span multiple professional applications. Developers can start building with the feature through the Gemini API or Gemini Enterprise Agent Platform. Google also provides a demo environment through Browserbase for testing the capability. Safety controls include targeted adversarial training for live-environment risks, optional explicit user confirmation for sensitive or irreversible actions, and automatic task stopping when indirect prompt injection is identified. Gemini Computer Use helps organizations build practical AI agents that can complete complex digital tasks while supporting sandboxing, human review, and strict access controls.
17

Lux

OpenAGI Foundation
Free

See Software

Lux introduces a breakthrough approach to AI by enabling models to control computers the same way humans do, interacting with interfaces visually and functionally rather than through traditional API calls. Through its three distinct modes—Tasker for procedural workflows, Actor for ultra-fast execution, and Thinker for complex problem-solving—developers can tailor how agents behave in different environments. Lux demonstrates its power through practical examples such as autonomous Amazon product scraping, automated software QA using Nuclear, and rapid financial data retrieval from Nasdaq. The platform is designed so developers can spin up real computer-use agents within minutes, supported by robust SDKs and pre-built templates. Its flexible architecture allows agents to understand ambiguous goals, strategize over long timelines, and complete multi-step tasks without manual intervention. This shift expands AI’s capabilities beyond reasoning into hands-on action, enabling automation across any digital interface. What was once a capability reserved for large tech labs is now accessible to any developer or team. Lux ultimately transforms AI from a passive assistant into an active operator capable of working directly inside software.
18

Proxy

Convergence
Free

See Software

Proxy is an advanced digital assistant powered by artificial intelligence, created by Convergence to autonomously manage a variety of tasks through natural language communication. Utilizing Large Meta Learning Models (LMLMs), Proxy is designed to continuously learn from user interactions, allowing it to adjust to specific workflows and preferences for a customized experience. It has the capability to handle intricate tasks on its own, including scheduling, email management, data entry, and more, which significantly boosts operational efficiency. Specifically designed for enterprise environments, Proxy prioritizes security, compliance, and scalability while integrating effortlessly with existing systems to support entire organizations. By automating repetitive tasks, Proxy not only enhances user productivity but also enables individuals to dedicate more time to strategic and innovative activities. As a result, it transforms the way professionals work, creating an environment where creativity and efficiency can thrive.
19

Agent S

Simular

See Software

Agent S is an open-source framework designed to power autonomous AI agents capable of interacting directly with computers. Through its Agent-Computer Interface (ACI), the system enables models to observe graphical user interfaces, interpret on-screen elements, and perform tasks as a human operator would. Compatible with macOS, Windows, and Linux, it supports cross-platform automation for real-world applications. The latest version, Agent S3, exceeds human-level benchmarks on OSWorld, showcasing exceptional performance in long, multi-step workflows. The framework leverages advanced foundation models like GPT-5 alongside specialized grounding models such as UI-TARS to convert visual data into structured, executable actions. Its architecture emphasizes precise control, task decomposition, and intelligent decision-making across dynamic desktop environments. Agent S can be deployed flexibly via command-line interface, software development kits, or cloud-based infrastructure. It connects with major AI providers including OpenAI, Anthropic, Gemini, Azure, and Hugging Face, offering model flexibility and extensibility. Optional local code execution allows for secure and customizable task handling. Combined with built-in reflection and compositional planning systems, Agent S delivers a research-driven and production-ready solution for building high-performance computer-use agents.
20

Surfer H

H Company
$0.13 per task

See Software

Surfer H, developed by H Company, is an innovative autonomous web-agent platform designed to seamlessly interpret and interact with user interfaces in a human-like manner by utilizing three distinct modular models: a policy model for task planning, a localizer model for visual identification of UI elements, and a validator model for outcome verification. This agent operates exclusively through the browser interface without relying on any specialized API connections, allowing it to perform actions such as scrolling, clicking, typing, and executing various real-world online tasks including hotel bookings, product comparison, and structured data extraction. When integrated with H Company’s open-weight vision-language models, Surfer H has demonstrated exceptional capabilities, achieving a remarkable 92.2% accuracy on the WebVoyager benchmark at a cost of approximately $0.13 per task, and can be deployed locally, through Docker, or on cloud platforms. Its versatile use cases encompass web automation, quality assurance testing that avoids fragile scripts, data collection, and the development of intelligent workflow agents that mimic human interactions with the web, thereby enhancing efficiency in digital tasks. Furthermore, the ability to adapt to a wide range of applications makes Surfer H an invaluable tool for businesses seeking to optimize their online operations.
21

Holo2

H Company

See Software

The Holo2 model family from H Company offers a blend of affordability and high performance in vision-language models specifically designed for computer-based agents that can navigate, localize user interface elements, and function across web, desktop, and mobile platforms. This new series, which is available in sizes of 4 billion, 8 billion, and 30 billion parameters, builds upon the foundations laid by the earlier Holo1 and Holo1.5 models, ensuring strong grounding in user interfaces while making substantial improvements to navigation abilities. Utilizing a mixture-of-experts (MoE) architecture, the Holo2 models activate only the necessary parameters to maximize operational efficiency. These models have been trained on carefully curated datasets focused on localization and agent functionality, allowing them to seamlessly replace their predecessors. They provide support for effortless inference in environments compatible with Qwen3-VL models and can be easily incorporated into agentic workflows such as Surfer 2. In benchmark evaluations, the Holo2-30B-A3B model demonstrated impressive results, achieving 66.1% accuracy on the ScreenSpot-Pro test and 76.1% on the OSWorld-G benchmark, thereby establishing itself as the leader in the UI localization sector. Additionally, the advancements in the Holo2 models make them a compelling choice for developers looking to enhance the efficiency and performance of their applications.
22

Skyvern

Skyvern

See Software

Skyvern is an advanced AI automation platform built to handle repetitive and time-consuming browser-based tasks. It leverages computer vision and natural language understanding to interact with websites just like a human would. Users can automate complex workflows using simple text-based instructions without writing custom scripts. Skyvern scales effortlessly, enabling organizations to run hundreds or even thousands of automated tasks at the same time through an API. The platform works across any website, including portals protected by CAPTCHAs, login requirements, and two-factor authentication. It also supports proxy networks for precise geographic targeting. Explainable AI summaries provide full visibility into every action taken during each run. Data extracted from workflows can be exported in structured formats such as JSON or CSV. Skyvern is trusted by thousands of users across multiple industries for high-volume automation. It allows teams to replace manual browser work with reliable, scalable AI-driven processes.
23

Claude Computer Use

Anthropic

See Software

Claude Computer Use is an advanced capability that allows Claude to operate directly on your computer to perform tasks across applications and files. It works by interacting with your screen, enabling actions like clicking, typing, opening programs, and navigating workflows without requiring manual input. The system prioritizes efficiency by first using direct connectors, then browser automation, and finally full screen interaction when necessary. Claude can handle tasks such as generating reports from local files, filling spreadsheets, testing applications, and navigating internal tools. Users retain control through permission prompts that must be approved before Claude accesses any application. The feature includes built-in safeguards designed to prevent risky actions and flag potential issues. It also captures screenshots to understand the interface, allowing it to adapt to different applications. However, users are advised to avoid exposing sensitive information while using the feature. Claude Computer Use is currently available in research preview and continues to evolve. Overall, it transforms Claude into an active assistant capable of executing real tasks on your machine.
24

Ace

General Agents

See Software

Ace functions as a computer autopilot, executing various tasks on your desktop by utilizing your mouse and keyboard. It surpasses other models in a comprehensive set of computer-related tasks, which we are choosing to open-source. We are offering the ace-control models to a select group of partners via our developer platform. Mimicking human behavior, Ace carries out mouse clicks and keystrokes by responding to on-screen prompts, having been meticulously trained by our team of software engineers and industry professionals on a dataset encompassing more than a million tasks. Its superior performance in our suite of computer use tasks sets it apart from competitors. In addition to providing these capabilities to partners, we believe Ace can significantly streamline productivity for users everywhere. Thus, Ace stands out as an innovative solution for automating desktop operations.
25

Holo3.1

H Company

See Software

Holo3.1 represents H Company’s advanced suite of swift and localized computer-use agents designed for seamless operation across web, desktop, and mobile platforms, while ensuring better integration within various agent frameworks and deployment targets. Drawing from the Qwen family, Holo3.1 significantly enhances reliability in the diverse environments where these agents are utilized, tackling the distribution changes that arise on mobile devices, alternative agent frameworks, and varied execution environments. The latest version broadens Holo3’s functionality, going beyond mere browser and desktop control, with notable advancements in mobile automation; for instance, the performance in AndroidWorld has surged from 67% to 79.3% for the 35B-A3B model, while the smaller 4B and 9B variants have also shown improvements from 58% to 71%. In addition, Holo3.1 brings forth native support for function-calling protocols alongside structured JSON outputs, which aids teams in integrating the model into third-party agent ecosystems, achieving almost identical performance between function-calling and native execution. This release marks a significant step in enhancing the versatility and effectiveness of computer-use agents across multiple platforms.