Hugging Face Unveils Free AI Agent Capable of Performing Digital Tasks Autonomously
Hugging Face has launched a new open-source AI tool called the Open Computer Agent, designed to autonomously perform various browser-based tasks. Released as a free demo, the tool is now publicly accessible through the Hugging Face website. The AI agent can navigate web platforms like Google Search, Google Maps, and even ticket booking sites to complete actions on behalf of the user — all without direct human input at each step. This development builds on Hugging Face’s smolagents framework, which was introduced earlier this year to facilitate lightweight autonomous agents.
Announced by Aymeric Roucher, Project Lead for Agents at Hugging Face, the Open Computer Agent is powered by a virtualized Linux environment and includes applications like Mozilla Firefox. This setup allows the AI agent to interact with the web as a human would — clicking, typing, and navigating through browser interfaces in real time. With its open-source foundation, the project invites developers, researchers, and enthusiasts to explore and expand its capabilities.
The intelligence behind the agent comes from the Qwen2-VL-72B, a powerful vision-language model capable of interpreting images and interfaces based on visual coordinates. This means the agent can “see” what’s on screen, make decisions, and perform follow-up actions like clicking buttons or typing search queries. Hugging Face’s smolagents library adds the logic layer that enables these autonomous interactions, forming the basis of the agentic workflow.
Users trying out the demo can instruct the agent to carry out tasks like finding directions using Google Maps. Once prompted, the agent launches a browser, navigates to the correct site, inputs the required information, and completes the task — all without the user having to touch their keyboard or mouse. With the release of the Open Computer Agent, Hugging Face continues its push toward more accessible and transparent AI tools, empowering the public to experiment with emerging forms of digital automation.



