February 19, 2025
OpenAI Launches 'Operator' AI Agent That Can Navigate the Web for You [Video]

OpenAI Launches 'Operator' AI Agent That Can Navigate the Web for You [Video]

Posted January 23, 2025 at 8:09pm by iClarified
OpenAI has announced the launch of 'Operator,' an AI agent that can use the web like a human.

Operator is one of our first agents, which are AIs capable of doing work for you independently—you give it a task and it will execute it. Operator is based on a new model we're calling "computer-using agent" (CUA). CUA combines GPT-4o's vision capabilities with advanced reasoning through reinforcement learning. It's trained to control a computer in the same way a human would—it looks at the screen, and uses a mouse and keyboard. The model still has limitations and will continue to evolve based on feedback. We plan to bring CUA to the API for developers soon.




CUA processes raw pixel data to understand what's happening on the screen and uses a virtual mouse and keyboard to complete actions. It can navigate multi-step tasks, handle errors, and adapt to unexpected changes. This enables CUA to act in a wide range of digital environments, performing tasks like filling out forms and navigating websites without needing specialized APIs.

Given a user's instruction, CUA operates through an iterative loop that integrates perception, reasoning, and action:

● Perception: Screenshots from the computer are added to the model's context, providing a visual snapshot of the computer's current state.

● Reasoning: CUA reasons through the next steps using chain-of-thought, taking into consideration current and past screenshots and actions. This inner monologue improves task performance by enabling the model to evaluate its observations, track intermediate steps, and adapt dynamically.


● Action: It performs the actions—clicking, scrolling, or typing—until it decides that the task is completed or user input is needed. While it handles most steps automatically, CUA seeks user confirmation for sensitive actions, such as entering login details or responding to CAPTCHA forms.

Operator is available now to Pro users in the U.S. at operator.chatgpt.com. Eventually, it will be part of ChatGPT and available more broadly.

Check out the videos below for more details!






Add Comment
Would you like to be notified when someone replies or adds a new comment?
Yes (All Threads)
Yes (This Thread Only)
No
iClarified Icon
Notifications
Would you like to be notified when we post a new Apple news article or tutorial?
Yes
No
Comments
You must login or register to add a comment...
Recent. Read the latest Apple News.
RECENT
Tutorials. Help is here.
TUTORIALS
Where to Download macOS Sonoma
Where to Download macOS Ventura
AppleTV Firmware Download Locations
Where To Download iPad Firmware Files From
Where To Download iPhone Firmware Files From
Deals. Save on Apple devices and accessories.
DEALS