![OpenAI Launches 'Operator' AI Agent That Can Navigate the Web for You [Video] OpenAI Launches 'Operator' AI Agent That Can Navigate the Web for You [Video]](/images/news/96229/459875/459875-64.png)
OpenAI Launches 'Operator' AI Agent That Can Navigate the Web for You [Video]
Posted January 23, 2025 at 8:09pm by iClarified
OpenAI has announced the launch of 'Operator,' an AI agent that can use the web like a human.
Operator is one of our first agents, which are AIs capable of doing work for you independently—you give it a task and it will execute it. Operator is based on a new model we're calling "computer-using agent" (CUA). CUA combines GPT-4o's vision capabilities with advanced reasoning through reinforcement learning. It's trained to control a computer in the same way a human would—it looks at the screen, and uses a mouse and keyboard. The model still has limitations and will continue to evolve based on feedback. We plan to bring CUA to the API for developers soon.
CUA processes raw pixel data to understand what's happening on the screen and uses a virtual mouse and keyboard to complete actions. It can navigate multi-step tasks, handle errors, and adapt to unexpected changes. This enables CUA to act in a wide range of digital environments, performing tasks like filling out forms and navigating websites without needing specialized APIs.
Given a user's instruction, CUA operates through an iterative loop that integrates perception, reasoning, and action:
● Perception: Screenshots from the computer are added to the model's context, providing a visual snapshot of the computer's current state.
● Reasoning: CUA reasons through the next steps using chain-of-thought, taking into consideration current and past screenshots and actions. This inner monologue improves task performance by enabling the model to evaluate its observations, track intermediate steps, and adapt dynamically.
● Action: It performs the actions—clicking, scrolling, or typing—until it decides that the task is completed or user input is needed. While it handles most steps automatically, CUA seeks user confirmation for sensitive actions, such as entering login details or responding to CAPTCHA forms.
Operator is available now to Pro users in the U.S. at operator.chatgpt.com. Eventually, it will be part of ChatGPT and available more broadly.
Check out the videos below for more details!
Operator is one of our first agents, which are AIs capable of doing work for you independently—you give it a task and it will execute it. Operator is based on a new model we're calling "computer-using agent" (CUA). CUA combines GPT-4o's vision capabilities with advanced reasoning through reinforcement learning. It's trained to control a computer in the same way a human would—it looks at the screen, and uses a mouse and keyboard. The model still has limitations and will continue to evolve based on feedback. We plan to bring CUA to the API for developers soon.
CUA processes raw pixel data to understand what's happening on the screen and uses a virtual mouse and keyboard to complete actions. It can navigate multi-step tasks, handle errors, and adapt to unexpected changes. This enables CUA to act in a wide range of digital environments, performing tasks like filling out forms and navigating websites without needing specialized APIs.
Given a user's instruction, CUA operates through an iterative loop that integrates perception, reasoning, and action:
● Perception: Screenshots from the computer are added to the model's context, providing a visual snapshot of the computer's current state.
● Reasoning: CUA reasons through the next steps using chain-of-thought, taking into consideration current and past screenshots and actions. This inner monologue improves task performance by enabling the model to evaluate its observations, track intermediate steps, and adapt dynamically.
● Action: It performs the actions—clicking, scrolling, or typing—until it decides that the task is completed or user input is needed. While it handles most steps automatically, CUA seeks user confirmation for sensitive actions, such as entering login details or responding to CAPTCHA forms.
Operator is available now to Pro users in the U.S. at operator.chatgpt.com. Eventually, it will be part of ChatGPT and available more broadly.
Check out the videos below for more details!

![Apple Seeds tvOS 26.2 Release Candidate 2 to Developers [Download] Apple Seeds tvOS 26.2 Release Candidate 2 to Developers [Download]](/images/news/99251/99251/99251-160.jpg)
![Alan Dye's Departure Viewed as 'Best Personnel News at Apple in Decades' [Report] Alan Dye's Departure Viewed as 'Best Personnel News at Apple in Decades' [Report]](/images/news/99247/99247/99247-160.jpg)
![Apple Shares Trailer for 'Tehran' Season 3, Announces Season 4 Renewal [Video] Apple Shares Trailer for 'Tehran' Season 3, Announces Season 4 Renewal [Video]](/images/news/99244/99244/99244-160.jpg)






![Final Cyber Monday Deals: M4 MacBook Air for $749, Beats, Sonos, and More [List] Final Cyber Monday Deals: M4 MacBook Air for $749, Beats, Sonos, and More [List]](/images/news/99203/99203/99203-160.jpg)
![iPad mini 7 Falls to New All-Time Low of $349 [Cyber Monday 2025] iPad mini 7 Falls to New All-Time Low of $349 [Cyber Monday 2025]](/images/news/99197/99197/99197-160.jpg)
![Apple Watch Series 11 Drops to New All-Time Low Price of $329 [Cyber Monday 2025] Apple Watch Series 11 Drops to New All-Time Low Price of $329 [Cyber Monday 2025]](/images/news/99195/99195/99195-160.jpg)

![Apple Watch Ultra 3 Drops to New All-Time Low of $679 [Deal] Apple Watch Ultra 3 Drops to New All-Time Low of $679 [Deal]](/images/news/99189/99189/99189-160.jpg)