![OpenAI Launches 'Operator' AI Agent That Can Navigate the Web for You [Video] OpenAI Launches 'Operator' AI Agent That Can Navigate the Web for You [Video]](/images/news/96229/459875/459875-64.png)
OpenAI Launches 'Operator' AI Agent That Can Navigate the Web for You [Video]
Posted January 23, 2025 at 8:09pm by
Shalom Levytam
OpenAI has announced the launch of 'Operator,' an AI agent that can use the web like a human.
Operator is one of our first agents, which are AIs capable of doing work for you independently—you give it a task and it will execute it. Operator is based on a new model we're calling "computer-using agent" (CUA). CUA combines GPT-4o's vision capabilities with advanced reasoning through reinforcement learning. It's trained to control a computer in the same way a human would—it looks at the screen, and uses a mouse and keyboard. The model still has limitations and will continue to evolve based on feedback. We plan to bring CUA to the API for developers soon.
CUA processes raw pixel data to understand what's happening on the screen and uses a virtual mouse and keyboard to complete actions. It can navigate multi-step tasks, handle errors, and adapt to unexpected changes. This enables CUA to act in a wide range of digital environments, performing tasks like filling out forms and navigating websites without needing specialized APIs.
Given a user's instruction, CUA operates through an iterative loop that integrates perception, reasoning, and action:
● Perception: Screenshots from the computer are added to the model's context, providing a visual snapshot of the computer's current state.
● Reasoning: CUA reasons through the next steps using chain-of-thought, taking into consideration current and past screenshots and actions. This inner monologue improves task performance by enabling the model to evaluate its observations, track intermediate steps, and adapt dynamically.
● Action: It performs the actions—clicking, scrolling, or typing—until it decides that the task is completed or user input is needed. While it handles most steps automatically, CUA seeks user confirmation for sensitive actions, such as entering login details or responding to CAPTCHA forms.
Operator is available now to Pro users in the U.S. at operator.chatgpt.com. Eventually, it will be part of ChatGPT and available more broadly.
Check out the videos below for more details!
Operator is one of our first agents, which are AIs capable of doing work for you independently—you give it a task and it will execute it. Operator is based on a new model we're calling "computer-using agent" (CUA). CUA combines GPT-4o's vision capabilities with advanced reasoning through reinforcement learning. It's trained to control a computer in the same way a human would—it looks at the screen, and uses a mouse and keyboard. The model still has limitations and will continue to evolve based on feedback. We plan to bring CUA to the API for developers soon.
CUA processes raw pixel data to understand what's happening on the screen and uses a virtual mouse and keyboard to complete actions. It can navigate multi-step tasks, handle errors, and adapt to unexpected changes. This enables CUA to act in a wide range of digital environments, performing tasks like filling out forms and navigating websites without needing specialized APIs.
Given a user's instruction, CUA operates through an iterative loop that integrates perception, reasoning, and action:
● Perception: Screenshots from the computer are added to the model's context, providing a visual snapshot of the computer's current state.
● Reasoning: CUA reasons through the next steps using chain-of-thought, taking into consideration current and past screenshots and actions. This inner monologue improves task performance by enabling the model to evaluate its observations, track intermediate steps, and adapt dynamically.
● Action: It performs the actions—clicking, scrolling, or typing—until it decides that the task is completed or user input is needed. While it handles most steps automatically, CUA seeks user confirmation for sensitive actions, such as entering login details or responding to CAPTCHA forms.
Operator is available now to Pro users in the U.S. at operator.chatgpt.com. Eventually, it will be part of ChatGPT and available more broadly.
Check out the videos below for more details!
![Apple Could Ship 25M MacBooks in 2026 as PC Market Declines [Kuo] Apple Could Ship 25M MacBooks in 2026 as PC Market Declines [Kuo]](/images/news/100186/100186/100186-160.jpg)
![MacBook Air M5 Reviews: Apple's Best Laptop for Most People Gets Faster [Video] MacBook Air M5 Reviews: Apple's Best Laptop for Most People Gets Faster [Video]](/images/news/100184/100184/100184-160.jpg)
![Apple Reportedly Abandons Plans for Clamshell Foldable iPhone [Rumor] Apple Reportedly Abandons Plans for Clamshell Foldable iPhone [Rumor]](/images/news/100182/100182/100182-160.jpg)

![MacBook Neo Reviews: Apple's $599 Laptop Delivers Big Value [Video] MacBook Neo Reviews: Apple's $599 Laptop Delivers Big Value [Video]](/images/news/100178/100178/100178-160.jpg)





![Apple's New M4 iPad Air is Already on Sale for $559 Ahead of Launch [Deal] Apple's New M4 iPad Air is Already on Sale for $559 Ahead of Launch [Deal]](/images/news/100174/100174/100174-160.jpg)
![Apple AirPods 4 (ANC) Back On Sale for $119 [Deal] Apple AirPods 4 (ANC) Back On Sale for $119 [Deal]](/images/news/100103/100103/100103-160.jpg)
![Apple's Official iPhone Crossbody Strap Drops to Just $23.71 (60% Off) [Deal] Apple's Official iPhone Crossbody Strap Drops to Just $23.71 (60% Off) [Deal]](/images/news/100069/100069/100069-160.jpg)
![Apple Watch Series 11 Now $299, 46mm Model Also at Record Low [Deal] Apple Watch Series 11 Now $299, 46mm Model Also at Record Low [Deal]](/images/news/99986/99986/99986-160.jpg)
![Expired: Save $900 on Apple's 11-Inch M4 iPad Pro 2TB With Nano-Texture Glass [Deal] Expired: Save $900 on Apple's 11-Inch M4 iPad Pro 2TB With Nano-Texture Glass [Deal]](/images/news/99982/99982/99982-160.jpg)