OpenAI has unveiled “Operator,” an advanced AI agent designed to autonomously perform web-based tasks, marking a significant leap in task automation. Currently available as a research preview for ChatGPT Pro users in the U.S., Operator utilizes a sophisticated model known as the Computer-Using Agent (CUA), which integrates GPT-4’s vision capabilities with advanced reasoning. This enables it to interact with web elements such as buttons, menus, and text fields, effectively navigating and executing tasks across various websites.

Operator’s capabilities include booking trips, purchasing groceries, and managing expense reports. It interprets user commands to operate a web browser, automating both daily and professional tasks to enhance productivity. To ensure safety, Operator incorporates safeguards like user confirmations for critical actions and monitoring for prompt injections. OpenAI is also collaborating with companies such as Instacart, Uber, and eBay to enhance user accessibility on the Operator platform.

Initially, Operator is available to ChatGPT Pro subscribers in the U.S. at $200 per month, with plans to expand access to more users and regions in the future. This development signifies a shift towards practical, autonomous AI applications, highlighting the growing focus on AI agents capable of executing actions independently.