OpenAI’s new ChatGPT Agent can control an entire computer and do tasks for you


OpenAI is going all-in on the most-hyped trend in AI right now: AI agents, or tools that go a step beyond chatbots to complete complex, multi-step tasks on a user’s behalf. The company on Thursday debuted ChatGPT Agent, which it bills as a tool that can complete work on your behalf using its own “virtual computer.”
In a briefing and demo with The Verge, Yash Kumar and Isa Fulford — product lead and research lead on ChatGPT Agent, respectively — said it’s powered by a new model that OpenAI developed specifically for the product. The company said the new tool can perform tasks like looking at a user’s calendar to brief them on upcoming client meetings, planning and purchasing ingredients to make a family breakfast, and creating a slide deck based on its analysis of competing companies.
The model behind ChatGPT Agent, which has no specific name, was trained on complex tasks that require multiple tools — like a text browser, visual browser, and terminal where users can import their own data — via reinforcement learning, the same technique used for all of OpenAI’s reasoning models. OpenAI said that ChatGPT Agent combines the capabilities of both Operator and Deep Research, two of its existing AI tools.
To develop the new tool, the company combined the teams behind both Operator and Deep Research into one unified team. Kumar and Fulford told The Verge that the new team is made up of between 20 and 35 people across product and research.
In the demo, Kumar and Fulford demonstrated potential use cases for ChatGPT Agent, like asking it to plan a date night by connecting to Google Calendar to see when the user has a free evening, and then cross-referencing OpenTable to find openings at certain types of restaurants. They also showed how a user could interrupt the process by adding, say, another restaurant category to search for. Another demonstration showed how ChatGPT Agent could generate a research report on the rise of Labubus versus Beanie Babies.
Fulford said she enjoyed using it for online shopping because the combination of tech behind Deep Research and Operator worked better and was more thorough than trying the process solely using Operator. And Kumar said he had begun using ChatGPT Agent to automate small parts of his life, like requesting new office parking at OpenAI every Thursday instead of showing up Monday having forgotten to request it with nowhere to park.
Kumar said that since ChatGPT Agent has access to “an entire computer” instead of just a browser, they’ve “enhanced the toolset quite a bit.”
According to the demo, though, the tool can be a bit slow. When asked about latency, Kumar said their team is more focused on “optimizing for hard tasks” and that users aren’t meant to sit and watch ChatGPT Agent work.
“Even if it takes 15 minutes, half an hour, it’s quite a big speed-up compared to how long it would take you to do it,” Fulford said, adding that OpenAI’s search team is more focused on low-latency use cases. “It’s one of those things where you can kick something off in the background and then come back to it.”
Before ChatGPT Agent does anything “irreversible,” like sending an email or making a booking, it asks for permission first, Fulford said.
Since the model behind the tool has increased capabilities, OpenAI said it has activated the safeguards it created for “high biological and chemical capabilities,” even though the company said it does not have “direct evidence that the model could meaningfully help a novice create severe biological or chemical harm” in the form of weapons. Anthropic in May activated similar safeguards for its launch of one of its Claude models, Opus 4.
When asked about whether the tool is permitted to perform financial transactions, Kumar said those actions have been restricted “for now,” and that there’s an additional protection called Watch Mode, wherein if a user navigates to a certain category of webpages, like financial sites, they must not navigate away from the tab ChatGPT Agent is operating in or the tool will stop working.
OpenAI will start rolling out the tool today to Pro, Plus, and Team users — pick “agent mode” in the tools menu or type “/agent” to access it — and the company said it will make it available to ChatGPT Enterprise and Education users later this summer. There’s no rollout timeline yet for the European Economic Area and Switzerland.
The concept of AI agents has been a buzzworthy trend in the industry for years. The ideal developers are working toward is something like Iron Man’s J.A.R.V.I.S., a tool that can perform specific job functions, check people’s calendars for the best time to schedule an event, purchase a gift based on a friend’s preferences, and more, but at the moment, they’re somewhat limited to assisting with coding and compiling research reports.
The term “AI agent” became more common to investors and tech executives in 2023 and quickly picked up speed, especially after fintech company Klarna announced in February 2024 that in just one month of operation, its own AI agent had handled two-thirds of its customer service chats — the equivalent of 700 full-time human workers. From there, executives at Amazon, Meta, Google, and more started mentioning their AI agent goals on earnings call after earnings call. And since then, AI companies have been strategically hiring to reach those goals: Google, for instance, last week hired Windsurf’s CEO, co-founder and some R&D team members to help further its agentic AI projects.
OpenAI’s debut of ChatGPT Agent follows its January release of Operator, which the company billed as “an agent that can go to the web to perform tasks for you” since it was trained to be able to handle the internet’s buttons, text fields and more. It’s also part of a larger trend in AI, as companies large and small chase AI agents that will capture the attention of consumers and ideally become habits. Last October, Anthropic, the Amazon-backed AI startup behind Claude, released a similar tool called “Computer Use,” which it billed as a tool that could use a computer the same way a human can in order to complete tasks on a user’s behalf. Multiple AI companies, including OpenAI, Google and Perplexity, also offer an AI tool that all three have dubbed Deep Research, denoting an AI agent that can write sizable analyses and research reports on anything a user wants.
What's Your Reaction?






