OpenAI's ChatGPT agent can control your PC to do tasks on your behalf

As an Amazon Associate I earn from qualifying purchases.

(Image credit: wildpixel/Getty Images)

OpenAI has actually introduced ChatGPT representative, an upgrade to its flagship expert system( AI)design that equips it with a virtual computer system and an incorporated toolkit.

These brand-new tools permit the representative to perform complex, multi-step jobs that previous models of ChatGPT were incapable of– managing your computer system and finishing jobs for you.

This more effective variation, which is still extremely based on human input and guidance, shown up soon before Mark Zuckerberg revealed that Meta scientists had actually observed their own AI designs revealing indications of independent self-improvement. It likewise introduced soon before OpenAI released GPT-5– the most recent variation of OpenAI’s chatbot.With ChatGPT representative, users can now ask the big language design (LLM) to not just carry out analysis or collect information, however to act upon that information, OpenAI agents stated in a declaration

You might command the representative to examine your calendar and quick you on upcoming occasions and pointers, or to study a corpus of information and summarize it in a pithy run-through or as a slide deck. While a standard LLM might look for and offer dishes for a Japanese-style breakfast, ChatGPT representative might totally prepare and buy components for the very same breakfast for a particular variety of visitors.

The brand-new design, while extremely capable, still deals with a number of constraints. Like all AI designs, its spatial thinking is weak, so it has problem with jobs like preparing physical paths. It likewise does not have real consistent memory, processing details in the minute without reputable recall or the capability to referral previous interactions beyond instant context.

ChatGPT representative does reveal considerable enhancements in OpenAI’s benchmarking. On Mankind’s Last Examan AI standard that assesses a design’s capability to react to expert-level concerns throughout a variety of disciplines, it more than doubled the precision portion (41.6%) versus OpenAI o3 without any tools geared up (20.3%).

Get the world’s most remarkable discoveries provided directly to your inbox.

Related: OpenAI’s ‘most intelligent’ AI design was clearly informed to close down– and it declined

It likewise carried out better than other OpenAI tools, along with a variation of itself that did not have tools like a web browser and virtual computer system. On the planet’s hardest understood mathematics criteria, FrontierMath, ChatGPT representative and its enhance of tools once again exceeded previous designs by a broad margin.

The representative is developed on 3 pillars stemmed from previous OpenAI items. One leg is ‘Operator’, a representative that would utilize its own virtual internet browser to plumb the web for users. The 2nd is ‘deep research study’, constructed to comb through and manufacture big quantities of information. The last piece of the puzzle is previous variations of ChatGPT itself, which mastered conversational fluency and discussion.

“In essence, it can autonomously browse the web, generate code, create files, and so on, all under human supervision,” stated Kofi Nyarkoa teacher at Morgan State University and director of the Data Engineering and Predictive Analytics (DEPA) Research Lab.

Nyarko fasted to highlight, nevertheless, that the brand-new representative is still not self-governing. “Hallucinations, user interface fragility, or misinterpretation can lead to errors. Built-in safeguards, like permission prompts and interruptibility, are essential but not sufficient to eliminate risk entirely.”

The risk of advancing AI OpenAI has itself acknowledged the threat of the brand-new representative and its increased autonomy. Business agents specified that ChatGPT representative has “high biological and chemical capabilities,” which they declare possibly permit it to help in the development of chemical or biological weapons.

Compared to existing resources, like a chem laboratory and book, an AI representative represents what biosecurity specialists call a “ability escalation path.” AI can draw on countless resources and synthesize the data in them instantly, merge knowledge across scientific disciplines, provide iterative troubleshooting like an expert mentor, navigate supplier websites, fill out order forms, and even help bypass basic verification checks.

With its virtual computer, the agent can also autonomously interact with files, websites, and online tools in ways that empower it to do much more potential harm if misused. The opportunity for data breaches or data manipulation, as well as for misaligned behavior like financial fraud, is amplified in the event of a prompt injection attack or hijacking.

As Nyarko pointed out, these risks are in addition to those implicit in traditional AI models and LLMs.

“There are more comprehensive issues for AI representatives as an entire, like how representatives running autonomously can enhance mistakes, present predispositions from public information, make complex liability structures, and accidentally foster mental reliance,” he said.

In action to the brand-new hazards that a more agential design presents, OpenAI engineers have actually likewise enhanced a variety of safeguards, business agents stated in the declaration.

These consist of danger modeling, dual-use rejection training– where a design is taught to decline hazardous demands around information that might have either useful or destructive usage– bug bounty programs, and professional red-teaming– evaluating weak points by assaulting the system yourself– concentrated on biodefense. A threat management evaluation performed in July of 2025 by SaferAI, a safety-focused non-profit, called OpenAI’s threat management policies Weak, granting them a rating of 33% out of a possible 100%. OpenAI likewise just scored a C grade on the AI Safety Index put together by the Future of Life Institute, a leading AI security company.

Alan is an independent tech and home entertainment reporter who focuses on computer systems, laptop computers, and computer game. He’s formerly composed for websites like PC Gamer, GamesRadar, and Rolling Stone. If you require guidance on tech, or assist discovering the very best tech offers, Alan is your male.

Find out more

As an Amazon Associate I earn from qualifying purchases.