
OpenAI has released GPT-5.4, a new flagship AI model that moves beyond simple chat responses and into real computer control.
The model—available in ChatGPT as “GPT-5.4 Thinking”—is also accessible through the OpenAI API and the company’s coding tool OpenAI Codex. A Windows version of Codex was recently introduced, allowing developers to run the AI model directly within coding workflows.
AI that can act, not just answer
The most notable feature of GPT-5.4 is its ability to interact with a computer through AI agent systems. Instead of simply explaining how to complete a task, the model can trigger actions such as clicking a mouse, typing commands, editing files, and analyzing screenshots.
These actions are executed by a local AI agent that receives instructions from GPT-5.4. In practice, that means the model can navigate software interfaces, open programs, and complete tasks within applications.
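The division of labor described above — the model decides, a local agent executes — can be pictured as a simple dispatch loop. The sketch below is purely illustrative: the action names and JSON-like structure are assumptions for the example, not OpenAI's actual computer-use protocol, and a real agent would drive the OS rather than append to a log.

```python
# Illustrative sketch of a local agent executing model-issued actions.
# The action schema ("click", "type", "screenshot") is hypothetical,
# not OpenAI's actual protocol; real handlers would control the OS.

def execute_action(action, log):
    """Carry out a single model-issued action (stubbed as log entries here)."""
    kind = action["type"]
    if kind == "click":
        log.append(f"click at ({action['x']}, {action['y']})")
    elif kind == "type":
        log.append(f"type text: {action['text']!r}")
    elif kind == "screenshot":
        log.append("capture screenshot for the model to analyze")
    else:
        raise ValueError(f"unknown action: {kind}")

def run_agent(actions):
    """Dispatch each model-issued action in order, returning an audit log."""
    log = []
    for action in actions:
        execute_action(action, log)
    return log

# A hypothetical instruction stream the model might emit:
model_actions = [
    {"type": "screenshot"},
    {"type": "click", "x": 120, "y": 48},
    {"type": "type", "text": "Q3 expenses"},
]
for entry in run_agent(model_actions):
    print(entry)
```

Keeping an audit log like this, whatever the real schema looks like, is a common design choice for agent systems: it lets users inspect exactly which actions were taken on their machine.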
For example, users could ask an AI agent powered by GPT-5.4 to open financial software like Quicken, navigate menus, and perform bookkeeping tasks automatically.
New reasoning and planning abilities
Alongside computer-use capabilities, GPT-5.4 also introduces improvements in reasoning efficiency. According to OpenAI, the model can solve problems using fewer tokens, which lowers computing costs for API users.
Another addition is the ability to generate an “upfront plan” before executing complex instructions. This allows users to review and adjust the strategy before the AI begins performing actions.
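The plan-then-execute flow can be thought of as a gating step: the model proposes its steps up front, and nothing runs until the user signs off. The sketch below is a minimal illustration of that pattern under assumed names — it is not OpenAI's API, and the approval callback stands in for whatever review interface a real client would provide.

```python
# Hypothetical plan-review gate: the model proposes steps up front,
# and no action executes until the reviewer approves the plan.

def review_plan(steps, approve):
    """Show the proposed steps; return them only if the reviewer approves."""
    for i, step in enumerate(steps, 1):
        print(f"{i}. {step}")
    if not approve(steps):
        return []          # rejected: nothing is handed to the executor
    return steps           # approved: steps may now be executed

# A hypothetical upfront plan the model might generate:
plan = [
    "Open the spreadsheet",
    "Sum the expenses column",
    "Save a summary report",
]
approved = review_plan(plan, approve=lambda s: len(s) <= 5)
print(f"{len(approved)} step(s) approved for execution")
```

The point of the gate is that approval happens before any action reaches the machine, which is what lets users adjust the strategy ahead of time rather than interrupt it mid-run.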
The model also includes improved spreadsheet capabilities, making it better suited for tasks involving data analysis and structured information.
Important limitations
Despite its expanded abilities, GPT-5.4 cannot directly control a user’s computer when accessed through the ChatGPT web or desktop interface. In those cases, it remains limited to the chat environment and integrated services such as cloud storage or creative tools.
Full computer control is only available through the OpenAI API or Codex environments where an AI agent system can execute commands on the local machine.
The release highlights a growing shift toward agentic AI, where language models collaborate with automated agents to perform real tasks on a computer rather than just providing instructions.

