OpenAI has released GPT-5.4, its latest frontier artificial intelligence model designed to combine reasoning, coding, and agentic workflows into a single system. The model is rolling out across ChatGPT, the developer API, and OpenAI’s coding platform Codex, which has been launched for Windows just recently.
The company also introduced GPT-5.4 Pro, a higher-performance variant intended for complex professional workloads requiring deeper reasoning and higher accuracy.
“GPT-5.4 brings together the best of our recent advances in reasoning, coding, and agentic workflows into a single frontier model,” OpenAI said in its announcement.
The company added that the model is designed to help users complete complex professional tasks “accurately, effectively, and efficiently—delivering what you asked for with less back and forth.”
GPT-5.4 builds on recent advances from earlier models, integrating the coding capabilities of GPT-5.3-Codex with improvements in reasoning, tool usage, and professional productivity tasks. The model is designed to handle real-world work involving spreadsheets, documents, presentations, and complex multi-step workflows.
In ChatGPT, the GPT-5.4 Thinking mode introduces a planning feature that outlines the model’s reasoning approach before completing a response. According to OpenAI, the feature allows users to “adjust course mid-response while it’s working” to produce outputs more closely aligned with their needs. The update also improves deep web research capabilities and long-context reasoning for complex queries.
Benchmark results show measurable improvements over earlier models. On GDPval, a benchmark testing professional knowledge work across 44 occupations, GPT-5.4 matched or exceeded industry professionals in 83% of comparisons, compared with 70.9% for GPT-5.2.
Computer Use and Agentic Workflows
GPT-5.4 introduces native computer-use capabilities, allowing AI agents to interact with software environments and operating systems. The model can operate applications using keyboard and mouse actions, interpret screenshots, and execute workflows across multiple programs.
OpenAI described the release as “the first general-purpose model we’ve released with native, state-of-the-art computer-use capabilities.” The company said the feature enables developers to build agents capable of carrying out complex workflows across applications and websites.
On the OSWorld-Verified benchmark, which evaluates desktop navigation and computer interaction, GPT-5.4 achieved a 75% success rate. This represents a significant improvement over the 47.3% score recorded by GPT-5.2 and exceeds average human performance in the benchmark.
The model also improves visual reasoning and document parsing capabilities. On the MMMU-Pro benchmark measuring multimodal reasoning, GPT-5.4 reached 81.2% accuracy without tools. The model also introduces higher-fidelity image input support, enabling analysis of images up to 10.24 million pixels.
Coding and Developer Tools
GPT-5.4 incorporates the coding strengths of GPT-5.3-Codex while improving performance on longer-running development workflows. On the SWE-Bench Pro benchmark, which measures software engineering performance, GPT-5.4 achieved a score of 57.7% while maintaining lower latency compared with earlier models.
OpenAI also improved how models interact with external tools through a feature called tool search. Instead of loading every tool definition into the prompt, the system retrieves them dynamically when needed.
“This approach dramatically reduces the number of tokens required for tool-heavy workflows and preserves the cache, making requests faster and cheaper,” the company said.
Internal tests showed the feature reduced token usage by roughly 47% while maintaining the same accuracy.
In Codex, developers can also use a new experimental capability called Playwright Interactive, which allows the model to visually debug web applications and test software as it is being generated. OpenAI said the feature helps developers automate browser interactions and validate application behavior during development.
Availability and Pricing
GPT-5.4 is rolling out gradually across OpenAI’s platforms. In ChatGPT, GPT-5.4 Thinking replaces GPT-5.2 Thinking for Plus, Team, and Pro subscribers, while GPT-5.4 Pro is available to Pro and Enterprise users.
Developers can access the model through the API using the gpt-5.4 and gpt-5.4-pro endpoints. Pricing starts at $2.50 per million input tokens and $15 per million output tokens, reflecting the model’s expanded capabilities.
OpenAI said the release represents a major step toward more capable autonomous systems. By combining reasoning, coding, computer interaction, and tool use, GPT-5.4 is designed to support “more reliable agents, faster developer workflows, and higher-quality outputs across ChatGPT, the API, and Codex.”
The release also aligns with OpenAI’s broader push to expand its developer ecosystem. The company is reportedly developing a new code hosting platform that could compete with Microsoft-owned GitHub, reflecting a strategy to integrate AI models, coding agents, and developer infrastructure into a unified platform.