OpenAI's Responses API overhaul with WebSockets boosts agentic workflow speeds by 40%, enabling GPT-5.3 to hit 1,000 TPS. Here's how they did it. (Read More)OpenAI's Responses API overhaul with WebSockets boosts agentic workflow speeds by 40%, enabling GPT-5.3 to hit 1,000 TPS. Here's how they did it. (Read More)

OpenAI Speeds Up Codex Workflows with WebSockets in Responses API

2026/04/29 00:33
3 min read
For feedback or concerns regarding this content, please contact us at [email protected]

OpenAI Speeds Up Codex Workflows with WebSockets in Responses API

Terrill Dicki Apr 28, 2026 16:33

OpenAI's Responses API overhaul with WebSockets boosts agentic workflow speeds by 40%, enabling GPT-5.3 to hit 1,000 TPS. Here's how they did it.

OpenAI Speeds Up Codex Workflows with WebSockets in Responses API

OpenAI has introduced WebSocket support to its Responses API, slashing latency in Codex agent workflows by 40%. This upgrade allows the latest GPT-5.3-Codex-Spark model to achieve speeds of over 1,000 tokens per second (TPS), a massive leap from the 65 TPS of earlier versions. These advancements are central to enabling faster and more efficient AI-driven coding assistance, a key feature of Codex’s functionality.

Codex, OpenAI's AI-powered coding assistant, operates by scanning codebases, building context, making edits, and running tests in rapid succession. Historically, these tasks involved numerous back-and-forth API calls, which added significant latency. The issue became more pronounced as model inference speeds improved, exposing API overhead as the bottleneck. Addressing this, OpenAI reengineered the Responses API, introducing a persistent WebSocket connection to minimize redundant processing.

Previously, the Responses API treated each request as independent, requiring full conversation histories to be sent and processed with every call. The new WebSocket implementation caches reusable state in memory, allowing only incremental changes to be sent during follow-up requests. This streamlined approach reduces overhead and makes better use of both server-side GPUs and client-side resources.

Key optimizations included:

  • Caching tokens and model configurations to bypass repetitive tokenization.
  • Eliminating unnecessary network hops to streamline API-to-inference communication.
  • Enhancing safety classifiers to process only new inputs instead of entire histories.
  • Overlapping non-blocking processes like billing with subsequent requests.

The decision to adopt WebSockets over alternatives like gRPC bidirectional streaming hinged on simplicity and minimal disruption to developers. The WebSocket mode preserved the familiar API interaction patterns, allowing developers to integrate it seamlessly without major changes to their workflows.

The results have been striking. In alpha tests with key partners, startups reported up to a 40% improvement in agentic workflow speeds. Codex users, including those leveraging platforms like Vercel and Cursor, saw significant reductions in latency. Additionally, Codex achieved its target of 1,000 TPS for GPT-5.3-Codex-Spark and even recorded bursts of up to 4,000 TPS in production traffic.

For AI developers and enterprises, these changes underscore the growing importance of optimizing API infrastructures to keep pace with accelerating model inference speeds. As OpenAI’s advancements demonstrate, improving the systems surrounding AI models is as critical as enhancing the models themselves. By addressing API bottlenecks, OpenAI ensures that performance gains from faster inference hardware, like the Cerebras GPUs used by GPT-5.3, are fully realized by end users.

Looking ahead, the implementation of WebSocket support in the Responses API sets a new benchmark for speed in AI agent workflows. It positions OpenAI strongly as demand for high-performance, real-time AI capabilities continues to grow across industries. With the API updates now in full production, developers can expect smoother, faster integrations—an essential step as AI tools become foundational to coding, productivity, and beyond.

Image source: Shutterstock
  • openai
  • websockets
  • api
  • codex
  • gpt-5.3
Market Opportunity
CodexField Logo
CodexField Price(CODEX)
$17.1385
$17.1385$17.1385
+0.29%
USD
CodexField (CODEX) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact [email protected] for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.
Tags:

You May Also Like

The Chemistry of UV Resistance: How Titanium Dioxide Protects Against the California Sun

The Chemistry of UV Resistance: How Titanium Dioxide Protects Against the California Sun

Homeowners considering synthetic boundary systems frequently voice a singular, pervasive concern: “Will the material turn yellow and brittle after a few years in
Share
Techbullion2026/04/02 18:06
How to earn from cloud mining: IeByte’s upgraded auto-cloud mining platform unlocks genuine passive earnings

How to earn from cloud mining: IeByte’s upgraded auto-cloud mining platform unlocks genuine passive earnings

The post How to earn from cloud mining: IeByte’s upgraded auto-cloud mining platform unlocks genuine passive earnings appeared on BitcoinEthereumNews.com. contributor Posted: September 17, 2025 As digital assets continue to reshape global finance, cloud mining has become one of the most effective ways for investors to generate stable passive income. Addressing the growing demand for simplicity, security, and profitability, IeByte has officially upgraded its fully automated cloud mining platform, empowering both beginners and experienced investors to earn Bitcoin, Dogecoin, and other mainstream cryptocurrencies without the need for hardware or technical expertise. Why cloud mining in 2025? Traditional crypto mining requires expensive hardware, high electricity costs, and constant maintenance. In 2025, with blockchain networks becoming more competitive, these barriers have grown even higher. Cloud mining solves this by allowing users to lease professional mining power remotely, eliminating the upfront costs and complexity. IeByte stands at the forefront of this transformation, offering investors a transparent and seamless path to daily earnings. IeByte’s upgraded auto-cloud mining platform With its latest upgrade, IeByte introduces: Full Automation: Mining contracts can be activated in just one click, with all processes handled by IeByte’s servers. Enhanced Security: Bank-grade encryption, cold wallets, and real-time monitoring protect every transaction. Scalable Options: From starter packages to high-level investment contracts, investors can choose the plan that matches their goals. Global Reach: Already trusted by users in over 100 countries. Mining contracts for 2025 IeByte offers a wide range of contracts tailored for every investor level. From entry-level plans with daily returns to premium high-yield packages, the platform ensures maximum accessibility. Contract Type Duration Price Daily Reward Total Earnings (Principal + Profit) Starter Contract 1 Day $200 $6 $200 + $6 + $10 bonus Bronze Basic Contract 2 Days $500 $13.5 $500 + $27 Bronze Basic Contract 3 Days $1,200 $36 $1,200 + $108 Silver Advanced Contract 1 Day $5,000 $175 $5,000 + $175 Silver Advanced Contract 2 Days $8,000 $320 $8,000 + $640 Silver…
Share
BitcoinEthereumNews2025/09/17 23:48
Tillis Stablecoin Agreement Accelerates Clarity Act

Tillis Stablecoin Agreement Accelerates Clarity Act

The post Tillis Stablecoin Agreement Accelerates Clarity Act appeared on BitcoinEthereumNews.com. Senator Thom Tillis was at the center of negotiations with bankers
Share
BitcoinEthereumNews2026/04/30 16:29