Ars Technica→ original

OpenAI unveiled an ultra-fast coding model to sidestep Nvidia chips

OpenAI announced GPT-5.3-Codex-Spark, a specialized language model for writing code that delivers a 15-fold performance gain. The key achievement was optimizing

AI-processed from Ars Technica; edited by Hamidun News
OpenAI unveiled an ultra-fast coding model to sidestep Nvidia chips
Source: Ars Technica. Collage: Hamidun News.
◐ Listen to article

OpenAI presented GPT-5.3-Codex-Spark — a specialized model for code generation that works 15 times faster than the previous version. But the real news is not so much about the speed, but about how OpenAI achieved it: the company optimized the architecture for non-standard plate-sized chips, effectively bypassing the critical dependence on scarce Nvidia graphics processors. This move demonstrates OpenAI's desire to control the entire stack — from software to physical hardware — and simultaneously highlights the growing competition in the specialized AI processor market.

Over the past two years, Nvidia has become a bottleneck for all major AI companies. GPUs H100 and A100, which became the standard for training large language models, are in huge demand, but supply lags behind. Prices are steep, shipments are delayed, and geopolitical restrictions complicate everything further. OpenAI, Meta, Google — they all are looking for ways to either reduce dependence on Nvidia or optimize the use of available resources. Elon Musk's x.AI company recently announced its own chip, Apple is doing the same with its Neural Engine, and Amazon is investing in Trainium and Inferentia processors. The market is fragmenting, and OpenAI decided not to fall behind.

GPT-5.3-Codex-Spark is a targeted development. If universal models like GPT-4o must handle multiple tasks, then Codex is focused on one thing: writing code as quickly and accurately as possible. This allowed OpenAI engineers to conduct significant optimization. The architecture was reworked for coding specifics, unnecessary computational layers were removed, tensor operations were restructured. The result is the same quality output with minimal computational costs. But the main change concerned the hardware layer. OpenAI developed or rethought non-standard chips that work with this architecture natively. These processors are more compact than standard GPUs, integrate more easily, require less power and cooling — all things that make data centers cheaper and more resilient to failures.

The 15-fold performance increase is impressive, but the number requires context. It likely does not mean that the model performs direct computations 15 times faster. It is about end-to-end code generation time — from user request to ready output. Here improvements add up: optimized architecture, specialized chips, reworked processing pipeline. This is a typical approach for the AI industry: the most efficient algorithm on the most efficient hardware gives the maximum result.

For the industry, this means several things at once. First, pressure on Nvidia increases. If other companies begin to successfully use their own chips, GPU demand may fall. Second, it reinforces the trend toward vertical integration in AI labs. Apple, Google, Meta, and now OpenAI — they are all developing their own chips. Third, for developers, this is good news: a faster and cheaper code generator could reduce the cost of using APIs and make AI assistants more accessible.

But OpenAI won't completely free itself from Nvidia. Universal models still require powerful GPUs during training. OpenAI's own chips appear to be optimized specifically for inference — running already trained models. This makes sense: training happens rarely, but user requests come constantly. OpenAI has found a way to control the layer where it can profit from scale. And this is the right strategic move.

ZK
Hamidun News
AI news without noise. Daily editorial selection from 400+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.

Want to stop reading about AI and start using it?

AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.

What do you think?
Loading comments…