36Kr (36氪)→ original

OpenAI introduced its first AI model based on Cerebras chips

OpenAI has officially introduced GPT-5.3-Codex-Spark, its first model running on hardware from startup Cerebras Systems. The new release is an optimized, faster

AI-processed from 36Kr (36氪); edited by Hamidun News
OpenAI introduced its first AI model based on Cerebras chips
Source: 36Kr (36氪). Collage: Hamidun News.
◐ Listen to article

OpenAI has made a strategic move toward technological independence: the company unveiled GPT-5.3-Codex-Spark — the first artificial intelligence model specifically developed to run on Cerebras Systems chips. This is not merely a new product, but a demonstration that OpenAI is ready to diversify its hardware partnerships and reduce dependence on Nvidia, which dominates the AI accelerator market. Codex-Spark is positioned as an optimized version of the well-known Codex tool for code-writing automation — on one hand, easier to operate, on the other, significantly faster at executing tasks. The model's launch coincides with intensifying competition in the segment of AI assistants for developers, where OpenAI competes against Google Gemini and Claude from Anthropic.

The context of this decision lies in the growing systemic constraints that OpenAI has encountered. The company globally purchases Nvidia GPUs, but this creates a vulnerability: at any moment, supplies can be restricted, prices can rise, and Nvidia can impose its own conditions. Cerebras Systems offers an alternative with its own WSE (Wafer Scale Engine) architecture — a chip designed differently and providing a different balance between throughput and energy efficiency. For OpenAI, this is a strategic necessity: diversifying sources of performance means a more resilient market position and reduced vulnerability to external factors.

GPT-5.3-Codex-Spark is not entirely a new model, but an optimized version of Codex, specifically adapted to the characteristics of Cerebras chips. This is an important technical solution: different architectures require different approaches to memory distribution, compute parallelization, and data caching. OpenAI engineers redesigned the model's structure so it maximally and efficiently uses the specific Cerebras topology. The result is a noticeable speed increase without loss of precision in code generation. For developers, this means a faster feedback loop: code autocomplete requests will execute with minimal latency, which is critical for the IDE ecosystem and continuous development.

The size and weight of the model also played a role in this optimization. Codex-Spark is positioned as a "lightweight" version, which allows for faster model loading into memory and reduces network bandwidth requirements. In a world where every millisecond of latency affects user experience, this matters. Especially in the context of competition: Google Gemini and Claude already offer built-in tools for code handling, but developers often criticize their speed. Codex-Spark can capture part of this audience if it is indeed faster while maintaining high accuracy in suggestions.

This move reflects a broader industry trend: Nvidia's GPU monopoly is unstable and all major players are beginning to seek alternatives. Apple developed its own Neural Engine, Google invested in TPU, Amazon created Trainium and Inferentia. OpenAI, being a model developer rather than a chip manufacturer, has chosen a different path: partnership with Cerebras allows it to remain flexible and not be tied to a single supplier.

The launch of Codex-Spark marks the beginning of a new phase in OpenAI's strategy — not only to create the best models, but also to ensure they run on diverse hardware. This complicates life for competitors who still depend on standard hardware and strengthens OpenAI's position in the market for AI developer tools. The question now is how effectively this strategy will work in practice and whether Cerebras can scale its chips fast enough to meet growing demand.

ZK
Hamidun News
AI news without noise. Daily editorial selection from 400+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.

Want to stop reading about AI and start using it?

AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.

What do you think?
Loading comments…