36Kr (36氪)→ original

Sugon scaleX: 10,000 Accelerators in One Harness for Chinese Neural Networks

In a world where GPU quantity determines superpower status, China decided to stop complaining about sanctions and started building its own digital…

AI-processed from 36Kr (36氪); edited by Hamidun News
Sugon scaleX: 10,000 Accelerators in One Harness for Chinese Neural Networks
Source: 36Kr (36氪). Collage: Hamidun News.
◐ Listen to article

In a world where GPU quantity determines superpower status, China decided to stop complaining about sanctions and started building its own digital fortresses. While we follow NVIDIA's quarterly reports, Sugon announced that its megacluster scaleX has entered real operational deployment. This is not another lab test or paper announcement, but a full-fledged production supercluster with 10,000 accelerators, ready to process terabytes of data for training the most ambitious neural networks.

The project officially moves into the application phase in the country's largest engineering projects, and this deserves close attention. To understand the scale of what's happening, we need to recall the conditions Chinese tech sector has lived under for the past couple of years. Export restrictions on cutting-edge chips forced local players either into the shadows or to reinvent the wheel—but with a jet engine.

Sugon, with extensive supercomputing experience, chose the second path. The scaleX project was created as a direct response to the wild hunger for computational power experienced by Chinese LLM developers. When you don't have direct access to tens of thousands of H100s, the only way out is to learn to unite into one efficient network what you have at home.

What does the figure of 10,000 accelerators in one network actually mean? To the uninitiated, it's just a large pile of servers, but to an engineer—it's pure logistics nightmare. The main problem with such systems is interconnect—the speed at which cards communicate with each other.

If data gets "stuck" in bottlenecks of network bridges, all computational power becomes an expensive server room heater. Sugon claims their scaleX architecture solves the scalability problem, allowing thousands of chips to work as a single organism. This is critically important for GPT-4-level models, where training requires synchronizing colossal parameter volumes in real time without delays.

The transition to "application" phase (deployment in large engineering projects) means the system's childhood diseases have been cured, and the cluster is handed over to real customers. The first users will likely be government structures and tech giants like Baidu or Alibaba, who need to train their models in closed-circuit environments.

It's important to understand that China is building not just "hardware," but a completely closed ecosystem. Their own accelerators, their own software, their own optimization libraries, and their own data. This makes their AI industry practically invulnerable to external political storms and new packages of restrictions.

Of course, skeptics will immediately ask about efficiency. It's one thing to assemble 10,000 cards in one building, and quite another to make them work with the same efficiency as clusters based on NVIDIA infrastructure. However, the mere fact of a working system of such scale suggests that the technology gap Western analysts love to discuss might be shorter than it seems.

If scaleX shows decent results in the coming months, this will be a powerful signal to the entire market: the monopoly on "big compute" is officially under threat.

In the near future, we will see the first fruits of this machine. These will likely be not just chatbots for generating poetry, but specialized industry-specific models for heavy industry, weather forecasting, and deep medicine. China traditionally bets on practical AI application in the real economy, where computational power converts into real money and competitive advantages on the global stage. While the rest of the world discusses ethics, Sugon is simply building the foundation for digital dominance.

The key point: Sugon scaleX is not just a server farm, but a full-fledged claim to computational sovereignty. If a 10,000-card cluster operates at full capacity and demonstrates stability, China's dependence on Western hardware becomes yesterday's news. Will this "Chinese giant" overcome performance limitations of individual chips through flawless architecture of the entire network? We'll get the answer in the coming tests.

ZK
Hamidun News
AI news without noise. Daily editorial selection from 400+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.

Want to stop reading about AI and start using it?

AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.

What do you think?
Loading comments…