36Kr (36氪)→ original

MinerU adapted to run on 10 Chinese AI chip models

The OpenDataLab team at the Shanghai AI Laboratory announced the completion of a deep adaptation of the MinerU tool to run on more than 10 Chinese computing…

AI-processed from 36Kr (36氪); edited by Hamidun News
MinerU adapted to run on 10 Chinese AI chip models
Source: 36Kr (36氪). Collage: Hamidun News.
◐ Listen to article

# MinerU adapted for operation on 10 Chinese AI chip models: why this is critical for technological supply chain independence

Chinese artificial intelligence developers have received a long-awaited tool to overcome dependence on Western equipment. The OpenDataLab team from the Shanghai AI Laboratory, in collaboration with DeepLink and several domestic chip manufacturers, announced the completion of adapting MinerU — a high-precision document parser — for operation on more than 10 different domestic computing platforms. These include Ascend, T-Head, and Metax architectures. This work underscores the region's large-scale effort to reduce technological dependence and build its own innovation ecosystem.

MinerU is not simply another text processing tool. It is a specialized system that transforms complex PDF files, web pages, mathematical formulas, and intricate tables into structured data that large language models can properly process. The conversion accuracy reaches 99%, which is critically important because the quality of training data directly affects the capabilities of the resulting model. Essentially, MinerU solves a problem that has long been a bottleneck in AI data preparation: how to extract meaning from millions of unstructured documents stored in corporate archives and government registries.

The problem exists not only in theory. When companies and government institutions attempt to digitize their archives or prepare datasets for model training, they face an avalanche of PDF files, scanned documents, and tables that need to be converted to machine-readable format. Doing this manually is impossible, and existing solutions often lose context, distort formulas, or misinterpret visual elements. MinerU solves this task with accuracy close to perfect, allowing organizations to save months of work and human resources.

But what is the true significance of this news? Adapting MinerU for 10+ domestic chip platforms means that Chinese developers can now build a complete AI production cycle without turning to American and European components. This applies to all stages: from data collection and preparation to model training. When infrastructure runs on local chips — whether Ascend from Huawei or T-Head from Alibaba — the entire value creation chain remains in the country.

The geopolitical context here is inevitable. Tensions between the West and China have led to sanctions on the export of advanced semiconductors, forcing the region to invest in its own development. OpenDataLab chose precisely this moment to complete the adaptation of MinerU, signaling that the local technological base is sufficiently developed to launch complex engineering projects. This is not simply a technical success — it is a demonstration of the state of the local AI industry.

For users in the global market, this means the emergence of an alternative source of data and tools for document processing. For Chinese companies and government bodies, this opens the possibility of scaling their AI projects without equipment constraints. And while the adaptation does not change the technology itself, it changes the economics of its application: now working with MinerU is possible more cheaply and without concerns about how sanctions impact the supply chain.

ZK
Hamidun News
AI news without noise. Daily editorial selection from 400+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.

Want to stop reading about AI and start using it?

AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.

What do you think?
Loading comments…