DeepSeek, together with Tsinghua University and Peking University, improves AI agents' reasoning

Q: What is the source?

Originally published on Jiqizhixin (机器之心). Hamidun News processes and adapts the material with AI.

Q: When was it published?

2026-02-27. Reading time: 3 min.

DeepSeek has released a new scientific paper together with Tsinghua University and Peking University. The study focuses on optimizing the reasoning process in l

Hamidun News Editorial

AI monitoring · Jiqizhixin (机器之心)

2026-02-27· 3 min

AI-processed from Jiqizhixin (机器之心); edited by Hamidun News

DeepSeek, together with Tsinghua University and Peking University, improves AI agents' reasoning — Source: Jiqizhixin (机器之心). Collage: Hamidun News.

◐ Listen to article

DeepSeek continues to surprise the global AI community. The Chinese laboratory has published a new scientific paper in collaboration with Tsinghua University and Peking University — two flagship technical universities in the country. At the center of the research is one of the most pressing problems in modern machine learning: how to make large language models reason more efficiently when they act not as passive conversation partners, but as autonomous agents capable of planning and executing multi-step tasks.

To understand the significance of this publication, context is needed. Over the past twelve months, DeepSeek has consistently built a reputation as a research organization capable of challenging far more resource-intensive Western laboratories. The R1 series models attracted attention with their approach to reasoning chains, and each new paper from the team immediately topped academic aggregators and became the subject of discussion in leading AI communities. Now, combining efforts with Tsinghua and Peking University, DeepSeek is betting on synergy between commercial developments and academic expertise — a combination that has historically yielded strong results precisely in fundamental research.

The topic itself — optimizing reasoning in agent models — is no accident. Standard language models, trained to answer questions in dialogue mode, face fundamental limitations when embedded in agent systems. An agent doesn't simply generate an answer: it must decompose the task, select an appropriate tool, execute an action, interpret the result, and decide what to do next. Each of these steps requires stable and consistent reasoning — this is precisely where modern LLMs often make mistakes, lose context, or accumulate errors in long chains of actions. Dozens of laboratories worldwide are attempting to solve this problem, and each new approach to overcoming it has practical significance far beyond academic benchmarks.

The details of architectural solutions and specific scores on standard tests are still awaiting publication in the full version of the paper, but the fact of collaboration itself speaks volumes. Tsinghua University possesses some of the strongest research groups in deep learning and the theoretical foundations of neural networks; Peking University is traditionally strong in optimization and mathematical methods. DeepSeek, in turn, brings infrastructure for large-scale training and experience working with production systems. Such an alliance allows not just proposing a new method, but testing its viability on tasks of real scale.

For the industry, this research has several important dimensions. First, the quality of reasoning in agent tasks directly determines how reliably AI agents can be deployed in corporate scenarios — from automating code development to managing complex business processes. Second, publications of this caliber from Chinese institutions intensify competitive pressure on OpenAI, Google DeepMind, and Anthropic, forcing them to accelerate their own research in related directions. Finally, the transparency of the academic format means that methods from this work can be adapted and reproduced by independent teams around the world — which accelerates progress across the entire industry.

However, one should not expect immediate implementation of results in commercial products. The path from an academic paper to a stably functioning production system is long and thorny. Benchmarks capture improvements under controlled conditions, whereas real agent scenarios are full of unpredictable edge cases. Nevertheless, it is precisely such works that set the direction: they formulate which exact properties should be present in the next generation of agent models and what tools can measure these properties.

DeepSeek continues to move along its own trajectory — methodically, publicly, and with a clear calculation for long-term influence. The publication of a paper co-authored with Tsinghua and Peking University is not just an academic publication, but a signal that the Chinese AI ecosystem is capable of integrating university science and industrial resources into a single research flow. The full version of the paper will reveal how far this collaboration has advanced — and what it can offer developers who are already building the next generation of agent systems.

Hamidun News

AI news without noise. Daily editorial selection from 400+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.

Telegram channel RSS hamidun.com

Want to stop reading about AI and start using it?

AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.

🎓 Academy — 7 days free Free consultation