New achievement: Transformers without normalization exceed standards

Q: What is the source?

Originally published on Jiqizhixin (机器之心). Hamidun News processes and adapts the material with AI.

Q: When was it published?

Jan 22, 2026. Reading time: 2 min.

Graduates of the Yao Program from Tsinghua University, under the leadership of Liu Zhuang, have once again made their mark in the world of artificial…

Hamidun News Editorial

AI monitoring · Jiqizhixin (机器之心)

Jan 22, 2026· 1 min

AI-processed from Jiqizhixin (机器之心); edited by Hamidun News

New achievement: Transformers without normalization exceed standards — Source: Jiqizhixin (机器之心). Collage: Hamidun News.

◐ Listen to article

Graduates of the Yao Program from Tsinghua University, under the leadership of Liu Zhuang, have once again made their mark in the world of artificial intelligence. Their latest work represents a significant breakthrough in Transformer architecture – a fundamental structure underlying modern natural language processing models. The key achievement is the development of a model that demonstrates superior performance without requiring normalization, potentially simplifying and accelerating the training process.

Previous Transformer versions, used in models like GPT and BERT, relied on normalization to stabilize training and improve convergence. However, this process adds computational complexity. Liu Zhuang's team managed to bypass this step by developing an architecture that effectively handles training without normalization while maintaining or even exceeding the performance of existing models.

This achievement is particularly important given the growing need for more efficient and scalable AI models. The main contribution of the work lies in a new architectural design approach that eliminates the need for normalization. This could lead to reduced computational resources required to train models and, consequently, lower costs for development and deployment.

Additionally, architecture simplification can contribute to faster training and greater stability, which is critical for developing advanced AI systems. For the industry, this means accelerating the development and deployment of natural language processing models. Companies will be able to create more efficient and cost-effective models, allowing them to implement new technologies faster and offer more advanced products.

For users, this means faster access to new features and improved service quality in AI-powered applications such as chatbots, translation systems, and intelligent assistants. In conclusion, Liu Zhuang's team's work represents an important step forward in Transformer architecture. Their innovative approach to developing models that do not require normalization opens new possibilities for improving performance, reducing costs, and accelerating development in natural language processing.

This achievement underscores the ongoing progress in AI and demonstrates that even in well-studied areas, significant opportunities for innovation remain. The future of AI looks increasingly promising, and such research will undoubtedly contribute to further advancement in this field.

Hamidun News

AI news without noise. Daily editorial selection from 400+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.

Telegram channel RSS hamidun.com

Want to stop reading about AI and start using it?

AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.

🎓 Academy — 7 days free Free consultation