Jiqizhixin (机器之心)→ original

SkyMusic: Why Chinese Kunlun Tech Spooked Suno and Udio

Китайская компания Kunlun Tech представила SkyMusic — первую в стране большую модель для генерации музыки, способную на равных конкурировать с западными лидерам

AI-processed from Jiqizhixin (机器之心); edited by Hamidun News
SkyMusic: Why Chinese Kunlun Tech Spooked Suno and Udio
Source: Jiqizhixin (机器之心). Collage: Hamidun News.
◐ Listen to article

Let's be honest: until recently, most AI-generated tracks sounded like a robot trying to mimic singing underwater. We've grown accustomed to Suno and Udio as the gold standard, but the AI industry evolves too quickly for anyone to rest on their laurels for long. Chinese Kunlun Tech has entered the scene with its new development, SkyMusic, and this is one of those cases where bombastic headlines about "killer apps" actually have substance behind them.

To understand the scale of this moment, it's worth recalling that Kunlun is not just another startup, but a giant standing behind the Opera browser and a powerful Skywork ecosystem. The main challenge in AI music has always been nuance. Generating a beat or a simple melody is a 2023 problem.

In 2024, we want to hear emotion, breath, subtle changes in timbre, and what musicians call "soul." SkyMusic is betting on exactly that. The developers applied large language model (LLM) architecture directly to audio tokens.

Instead of simply mixing sound layers, the system understands song structure the way GPT understands text structure. This allows the model to generate vocals that sound disturbingly natural. If you close your eyes, in many cases you won't be able to tell the recording from a live studio performance at a mid-level studio.

The technical specifications are impressive, but they're not the main point here. Yes, 80 seconds of generation and 44.1 kHz is the industry standard.

What matters far more is how SkyMusic handles multitasking. It simultaneously manages text, melody, and arrangement, maintaining stylistic integrity throughout the entire track. Chinese engineers claim their model is the first of its kind to achieve this level of quality in comprehensive generation.

This is a serious challenge to Western companies, which are now more focused on legal aspects of using copyrighted content than on pure technological leaps. What does this mean for the industry as a whole? We're witnessing the formation of two parallel technological stacks.

On one side—American models, constrained by regulations and potential lawsuits from labels. On the other—Chinese solutions developing at incredible speed, backed by massive internal computational resources. Kunlun Tech clearly intends to make SkyMusic a global product, and they have all the resources for it.

The fact that the model already holds top positions in specialized benchmarks suggests that the gap between East and West in creative AI is shrinking rapidly, and in some areas has already disappeared.

Of course, questions remain about ethics and how the professional community will accept such tools. But the reality is this: the barrier to entry for creating quality musical content has dropped even lower. Now to create a hit, you don't need to sing or play an instrument—you just need an idea and the right prompt.

SkyMusic is not just "another neural network"; it's confirmation that music is finally turning into data that can be manipulated as easily as text in a chatbot. We're entering an era where competition will be based not on sound purity, but on idea originality, because technical perfection is now available at the click of a button. The bottom line: Kunlun Tech has proven that leadership in AI music is not locked to Silicon Valley.

Are you ready for your next favorite track to be written by a Chinese neural network?

ZK
Hamidun News
AI news without noise. Daily editorial selection from 400+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.

Want to stop reading about AI and start using it?

AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.

What do you think?
Loading comments…