
Google brings music generation to the Gemini mobile app


AI-processed from TechCrunch; edited by Hamidun News
Source: TechCrunch. Collage: Hamidun News.

Google has announced a music generation feature for its Gemini mobile app. The feature moves the AI assistant closer to a general-purpose tool for content creators, one that works with visual data as well as text. Users can now create original audio compositions from text descriptions and can also supply images and video clips as references. This multimodal input allows soundtracks to be generated quickly to match visual content.

The feature fits Google's broader strategy of making artificial intelligence a central element of its products. Gemini, the company's flagship AI project, keeps expanding its capabilities with the aim of becoming a single point of entry for a wide range of tasks. Music generation is a logical continuation of this trend, since sound is an integral part of most media content.

Gemini had already demonstrated its capabilities with text, code, and images, and the toolkit now includes the ability to work with sound. This underscores Google's commitment to building a complete ecosystem in which the AI assistant brings diverse creative processes together in a single interface.

With the new feature, Gemini can analyze the content of images and videos to suggest appropriate musical themes. For example, a user can upload a short video of a landscape and ask Gemini to create relaxing background music, or provide an image of a city and request an energetic electronic track. Text prompts still play an important role, allowing users to specify the genre, mood, tempo, and even particular instruments for the composition. This flexibility makes Gemini useful to both professionals and hobbyists who need musical accompaniment quickly for their projects, whether short videos for social media, presentations, or game prototypes.

The update has several implications for the content creation industry. First, it lowers the barrier to entry for people who want to add original music to their work but lack the skills to compose it or the resources to hire a composer or buy expensive software. Second, it speeds up production, allowing creators to obtain ready-made soundtracks in minutes rather than days or weeks.

Third, it could stimulate new forms of media in which music is an integral part of interactive or adaptive content generated in real time. For Google, the feature strengthens Gemini's position as a leading AI assistant and further develops its ecosystem, in which users can access a growing number of services from a single integrated application.

In conclusion, music generation in the Gemini mobile app is an important step in the development of multimodal AI tools. Google is signaling that the future belongs to assistants capable of understanding and generating content across formats, combining text, images, video, and sound. The integration expands users' creative capabilities and points to a deeper shift in how media content is created and consumed.

