MusicLM: Google AI generates music in various genres at 24 kHz

text-to-music —

Your musical wish is MusicLM’s command, making audio from “rich captions.”


An AI-generated image of an exploding ball of music.


Ars Technica

On Thursday, researchers from Google announced a new generative AI model called MusicLM that can create 24 kHz musical audio from text descriptions, such as “a calming violin melody backed by a distorted guitar riff.” It can also transform a hummed melody into a different musical style and generate several minutes of music at a time.
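To put the 24 kHz figure in perspective, here is a minimal arithmetic sketch of how many raw audio samples a clip contains at that rate and the highest frequency the rate can represent (half the sample rate, per the Nyquist limit). The function names are illustrative and the clip is assumed to be mono; nothing here comes from Google's code.

```python
# Illustrative arithmetic only; names are hypothetical, mono audio assumed.
SAMPLE_RATE_HZ = 24_000  # output sample rate reported for MusicLM


def num_samples(duration_s: float, sample_rate_hz: int = SAMPLE_RATE_HZ) -> int:
    """Return the number of samples in a mono clip of the given duration."""
    return round(duration_s * sample_rate_hz)


def nyquist_hz(sample_rate_hz: int = SAMPLE_RATE_HZ) -> float:
    """Return the highest frequency representable at this sample rate."""
    return sample_rate_hz / 2


if __name__ == "__main__":
    print(num_samples(10))      # 10-second clip: 240000 samples
    print(num_samples(5 * 60))  # five-minute clip: 7200000 samples
    print(nyquist_hz())         # 12000.0 Hz upper frequency limit
```

For comparison, CD audio is sampled at 44.1 kHz, so a 24 kHz model covers a narrower frequency range than a typical commercial recording.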

MusicLM uses an AI model trained on what Google calls “a large dataset of unlabeled music,” along with captions from MusicCaps, a new dataset composed of 5,521 music-text pairs. MusicCaps gets its text descriptions from human experts and its matching audio clips from Google’s AudioSet, a collection of over 2 million labeled 10-second sound clips pulled from YouTube videos.

Generally speaking, MusicLM works in two main parts. The first takes a sequence of audio tokens (pieces of sound) and maps them to semantic tokens (words that represent meaning) in captions for training. The second receives user captions and/or input audio and generates acoustic tokens (pieces of sound that make up the resulting song output). The system relies on an earlier AI model called AudioLM (introduced by Google in September) along with other components such as SoundStream and MuLan.

Google claims that MusicLM outperforms previous AI music generators in audio quality and adherence to text descriptions. On the MusicLM demonstration page, Google provides numerous examples of the AI model in action, creating audio from “rich captions” that describe the feel of the music, and even vocals (which so far are gibberish). Here is an example of a rich caption that they provide:

Slow tempo, bass-and-drums-led reggae song. Sustained electric guitar. High-pitched bongos with ringing tones. Vocals are relaxed with a laid-back feel, very expressive.

Google also shows off MusicLM’s “long generation” (creating five-minute music clips from a simple prompt), “story mode” (which takes a sequence of text prompts and turns it into a morphing series of musical tunes), “text and melody conditioning” (which takes a human humming or whistling audio input and changes it to match the style laid out in a prompt), and generating music that matches the mood of image captions.

A block diagram of the MusicLM AI music-generation model taken from its academic paper.


Google Research

Further down the example page, Google dives into MusicLM’s ability to re-create particular instruments (e.g., flute, cello, guitar), different musical genres, various musician experience levels, places (escaping prison, gym), time periods (a club in the 1950s), and more.

AI-generated music isn’t a new idea by any stretch, but AI music-generation methods of previous decades often created musical notation that was later played by hand or through a synthesizer, whereas MusicLM generates the raw audio frequencies of the music. Also, in December, we covered Riffusion, a hobby AI project that can similarly create music from text descriptions, but not at high fidelity. Google references Riffusion in its MusicLM academic paper, saying that MusicLM surpasses it in quality.

In the MusicLM paper, its creators outline potential impacts of MusicLM, including “potential misappropriation of creative content” (i.e., copyright issues), potential biases for cultures underrepresented in the training data, and potential cultural appropriation issues. As a result, Google emphasizes the need for more work on tackling these risks and is holding back the code: “We have no plans to release models at this point.”

Google’s researchers are already looking ahead toward future improvements: “Future work may focus on lyrics generation, along with improvement of text conditioning and vocal quality. Another aspect is the modeling of high-level song structure like introduction, verse, and chorus. Modeling the music at a higher sample rate is an additional goal.”

It’s probably not too much of a stretch to suggest that AI researchers will continue improving music-generation technology until anyone can create studio-quality music in any style just by describing it—although no one can yet predict exactly when that goal will be attained or how exactly it will impact the music industry. Stay tuned for further developments.

Benj Edwards
