AI creates never-before-heard sound

Nvidia claims Fugatto tool could revolutionise music

Anthony Cuthbertson
Tuesday 26 November 2024 13:21 GMT
Comments
Nvidia’s AI tool Fugatto can make a trumpet bark o a saxophone meow
Nvidia’s AI tool Fugatto can make a trumpet bark o a saxophone meow (iStock)

Your support helps us to tell the story

From reproductive rights to climate change to Big Tech, The Independent is on the ground when the story is developing. Whether it's investigating the financials of Elon Musk's pro-Trump PAC or producing our latest documentary, 'The A Word', which shines a light on the American women fighting for reproductive rights, we know how important it is to parse out the facts from the messaging.

At such a critical moment in US history, we need reporters on the ground. Your donation allows us to keep sending journalists to speak to both sides of the story.

The Independent is trusted by Americans across the entire political spectrum. And unlike many other quality news outlets, we choose not to lock Americans out of our reporting and analysis with paywalls. We believe quality journalism should be available to everyone, paid for by those who can afford it.

Your support makes all the difference.

Researchers have developed an AI audio generator that they claim can create sounds that have never been heard before.

The new generative artificial intelligence model, called Fugatto, was built by a team at computer chip giant Nvidia who wanted to create a “Swiss Army knife for sound”.

The tool allows users to edit or generate audio using simple text prompts, such as removing a particular instrument from a song or changing the accent of someone’s voice.

In a blog post detailing Fugatto, Nvidia’s Richard Kerris claimed it offered more dexterity than any other AI model and holds the potential to revolutionise music through brand new sounds.

“Fugatto can make a trumpet bark or a saxophone meow,” Mr Kerris wrote. “Whatever users can describe, the model can create.”

A video demonstrating the tool shows Fugatto creating a unique audio sample from a text prompt: “Deep, rumbling bass pulses paired with intermittent, high-pitched digital chirps, like the sound of a massive, sentient machine waking up.”

Another example shows Fugatto transforming the sound of a train into a string orchestra.

“This thing is wild,” said Ido Zmishlany, a producer and songwriter who is part of Nvidia’s Inception program to help new startups.

“Sound is my inspiration. It’s what moves me to create music. The idea that I can create entirely new sounds on the fly in the studio is incredible.”

Fugatto took more than a year to develop, requiring millions of audio samples to train it.

The rapid advancements that generative AI has brought to audio technology has led to concerns about how it might impact people working within creative industries.

Earlier this year, the Australian Association of Voice Actors warned a parliamentary committee that an estimated 5,000 local voice actors could soon be put out of a job if companies adopt AI-powered replacements.

The music industry has also voiced its concerns about how generative AI programs might infringe copyrighted material, with the Recording Industry Association of America recently launching a lawsuit against AI tools for allegedly replicating their artists’ music.

Despite the pushback, some artists believe AI represents another evolution of music that will bring with it new creative possibilities.

“The history of music is also a history of technology. The electric guitar gave the world rock and roll,” said Mr Zmishlany.

“When the sampler showed up, hip-hop was born. With AI, we’re writing the next chapter of music. We have a new instrument, a new tool for making music – and that’s super exciting.”

Join our commenting forum

Join thought-provoking conversations, follow other Independent readers and see their replies

Comments

Thank you for registering

Please refresh the page or navigate to another page on the site to be automatically logged inPlease refresh your browser to be logged in