Nvidia unveils AI audio generation model Fugatto

Nvidia has introduced a new AI model called Fugatto that can generate and modify audio, including music, voices, and sound effects. As reported by Stephen Nellis for Reuters, the technology lets users transform existing sounds, change the accent of a voice, and create novel audio effects through text prompts. The model, whose name stands for Foundational Generative Audio Transformer Opus 1, can perform unusual transformations such as making a trumpet sound like a barking dog or converting piano notes into sung vocals. Although the model is aimed at producers of music, films, and video games, Nvidia has not announced plans for a public release, citing the risk of misuse. The company trained the model on open-source data. It joins other tech firms, such as Meta and OpenAI, that are developing generative audio AI while weighing how to release it.
