Generating music and sound with AI – three examples

AIs can generate not only text, images, and video, but also sound and music. The progress in quality is amazing. Let’s look at three prominent examples: Udio Launched a week ago as part of a public beta, Udio has already caused quite a stir. The website contains numerous examples of songs created with this tool. …

Read more

Soundry AI creates additional music

Soundry AI is a generative AI tool for musicians that can be used to create additional music snippets by entering text or using samples as starting points. Source: Hacker News

Assembly AI announces speech recognition model

Assembly AI introduces its new Universal-1 speech recognition model, which is said to have 30% fewer hallucinations in speech data and 90% fewer hallucinations in ambient noise compared to OpenAI’s Whisper. The model offers improved accuracy for English, Spanish, French, and German, supports code-switching, optimized timestamp estimation, and faster parallel processing, which can be beneficial …

Read more

OpenAI Voice Engine announced

OpenAI presents its new AI technology “Voice Engine“, which can apparently imitate human voices deceptively realistically. However, the company is limiting access to selected partners for now. Source: VentureBeat

OpenVoice can replicate voices in multiple languages

MyShell TTS introduces OpenVoice. The tool can replicate a person’s voice in multiple languages using short snippets of audio. OpenVoice allows detailed control over voice style, emotion, accent, rhythm, pauses, and intonation. Source: Hacker News

Resemble AI announces Rapid Voice Cloning

Resemble AI introduces Rapid Voice Cloning, a tool that creates AI-powered voice clones from short audio recordings in less than a minute. Source: VentureBeat

Stable Audio 2.0: Songs by text command

Stability AI has released Stable Audio 2.0, an update to its generative audio AI. The new version can create audio clips of up to three minutes from text descriptions. Stable Audio 2.0 can also transform uploaded audio files based on natural language instructions. The company has seemingly taken copyright protection very seriously: It says it …

Read more