ElevenLabs AI Voice Isolator introduced

ElevenLabs has introduced a new free service called AI Voice Isolator, which removes unwanted background noise from movies, podcasts or YouTube videos. Unlike other programs that can only remove constant noise, the Voice Isolator also handles irregular noises such as a door opening or someone clapping.

ElevenLabs Reader reads any text aloud for you

ElevenLabs has released a new app called Reader, which allows users to have any text read aloud in AI voices. New are “Iconic Voices”, which recreate the voices of deceased stars such as Judy Garland, James Dean and Laurence Olivier. The company acquired the rights to the voices from CMG Worldwide and stresses that the …

Read more

Resemble Detect-2B helps to recognize audio deepfakes

Resemble AI has introduced Detect-2B, a new audio deepfake detection model that claims to have 94% accuracy. The model looks for subtle artifacts to determine whether speech is real or artificially generated.

DeepMind V2A automatically generates audio for videos

Google’s AI research lab DeepMind has developed a new technology called V2A that can automatically generate appropriate soundtracks, sound effects, and even dialogue for videos. While V2A seems promising, DeepMind admits that the quality of the audio generated is not yet perfect. For now, it is not generally available.

Meta releases several new AI models

Meta is releasing a series of new AI models for audio, text and watermarks. Meta is also making two sizes of its Chameleon multimodal text model available for research. These models can be used to perform tasks that require both visual and textual understanding, such as image annotation.

Add sound effects to your videos with this tool

ElevenLabs has released a new tool that allows video creators to quickly and easily add sound effects to their clips. The app analyzes uploaded videos and suggests different sound effects that can be integrated directly into the videos via an interface.

Camb AI Mars5 enables voice cloning in over 140 languages

Camb AI’s Mars5 AI model enables realistic voice cloning in over 140 languages, combining voice cloning and text-to-speech in a single platform. The company claims that Mars5 is particularly good at capturing emotional nuances in speech, making it ideal for applications such as sports commentary and movies.

Stability AI release Stable Audio Open

Stability AI releases “Stable Audio Open,” a new AI model for the free creation of sounds and pieces of music up to 47 seconds in length. However, due to the training material, it is limited to English descriptions and Western music styles.

ElevenLabs Sound Effects creates audio samples

ElevenLabs, a speech synthesis AI startup, unveiled “Sound Effects,” a new product that allows users to create audio samples simply by entering text. Developed in partnership with Shutterstock, the tool is designed to help creative professionals in fields as diverse as film, television, video games, and social media enhance their content with interesting and appropriate …

Read more