AI voice cloning tools lack effective safeguards against misuse

Most AI voice cloning services have inadequate protections against nonconsensual voice impersonation, according to a Consumer Reports investigation. The study examined six leading publicly available tools and found that five had easily bypassed safeguards. As reported by NBC News, four services (ElevenLabs, Speechify, PlayHT, and Lovo) merely require checking a box confirming authorization, while Resemble … Read more

ElevenLabs launches Scribe with record 96.7% accuracy for English speech-to-text

ElevenLabs has released Scribe v1, a new speech-to-text model achieving record accuracy rates across 99 languages. According to Carl Franzen of VentureBeat, the model outperforms competitors from Google, OpenAI, and Deepgram with a 96.7% accuracy rate for English. Scribe can distinguish up to 32 different speakers in a single audio file and detect non-verbal elements … Read more

Sesame introduces conversational AI assistant with natural voice presence

Sesame, a startup led by Oculus co-founder Brendan Iribe, has unveiled a new AI voice assistant called Maya that aims to cross “the uncanny valley of conversational voice.” According to a recent article by technology journalist Sean Hollister, Maya offers more natural and engaging conversations compared to existing voice assistants like Amazon’s Alexa or Google’s … Read more

Hume AI launches Octave, a text-to-speech model with emotional controls

Hume AI has introduced Octave, a new text-to-speech system that can generate emotionally nuanced AI voices for content creation. As reported by Carl Franzen for VentureBeat, this large language model can adjust tone, rhythm, and cadence based on textual context. Users can fine-tune emotions at the sentence level through simple text prompts like “happier” or … Read more

Spotify partners with ElevenLabs to expand AI audiobook narration

Spotify has expanded its AI audiobook capabilities through a new partnership with ElevenLabs, a leading AI voice technology provider. As reported by Jess Weatherbed for The Verge, the streaming platform will now accept audiobooks created using ElevenLabs’ voice synthesis software. The service supports narration in 29 languages and requires a Pro subscription of $99 monthly … Read more

Riffusion launches free AI platform for personalized music creation

Riffusion, a San Francisco-based AI startup, has introduced a new free web platform that enables users to create original music through artificial intelligence. According to Michael Nuñez at VentureBeat, the platform’s AI model called Fuzz can generate complete songs from text descriptions, audio clips, or visual prompts. The system learns users’ musical preferences over time … Read more

Google expands NotebookLM with interactive AI features and enterprise version

Google has announced significant updates to its AI-powered note-taking application NotebookLM, including a new interactive feature for its Audio Overviews function and an enterprise-focused version called NotebookLM Plus. The application, which has gained popularity for its ability to generate podcast-like conversations between AI hosts based on source materials, now allows users to directly interact with … Read more

ElevenLabs introduces AI podcast creation and editing system

ElevenLabs has launched a new AI-powered tool that enables users to create and edit podcasts from text documents and other source materials. As reported by Ashley Carman for Bloomberg, the system can generate conversational podcasts in 32 languages using AI-voiced hosts selected from thousands of voice samples. The New York-based startup, valued at $1.1 billion … Read more

Hume AI releases voice customization tool for developers

Hume AI has launched Voice Control, a new feature that enables developers to create custom AI voices by adjusting vocal characteristics through an interface with sliding controls. As reported by Carl Franzen for VentureBeat, the tool allows users to modify voices along ten different dimensions including assertiveness, confidence, and enthusiasm without requiring coding skills. The … Read more

Nvidia unveils AI audio generation model Fugatto

Nvidia has introduced a new AI model called Fugatto that can generate and modify audio, including music, voice, and sound effects. As reported by Stephen Nellis for Reuters, the technology allows users to transform existing sounds, change voice accents, and create novel audio effects through text prompts. The model, whose name stands for Foundational Generative … Read more