Truecaller lets an AI with your voice answer the phone

Calling app Truecaller is introducing a new feature that allows users to create an AI version of their own voice to answer calls and ask the reason for the call, for example. But is it a good idea to use your own voice for this? I think it would be confusing …

OpenAI releases GPT-4o and more

One day before Google’s I/O, OpenAI tried to steal the show from its big competitor. And their demos definitely caused quite a stir. The focus was on their latest AI model GPT-4o, where the “o” stands for “omnimodel”. This is to indicate that this version does not only process text, but also e.g. image and …

Read more

Google’s fireworks of new tools and features

As expected, Google used the keynote at its I/O developer conference to demonstrate its strength in AI. Among other things, the company presented new AI models for a wide range of tasks. Some will run directly on Android devices or can be found in the Chrome browser. Others use Google’s specialized servers. They create text, …

Read more

OpenVoice is an AI for voice cloning

OpenVoice allows users to realistically clone voices in different languages and accents, and even control emotions and speaking styles. The latest version, OpenVoice V2, offers improved audio quality, native support for multiple languages, and is available free for commercial use. Source: Hacker News

AdaKWS claims better speech recognition than OpenAI’s Whisper

The new AI model AdaKWS from speech recognition specialist aiOla claims to be able to convert speech correctly into text, even if it is technical jargon. The model achieves an accuracy of 94.6% – better than OpenAI’s Whisper.

Generating music and sound with AI – three examples

AIs can generate not only text, images, and video, but also sound and music. The progress in quality is amazing. Let’s look at three prominent examples: Udio Launched a week ago as part of a public beta, Udio has already caused quite a stir. The website contains numerous examples of songs created with this tool. …

Read more

Soundry AI creates additional music

Soundry AI is a generative AI tool for musicians that can be used to create additional music snippets by entering text or using samples as starting points. Source: Hacker News

Assembly AI announces speech recognition model

Assembly AI introduces its new Universal-1 speech recognition model, which is said to have 30% fewer hallucinations in speech data and 90% fewer hallucinations in ambient noise compared to OpenAI’s Whisper. The model offers improved accuracy for English, Spanish, French, and German, supports code-switching, optimized timestamp estimation, and faster parallel processing, which can be beneficial …

Read more

OpenAI Voice Engine announced

OpenAI presents its new AI technology “Voice Engine“, which can apparently imitate human voices deceptively realistically. However, the company is limiting access to selected partners for now. Source: VentureBeat