Salesforce xLAM-1B announced

Salesforce has developed xLAM-1B, a small but powerful AI model that supposedly outperforms larger models from OpenAI and Anthropic in function calls.

ElevenLabs AI Voice Isolator introduced

ElevenLabs has introduced a new free service called AI Voice Isolator, which removes unwanted background noise from movies, podcasts or YouTube videos. Unlike other programs that can only remove constant noise, the Voice Isolator also handles irregular noises such as a door opening or someone clapping.

ElevenLabs Reader reads any text aloud for you

ElevenLabs has released a new app called Reader, which allows users to have any text read aloud in AI voices. New are “Iconic Voices”, which recreate the voices of deceased stars such as Judy Garland, James Dean and Laurence Olivier. The company acquired the rights to the voices from CMG Worldwide and stresses that the …

Read more

Apple 4M is a multimodal powerhouse

The “4M” AI model provides a glimpse into Apple’s progress in artificial intelligence. Developed in collaboration with EPF Lausanne, the model can convert text to images, recognize objects, and manipulate 3D scenes based on speech input.

Resemble Detect-2B helps to recognize audio deepfakes

Resemble AI has introduced Detect-2B, a new audio deepfake detection model that claims to have 94% accuracy. The model looks for subtle artifacts to determine whether speech is real or artificially generated.

AWS introduces App Studio

With the new AWS App Studio, organizations are supposed to be able to build scalable internal applications in minutes-without any programming skills. Using generative AI, App Studio uses natural language descriptions to create applications that can be integrated with internal systems. According to Amazon, the tool handles all aspects of application development, from deployment to …

Read more

Major update for Writer

Writer released a major update to its platform to make it easier for businesses to use AI. Chatbots can now process up to 10 million words of company-specific data and take on complex tasks such as document analysis and knowledge management. An advanced algorithm is said to provide more accurate search results and make it …

Read more

Kyutai’s voice assistant Moshi is especially emotional

French AI research lab Kyutai, backed by billionaire Xavier Niel, has unveiled a new voice assistant called Moshi. This assistant can use 70 different emotions and styles to appear particularly authentic. Kyutai is releasing the code for the technology as open source.

Updates from Google and Meta

Google wants to improve the accuracy of its AI models. To avoid “hallucinations,” the company is working with partners such as Moody’s, Thomson Reuters, and ZoomInfo who will feed the AI systems with up-to-date information. A new “confidence score” is also supposed to indicate how confident the AI is in its answer being correct. With …

Read more

DeepMind V2A automatically generates audio for videos

Google’s AI research lab DeepMind has developed a new technology called V2A that can automatically generate appropriate soundtracks, sound effects, and even dialogue for videos. While V2A seems promising, DeepMind admits that the quality of the audio generated is not yet perfect. For now, it is not generally available.