Apple 4M is a multimodal powerhouse

The “4M” AI model provides a glimpse into Apple’s progress in artificial intelligence. Developed in collaboration with EPF Lausanne, the model can convert text to images, recognize objects, and manipulate 3D scenes based on speech input.

Resemble Detect-2B helps to recognize audio deepfakes

Resemble AI has introduced Detect-2B, a new audio deepfake detection model that claims to have 94% accuracy. The model looks for subtle artifacts to determine whether speech is real or artificially generated.

AWS introduces App Studio

With the new AWS App Studio, organizations are supposed to be able to build scalable internal applications in minutes-without any programming skills. Using generative AI, App Studio uses natural language descriptions to create applications that can be integrated with internal systems. According to Amazon, the tool handles all aspects of application development, from deployment to …

Read more

Major update for Writer

Writer released a major update to its platform to make it easier for businesses to use AI. Chatbots can now process up to 10 million words of company-specific data and take on complex tasks such as document analysis and knowledge management. An advanced algorithm is said to provide more accurate search results and make it …

Read more

Kyutai’s voice assistant Moshi is especially emotional

French AI research lab Kyutai, backed by billionaire Xavier Niel, has unveiled a new voice assistant called Moshi. This assistant can use 70 different emotions and styles to appear particularly authentic. Kyutai is releasing the code for the technology as open source.

Updates from Google and Meta

Google wants to improve the accuracy of its AI models. To avoid “hallucinations,” the company is working with partners such as Moody’s, Thomson Reuters, and ZoomInfo who will feed the AI systems with up-to-date information. A new “confidence score” is also supposed to indicate how confident the AI is in its answer being correct. With …

Read more

DeepMind V2A automatically generates audio for videos

Google’s AI research lab DeepMind has developed a new technology called V2A that can automatically generate appropriate soundtracks, sound effects, and even dialogue for videos. While V2A seems promising, DeepMind admits that the quality of the audio generated is not yet perfect. For now, it is not generally available.

LiveBench is a new benchmark for LLMs

LiveBench is a new benchmark for large language models developed by a team of scientists. Unlike existing benchmarks, it uses constantly updated questions from current sources and automatically scores the answers based on objective criteria. The team has taken special care to avoid the risk of “contamination”, where the training data of a language model …

Read more

Kong’s AI Gateway is a new platform for enterprise AI

Kong Inc. has released its “AI Gateway”, a platform designed to make it easier for enterprises to control and securely deploy generative AI in various cloud environments. According to Kong, the gateway enables the integration and management of various AI technologies through a single interface and provides security features to prevent misuse by manipulating the …

Read more

Nvidia shows new model for synthetic data

According to Nvidia, their new Nemotron-4 340B open language model will revolutionize the generation of synthetic data and enable companies to develop custom AI models.