Soundry AI creates additional music
Soundry AI is a generative AI tool for musicians that creates new music snippets from text prompts or from samples used as starting points. Source: Hacker News
I’ll keep you up-to-date with the latest tools and updates. There’s a lot to cover …
OpenAI is expanding its program for building custom AI models for enterprises to address specific use cases. With techniques such as assisted fine-tuning and custom-trained models, companies of all sizes should be able to develop personalized models. Source: TechCrunch
OctoAI introduces OctoStack, a platform that allows companies to customize generative AI models and deploy them in their own environments. Source: VentureBeat
Eggnog enables AI-generated videos with consistent characters. First you create the person, including outfits, then you storyboard the planned scenes of the clip, and finally you create the video. Eggnog aims to become the “YouTube for AI videos”. Sources: TechCrunch, Y Combinator
Vectorview helps to evaluate the performance and security of language models. Targeted testing with real-world scenarios is supposed to detect and prevent unintended behavior that is often missed by generic benchmarks. Sources: TechCrunch, Y Combinator
Assembly AI introduces its new Universal-1 speech recognition model, which is said to have 30% fewer hallucinations in speech data and 90% fewer hallucinations in ambient noise compared to OpenAI’s Whisper. The model offers improved accuracy for English, Spanish, French, and German, supports code-switching, and provides optimized timestamp estimation and faster parallel processing, which can be beneficial …
A demo video on Twitter/X shows a function commonly known as “inpainting”: parts of the image can be selected with a brush tool and then changed by text command. Source: Axios
OpenAI presents its new AI technology “Voice Engine”, which can apparently imitate human voices with deceptive realism. However, the company is limiting access to selected partners for now. Source: VentureBeat
MyShell TTS introduces OpenVoice. The tool can replicate a person’s voice in multiple languages using short snippets of audio. OpenVoice allows detailed control over voice style, emotion, accent, rhythm, pauses, and intonation. Source: Hacker News
Resemble AI introduces Rapid Voice Cloning, a tool that creates AI-powered voice clones from short audio recordings in less than a minute. Source: VentureBeat