Microsoft unveils Magentic-One, an open-source framework for managing multi-agent AI systems

Microsoft has released Magentic-One, a new open-source framework in which a single AI model manages multiple helper agents that work together to complete complex, multi-step tasks. According to a paper by Microsoft researchers, Magentic-One is a generalist agentic system that can “fully realize the long-held vision of agentic systems that can enhance …

OmniGen: First unified model for image generation

Researchers have introduced OmniGen, the first diffusion model capable of unifying various image generation tasks within a single framework. Unlike existing models like Stable Diffusion, OmniGen does not require additional modules to handle different control conditions, according to the authors Shitao Xiao, Yueze Wang, Junjie Zhou, Huaying Yuan, et al. The model can perform text-to-image …

Hugging Face releases compact language models for smartphones and edge devices

Hugging Face has released SmolLM2, a new family of compact language models designed to run on smartphones and edge devices with limited processing power and memory. The models, released under the Apache 2.0 license, come in three sizes up to 1.7B parameters and achieve impressive performance on key benchmarks, outperforming larger models like Meta’s Llama …

Meta makes Llama AI models available for US defense applications

Meta is making its Llama AI models available to U.S. government agencies and contractors working on defense and national security applications. According to a blog post by Meta cited by TechCrunch, the company is partnering with firms like Accenture, Amazon Web Services, and Lockheed Martin to bring Llama to these entities. The move comes after …

Omnivore acquired by ElevenLabs to power new ElevenReader app

Omnivore, a reading app startup, has been acquired by ElevenLabs, an AI audio technology company, to help develop its new ElevenReader app. According to a note from Omnivore’s founders Jackson and Hongbo, the acquisition will enable them to create more accessible reading and listening experiences on a larger platform. Omnivore users are invited to create …

Amphion: open-source toolkit for audio, music and speech generation

Amphion is an open-source toolkit designed to support research and development in audio, music and speech generation. According to the project’s GitHub repository, it offers unique visualizations of classic models and architectures to help junior researchers and engineers better understand them. The toolkit supports various individual generation tasks such as text-to-speech (TTS), singing voice synthesis …

Speech to text: Moonshine is fast and as accurate as OpenAI’s Whisper

Useful Sensors, an AI company focused on improving human-machine communication, has open-sourced Moonshine, a new speech-to-text model that aims to significantly reduce the latency of voice interfaces. According to Useful Sensors founder Pete Warden, Moonshine returns results 1.7 times faster than OpenAI’s Whisper model while matching or exceeding its accuracy. The model’s variable-length input window allows it …

Open washing: AI companies mislead with “open source” label

A study by Andreas Liesenfeld and Mark Dingemanse from Radboud University’s Center for Language Studies reveals that many AI companies, including Google, Meta, and Microsoft, engage in “open washing” by mislabeling their products as open source. The researchers surveyed 45 text and text-to-image models and found that while a handful of lesser-known models meet the …

Open Source Initiative releases first Open Source AI Definition

The Open Source Initiative (OSI) has released version 1.0 of its Open Source AI Definition (OSAID), establishing the first industry standard for determining whether an AI system can be considered truly open source. Developed through years of collaboration with academia and industry, the OSAID requires open source AI to provide sufficient information to substantially recreate …

Meta releases AI models for mobile devices

Meta Platforms has released quantized versions of its Llama 3.2 1B and 3B models, which the company says reduce memory requirements and speed up on-device inference while preserving accuracy and portability. The models were developed in close collaboration with Qualcomm and MediaTek and are available on SoCs with Arm CPUs. According to Meta, the average model size has …
