OpenAI prepares to launch its first open-weights model since 2019

OpenAI is reportedly preparing to release a new open-weights language model, with a potential launch as soon as next week. This information comes from a report by Tom Warren in The Verge, citing sources familiar with OpenAI’s plans. The release would be the company’s first open-weights model since it launched GPT-2 in 2019. Unlike OpenAI’s …

Read more

Hugging Face releases small language model with full training recipe

Hugging Face has launched SmolLM3, a new 3-billion-parameter language model designed for high performance and efficiency. In their official post, the company states that the model outperforms others in its size class and is competitive with some larger alternatives. A key feature is its dual-mode capability, which allows it to provide direct answers or show …

Read more

Apple’s AI tools in Shortcuts prove powerful but unreliable for automation

Apple’s new AI features in macOS Shortcuts offer impressive capabilities but reveal significant reliability issues, according to recent testing by technology writers. Dan Moren and Jason Snell from Six Colors experimented with the “Use Model” action, which processes data through AI models running on devices, Apple’s Private Cloud Compute, or OpenAI servers. Snell attempted to …

Read more

Lemony launches plug-and-play AI device for secure on-premise computing

Lemony has launched an on-premise AI device that allows organizations to run generative AI workflows without cloud dependence. The company secured $2 million in seed funding led by True Ventures, according to GamesBeat’s Dean Takahashi. The stackable device supports up to five users and comes preloaded with 16 open-source AI models, including IBM’s Granite family …

Read more

Google releases app to run AI models locally on Android

Google has quietly launched an experimental Android app called AI Edge Gallery that allows users to run artificial intelligence models directly on their smartphones without an internet connection. The app is available through GitHub and will come to iOS devices later. Users can download and run AI models from the Hugging Face platform to perform …

Read more

Microsoft expands Phi language model family with new reasoning capabilities

Microsoft has introduced three new small language models (SLMs) focused on complex reasoning tasks: Phi-4-reasoning, Phi-4-reasoning-plus, and Phi-4-mini-reasoning. These models represent a significant advancement in what small AI models can accomplish, particularly in mathematical reasoning and multi-step problem solving. The flagship Phi-4-reasoning-plus, a 14-billion parameter model, demonstrates performance that rivals much larger AI systems. According …

Read more

Alibaba launches Qwen3 models with competitive AI reasoning capabilities

Alibaba has released Qwen3, a new family of large language models that compete with leading AI systems from OpenAI and Google. The lineup includes two mixture-of-experts (MoE) models and six dense models, with parameters ranging from 0.6 billion to 235 billion. According to benchmarks shared by Alibaba, the flagship Qwen3-235B-A22B model outperforms DeepSeek R1 and …

Read more

Pleias launches small reasoning models optimized for RAG with built-in citations

French AI startup Pleias has released two open-source small reasoning models specifically designed for retrieval-augmented generation (RAG) with native citation support. As reported by Carl Franzen for VentureBeat, the new models—Pleias-RAG-350M and Pleias-RAG-1B—are available under the Apache 2.0 license, allowing commercial use. Despite their small size, the models outperform many larger alternatives on multi-hop reasoning …

Read more

Google’s Gemma 3 models now run on consumer GPUs through quantization

Google has released new versions of its Gemma 3 AI models that can run on consumer-grade graphics cards through a technique called Quantization-Aware Training (QAT). This development makes powerful AI models accessible to users without high-end hardware. The company announced that QAT dramatically reduces memory requirements while maintaining high quality performance. Gemma 3’s largest 27B …

Read more

Nous Research launches AI model with optional reasoning mode

Nous Research has released DeepHermes-3, a new AI language model that allows users to switch between detailed reasoning and quick responses. As reported by Carl Franzen for VentureBeat, this 8-billion parameter model builds on Meta’s Llama technology. Users can activate a special reasoning mode that makes the AI show its thought process before providing answers. …

Read more