Local AI - ✦ Smart Content Report

PrismML brings a 27B-parameter AI model to smartphones

July 17, 2026

PrismML introduces Bonsai 27B, a compressed version of Qwen3.6 27B that the company describes as the first model of its size to run entirely on a smartphone. PrismML announces in an official blog post that the new model shrinks a 27 billion parameter system down to a few gigabytes without abandoning its reasoning, tool use, …

China’s Kimi K3 rivals the world’s biggest AI models and it’s open source

July 17, 2026

Moonshot AI, the Beijing based startup backed by Alibaba, has released Kimi K3, a 2.8 trillion parameter model the company calls in an official blog post the world’s first open 3T-class model. The system combines two new architectural components, Kimi Delta Attention and Attention Residuals, with a 1 million token context window and native vision …

Thinking Machines shows Inkling, an open source multimodal AI model

July 16, 2026

Thinking Machines, the AI startup founded by former OpenAI chief technology officer Mira Murati, has released its first major language model under an open source license. Carl Franzen reports for VentureBeat that the model, called Inkling, targets enterprises that want to run AI on their own servers while keeping costs low and avoiding ideological filtering. …

Rowboat brings a local-first AI coworker to the desktop

July 8, 2026

A new open-source tool called Rowboat wants to turn AI assistants into a full workspace rather than just a chat window. The developer, posting under the handle segmenta, introduces the project in a Show HN post on Hacker News, describing it as a local-first alternative to Claude Desktop. The team behind Rowboat previously built AI …

Why tiny AI models could outperform giant chatbots

July 8, 2026

In 2019, Nigerian entrepreneur Adebayo Alonge faced a technical failure during a product demo in Cape Town: his AI-powered pill scanner needed to reach a data center 14,000 kilometers away, and the connection was far too slow. David Berreby reports for IEEE Spectrum that this incident pushed Alonge to shrink his AI model within hours …

Half the size, no legal fine print: Tencent’s Hy3 challenges GLM-5.2

July 7, 2026

Tencent has released the full version of its Hy3 language model under the permissive Apache 2.0 license, removing regional restrictions that previously excluded the European Union, the United Kingdom and South Korea. Sam Witteveen reports for VentureBeat that the license reversal, more than the model’s capabilities, drove the strongest reaction from the open-model community. Hy3 …

GPT-NL: Netherlands builds its own AI language model to cut reliance on US tech giants

June 16, 2026

The Netherlands is building its own large language model, independent of major American AI providers. TNO writes that the project, called GPT-NL, is being developed by research institute TNO together with SURF and the Netherlands Forensic Institute (NFI). The Dutch government has allocated €13.5 million to the initiative through the Netherlands Enterprise Agency. The project …

Google DiffusionGemma generates text four times faster than standard models

June 10, 2026

Google has released DiffusionGemma, an experimental open AI model that takes a fundamentally different approach to generating text. Brendan O’Donoghue and Sebastian Flennerhag write for Google’s blog The Keyword that the model can produce text up to four times faster than conventional large language models on dedicated GPUs. It is available under an Apache 2.0 …

Googles Gemma 4 12B is powerful and runs locally on laptops with just 16 GB of memory

June 4, 2026

Google has released Gemma 4 12B, an open-weights multimodal AI model designed to run entirely on a standard laptop with 16 GB of VRAM or unified memory. The model is available now for free download and marks a notable step toward powerful AI that operates offline, without sending data to the cloud. The model is …

Odysseus wants to replace ChatGPT and Claude on your home server

June 2, 2026

Odysseus is a new open-source project that lets users run a full-featured AI workspace on their own hardware. Developer pewdiepie-archdaemon publishes the project on GitHub as a self-hosted alternative to commercial AI interfaces such as ChatGPT and Claude, with a strong focus on privacy and local data storage. And, yes, that PewDiePie, the YouTube guy. …