Reasoning | ✦ Smart Content Report

Meta launches proprietary AI model Muse Spark

April 9, 2026

Meta has released Muse Spark, a new proprietary artificial intelligence model built by its internal division Meta Superintelligence Labs. The model is available through the Meta AI app and website, with a private API preview for select users. Unlike Meta’s previous Llama models, Muse Spark is not open source. Muse Spark can process text and …

Google releases Gemma 4, its most capable open AI model family

April 2, 2026

Google has launched Gemma 4, a new family of open-weight AI models that the company describes as its most capable to date. The models are built on the same research and technology as Google DeepMind’s proprietary Gemini 3 system and are released under an Apache 2.0 open-source license, which allows developers to use and modify …

Mistral Small 4 is a unified AI model for reasoning, coding and image analysis

March 17, 2026

Mistral AI has released Mistral Small 4, a new open-source artificial intelligence model that combines reasoning, multimodal processing and coding capabilities in a single system. The company reports that users no longer need to switch between separate specialised models for different tasks. The model uses a Mixture of Experts architecture with 128 experts (specialised sub-networks), activating only …

Think less, do more: Microsoft’s new tiny AI knows when to skip the hard thinking

March 10, 2026

Microsoft has released Phi-4-reasoning-vision-15B, a compact AI model that processes both images and text and can solve complex math and science problems. Michael Nuñez reports for VentureBeat that the 15-billion-parameter model matches or exceeds the performance of much larger systems while using significantly less computing power and training data. The model is available now on …

GPT‑5.4 aims to handle real professional work as OpenAI expands agent-style AI

March 5, 2026

OpenAI has released GPT‑5.4, a new AI model designed for professional tasks such as coding, document creation, spreadsheet analysis, and multi‑step workflows. The company positions the model as its most capable system for knowledge work and software development so far. The model is available across ChatGPT, the OpenAI API, and the company’s coding tool Codex. …

Should you walk or drive 50 meters to a car wash? Most AI models get it wrong

February 23, 2026

A deceptively simple question has exposed a widespread reasoning failure across the artificial intelligence industry. Felix Wunderlich writes for opper.ai that 42 out of 53 leading AI models answered incorrectly when asked: “I want to wash my car. The car wash is 50 meters away. Should I walk or drive?” The correct answer is, of …

Google releases Gemini 3.1 Pro with much improved reasoning

February 19, 2026

Google has released Gemini 3.1 Pro, an updated version of its Gemini 3 Pro AI model. The company describes it as a step forward in core reasoning, intended for complex tasks where straightforward answers fall short. The model is now available to consumers through the Gemini app and NotebookLM, though access on those platforms is …

Claude Sonnet 4.6: near-flagship performance at mid-tier pricing

February 18, 2026

Anthropic has released Claude Sonnet 4.6, a significant upgrade to its mid-tier AI model. The company says it outperforms its predecessor across coding, computer use, long-context reasoning, agent planning, knowledge work, and design. Sonnet 4.6 is now the default model in claude.ai and Claude Cowork and carries the same price as its predecessor, Sonnet 4.5, …

OpenAI releases GPT-5.3-Codex with 25% speed boost and expanded capabilities

February 5, 2026

OpenAI has launched GPT-5.3-Codex, a coding model that the company describes as its most capable agentic coding tool to date. The model runs 25% faster than its predecessor and combines advanced coding performance with reasoning capabilities in a single system. According to OpenAI, GPT-5.3-Codex marks a significant milestone as the first model that helped create …

Anthropic releases Claude Opus 4.6 with expanded context window and agent teams

February 5, 2026

Anthropic has released Claude Opus 4.6, an upgraded version of its flagship AI model that can handle longer conversations and coordinate multiple AI agents working simultaneously on complex tasks. The company claims the model outperforms competitors including OpenAI’s GPT-5.2 on several professional benchmarks. The release introduces a one-million-token context window for the first time in …