Claude Opus 4.5 brings lower prices and new features

The AI company Anthropic has released Claude Opus 4.5, its latest flagship model. The launch is accompanied by a significant price reduction, new product features, and performance claims that position it competitively against models from rivals like OpenAI and Google. According to Anthropic, Claude Opus 4.5 delivers state-of-the-art performance, particularly in software engineering and complex …

Read more

Ai2 releases Olmo 3 with a focus on full development transparency

The Allen Institute for AI (Ai2), a Seattle-based nonprofit research institute, has released Olmo 3, a new family of open-source language models. According to Ai2, the new models are designed to be competitive with other leading open models in performance and efficiency while offering a new level of transparency for developers and researchers. The key …

Read more

Kimi K2 Thinking: open source AI matches performance of leading proprietary systems

A new open source AI model from Chinese startup Moonshot AI has matched or exceeded the performance of leading proprietary systems, including OpenAI’s GPT-5 and Anthropic’s Claude Sonnet 4.5, according to multiple benchmarks. Carl Franzen reports for VentureBeat that the Kimi K2 Thinking model achieved 44.9 percent on Humanity’s Last Exam, 60.2 percent on BrowseComp, …

Read more

Microsoft introduces Agent Mode for automated Office tasks

Microsoft has launched Agent Mode for Excel and Word, along with Office Agent in Copilot chat, marking a significant expansion of AI capabilities in its productivity suite. The company calls this approach “vibe working,” comparing it to the concept of “vibe coding” where users create complex outputs through simple prompts. Agent Mode represents a more …

Read more

Google updates Gemini 2.5 Flash models for developers and app users

Google has announced updates for its Gemini 2.5 Flash and Flash-Lite models, releasing improvements for both the public-facing Gemini app and for developers through new preview versions. According to a report by 9to5Google, users of the Gemini app will notice several enhancements to the Gemini 2.5 Flash model. Responses are now better organized with improved …

Read more

New Qwen3-VL model aims to understand and act in the digital world

The QwenTeam has released a new series of open-source vision-language models called Qwen3-VL. According to the team’s official announcement, the models are designed not just to see images and videos but to understand context, reason about events, and perform actions. The flagship model, Qwen3-VL-235B-A22B, is available in two versions. The developers claim the “Instruct” version …

Read more

Qwen3-Omni is an open-source model for text, image, audio, and video

The Chinese technology company Alibaba has released Qwen3-Omni, a new generative AI model that can process a combination of text, images, audio, and video. The model is notable for its “omni-modal” capabilities and its open-source license, positioning it as a direct competitor to proprietary models from U.S. tech companies like OpenAI and Google. According to …

Read more

AI startup Nous Research releases unrestricted chatbot that outperforms major competitors

Nous Research has quietly launched Hermes 4, a family of AI language models that the company claims matches leading commercial systems while removing most content restrictions. Michael Nuñez reports about the release for VentureBeat. Unlike ChatGPT or Claude, Hermes 4 responds to nearly any request without safety guardrails that have become standard in commercial AI …

Read more

Study finds AI reasoning is pattern matching, not true understanding

Researchers from Arizona State University conclude that the reasoning abilities of large language models are a “brittle mirage.” According to an article by Kyle Orland in Ars Technica, these models struggle significantly with problems that deviate from their training data. In a controlled experiment, the researchers found that an AI’s performance collapsed when tasks were …

Read more