Report shows shifts in AI model popularity across text, image, video

Poe, a platform for exploring and comparing AI models, has released its “Early 2025 AI Ecosystem Trends” report revealing significant shifts in user preferences across text, image, and video generation models. According to the report, OpenAI and Anthropic dominate text generation with approximately 85% of message share, while newcomers like DeepSeek and Google’s Gemini are … Read more

Cohere releases Aya Vision, a multilingual vision model with open weights

Cohere’s research division has launched Aya Vision, an open-weight vision model supporting 23 languages. According to Carl Franzen’s report in VentureBeat, the model comes in 8-billion and 32-billion parameter versions and can analyze images, generate text, and translate visual content. Aya Vision outperforms larger models like Llama 90B while requiring fewer computational resources. The model … Read more

Microsoft brings Copilot app to Mac with new features

Microsoft has launched a native Copilot app for macOS users in the US, UK, and Canada. According to Tom Warren from The Verge, the app provides access to Microsoft’s web-based AI assistant, allowing users to generate images and text or upload images. The Mac version includes dark mode support and can be activated with Command … Read more

These diffusion-based language models run 10 times faster than current LLMs

Inception Labs has unveiled Mercury, a new family of diffusion-based large language models (dLLMs) that can generate text up to 10 times faster than conventional autoregressive LLMs. According to the company, Mercury models can process over 1,000 tokens per second on NVIDIA H100 GPUs, speeds previously achievable only with specialized hardware. The company’s first publicly … Read more

Microsoft introduces efficient Phi-4 for text, image, speech processing

Microsoft has unveiled two new AI models in its Phi series: Phi-4-multimodal with 5.6 billion parameters and Phi-4-mini with 3.8 billion parameters. These small language models (SLMs) deliver exceptional performance while requiring significantly less computing power than larger systems, challenging the notion that bigger AI models are always better. The Phi-4-multimodal model stands out for … Read more

OpenAI launches GPT-4.5 model

OpenAI has officially launched GPT-4.5, its newest and largest AI language model to date. The model, previously known internally as “Orion,” is being released as a research preview with claims of enhanced conversational abilities, reduced hallucination rates, and improved emotional intelligence compared to previous models. While OpenAI positions GPT-4.5 as its “largest and best model … Read more

Claude 3.7 Sonnet has customizable reasoning capabilities

Anthropic has introduced Claude 3.7 Sonnet, positioning it as the first hybrid AI reasoning model that combines quick responses with extended thinking capabilities. The model allows users to choose between immediate answers and more thorough analysis, with API users having precise control over the model’s thinking time up to 128,000 tokens. Key Features and Capabilities … Read more

Elon Musk’s xAI launches Grok 3, claiming superior performance to existing AI models

XAI, Elon Musk’s artificial intelligence company, has launched its latest AI model Grok 3, marking a significant development in the competitive AI landscape. The new model family was unveiled during a livestreamed presentation, with Musk claiming it represents a major advancement over its predecessor, Grok 2. Technical Infrastructure and Capabilities The model’s development relied on … Read more

Ai2 Tulu 3 is an open-source language model rivaling leading systems

The Allen Institute for Artificial Intelligence (Ai2) has released Tulu 3 405B, a new AI language model that, according to the institute’s internal testing, outperforms several leading AI systems including DeepSeek V3 and matches capabilities with OpenAI’s GPT-4o on certain benchmarks. The model contains 405 billion parameters and required 256 GPUs running in parallel for … Read more

Google launches Gemini 2.0 Flash Thinking for free

Google has released Gemini 2.0 Flash Thinking, a new AI model that can process up to one million tokens of text while showing its reasoning process. According to Michael Nuñez at VentureBeat, the model is available for free through Google AI Studio under the experimental designation “Exp-01-21.” The system achieved a 73.3% score on the … Read more