Alibaba launches new AI image generator

Alibaba has unveiled Qwen VLo, a new AI model that generates and modifies images from text prompts and existing visuals. The Chinese e-commerce giant introduced the technology as part of its aggressive expansion into AI services, according to Bloomberg reporter Luz Ding. The model can create images from text descriptions like “generate a picture of …

Read more

Alibaba launches Qwen3 models with competitive AI reasoning capabilities

Alibaba has released Qwen3, a new family of large language models that compete with leading AI systems from OpenAI and Google. The lineup includes two mixture-of-experts (MoE) models and six dense models, with parameters ranging from 0.6 billion to 235 billion. According to benchmarks shared by Alibaba, the flagship Qwen3-235B-A22B model outperforms DeepSeek R1 and …

Read more

Alibaba launches QwQ-32B, a powerful reasoning model that rivals larger competitors

Alibaba’s Qwen Team has introduced QwQ-32B, a new open-source language model that matches the performance of much larger models like DeepSeek-R1 despite having significantly fewer parameters. The 32-billion-parameter model, released under the Apache 2.0 license, leverages reinforcement learning (RL) to enhance reasoning capabilities for complex problem-solving tasks. Key features and capabilities QwQ-32B demonstrates impressive performance …

Read more

Alibaba releases new AI models challenging global tech leaders

Alibaba’s Qwen team has launched two significant AI models – Qwen2.5-VL and Qwen2.5-Max – that demonstrate advanced capabilities in various tasks. According to the company, these models can perform text and image analysis, control computers and mobile devices, and compete with established AI systems from OpenAI, Anthropic, and Google on multiple benchmarks. The Qwen2.5-VL model …

Read more

DeepSeek releases new reasoning models and introduces distilled versions

Chinese AI company DeepSeek has announced the release of its new reasoning-focused language models DeepSeek-R1-Zero and DeepSeek-R1, along with six smaller distilled versions. The main models, built on DeepSeek’s V3 architecture, feature 671 billion total parameters with 37 billion activated parameters and a context length of 128,000 tokens. According to company statements, DeepSeek-R1 achieves performance …

Read more

Alibaba cuts prices on Qwen language model by 85 percent

Alibaba Cloud has announced a major price reduction of up to 85 percent on its Qwen-VL large language model, which processes both text and images. According to Ryan Browne from Reuters, this move reflects the intensifying competition in China’s AI market. The price cut follows earlier reductions of up to 97 percent in May. Alibaba’s …

Read more

Alibaba releases new visual AI model QVQ for enhanced reasoning capabilities

Alibaba’s Qwen team has released QVQ-72B-Preview, a new experimental visual AI model designed to enhance visual reasoning capabilities. Built upon their Qwen2-VL-72B architecture, the model aims to combine language and vision processing to tackle complex analytical tasks. According to company statements, QVQ achieved a score of 70.3 on the MMMU benchmark, marking an improvement over …

Read more

Alibaba releases new AI reasoning model to compete with OpenAI o1

Alibaba has released Qwen with Questions (QwQ), a new artificial intelligence reasoning model designed to compete with OpenAI’s o1 system. The model features 32 billion parameters and can process contexts of up to 32,000 tokens. According to Alibaba’s testing, QwQ outperforms OpenAI’s o1-preview on mathematical and scientific reasoning benchmarks AIME and MATH. The company states …

Read more

Alibaba extends Qwen AI model to process one million tokens

Alibaba Cloud has launched an upgraded version of its Qwen2.5-Turbo AI model that can now process contexts of up to one million tokens, equivalent to approximately 1.5 million Chinese characters or 10 full-length novels. The improved model achieves 93.1 points on the RULER long text evaluation benchmark, surpassing GPT-4’s score of 91.6. According to Alibaba, …

Read more

Arch-Function accelerates AI agents

Katanemo has introduced Arch-Function, a collection of open-source large language models (LLMs) designed for ultra-fast function-calling tasks essential for agentic applications in enterprises. According to reporting from VentureBeat, these models operate nearly 12 times faster than OpenAI’s GPT-4 and significantly outperform offerings from other competitors, while also providing substantial cost savings. Arch-Function builds on Katanemo’s …

Read more