Alibaba releases new AI models challenging global tech leaders

Alibaba’s Qwen team has launched two significant AI models – Qwen2.5-VL and Qwen2.5-Max – that demonstrate advanced capabilities in various tasks. According to the company, these models can perform text and image analysis, control computers and mobile devices, and compete with established AI systems from OpenAI, Anthropic, and Google on multiple benchmarks. The Qwen2.5-VL model family includes three versions, with the flagship 72B version requiring special permission for commercial deployment by large-scale users.

The new models, particularly Qwen2.5-Max, utilize a mixture-of-experts architecture that Alibaba claims requires fewer computational resources than traditional approaches. The company reports that its models achieve competitive results against industry leaders like GPT-4o and Claude-3.5-Sonnet in tests of advanced reasoning and knowledge. The Qwen2.5-Max model reportedly shows strong performance in code generation and reasoning tasks, with scores of 38.7% on LiveCodeBench and 89.4% on Arena-Hard testing.

The timing of these releases is notable, coming shortly after Chinese AI lab DeepSeek’s successful launch of its own models, which impacted U.S. technology markets. While the Qwen models demonstrate technical advancement, they maintain certain restrictions on discussion topics in compliance with Chinese regulations. Alibaba Cloud states that the models have been trained on over 20 trillion tokens, though questions about data sovereignty and regulatory compliance remain important considerations for potential users.

Sources: TechCrunch, VentureBeat, Reuters

Related posts:

Stay up-to-date: