Qwen-Image-2512: High-quality AI images go open source to challenge Google's dominance

Alibaba’s Qwen team has released Qwen-Image-2512, an update to its foundation model for generating images from text. The model is now available for public use and enterprise integration. According to the developers, this version focuses on three main areas: human realism, natural detail, and text rendering.

The model aims to reduce the artificial appearance often associated with AI-generated content. Alibaba reports that the update provides better rendering of human subjects. This includes more accurate facial details and better environmental context. For example, the model captures age-related features like wrinkles more effectively than previous versions. It also shows improved accuracy in body postures and physical movements described in a prompt.

Beyond human subjects, the update enhances natural elements. This includes finer details in landscapes, animal fur, and water effects. VentureBeat notes that these improvements make the model useful for industries like e-commerce and education. High-quality textures reduce the need for manual image cleanup after generation.

A significant feature of Qwen-Image-2512 is its ability to handle complex layouts and text. The model can generate complete presentation slides, infographics, and posters with accurate Chinese and English text. This capability places the model in direct competition with proprietary systems like Google’s Gemini 3 Pro Image (aka Nano Banana Pro). VentureBeat reports that while Google’s model set a high bar for enterprise visuals, it remains a closed system tied to specific cloud infrastructure.

The new Qwen model follows a different path by using the Apache 2.0 license. This allows individuals and companies to download the model weights from platforms like Hugging Face or ModelScope. Users can modify and host the model on their own hardware. This approach offers several advantages for businesses:

  1. Cost management: Organizations can avoid per-image API fees by using their own infrastructure.
  2. Data privacy: Companies in regulated sectors can maintain control over their data residency.
  3. Customization: Developers can fine-tune the model for specific cultural styles or internal branding.

Alibaba also offers a managed version of the model through its Cloud Model Studio API. This service is priced at $0.075 per generated image. This dual strategy allows for both in-house customization and simplified cloud deployment.
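To make the cost trade-off concrete, here is a minimal Python sketch comparing the managed API with self-hosting. Only the $0.075 per-image price comes from the article. The GPU hourly rate and the throughput figure are hypothetical placeholders, not measured values for Qwen-Image-2512, and the calculation ignores idle time, storage, and engineering effort.

```python
# Price per generated image on the managed API, as reported in the article.
API_PRICE_PER_IMAGE_USD = 0.075


def api_cost(images: int) -> float:
    """Total managed-API cost for a given number of images."""
    return images * API_PRICE_PER_IMAGE_USD


def self_hosted_cost(images: int, gpu_hourly_usd: float, images_per_hour: float) -> float:
    """Rough self-hosting cost: GPU hours needed times the hourly rate.

    Both gpu_hourly_usd and images_per_hour are assumptions supplied by the
    caller; real values depend on hardware, resolution, and sampler settings.
    """
    hours_needed = images / images_per_hour
    return hours_needed * gpu_hourly_usd


def break_even_throughput(gpu_hourly_usd: float) -> float:
    """Images per hour above which self-hosting undercuts the API price."""
    return gpu_hourly_usd / API_PRICE_PER_IMAGE_USD


if __name__ == "__main__":
    # Hypothetical inputs, chosen only for illustration.
    monthly_images = 10_000
    gpu_rate = 2.00      # USD per GPU hour (assumed)
    throughput = 60.0    # images per hour (assumed)

    print(f"API:         ${api_cost(monthly_images):,.2f}")
    print(f"Self-hosted: ${self_hosted_cost(monthly_images, gpu_rate, throughput):,.2f}")
    print(f"Break-even:  {break_even_throughput(gpu_rate):.1f} images/hour")
```

Under these assumed inputs, self-hosting is cheaper only when the hardware sustains more images per hour than the break-even figure. At low or bursty volumes, the per-image API may remain the simpler choice.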

In internal evaluations on the AI Arena platform, Alibaba claims Qwen-Image-2512 is currently the strongest open-source model available. These tests suggest it remains competitive with leading closed-source models. Journalists at VentureBeat observe that this release signals a shift in the market. Open-source models are now matching the features most important to professional users, such as layout control and realistic textures.

Sources: Qwen Blog, VentureBeat
