New Qwen3-VL model aims to understand and act in the digital world
The Qwen team has released a new series of open-source vision-language models called Qwen3-VL. According to the team’s official announcement, the models are designed not just to see images and videos but also to understand context, reason about events, and perform actions. The flagship model, Qwen3-VL-235B-A22B, is available in two versions. The developers claim the “Instruct” version …