Google shares tips for its image model Nano Banana, now generally available

Google has made its image model, Gemini 2.5 Flash Image, also known as Nano Banana, generally available for production use. According to the company, new features include support for ten different aspect ratios for various formats, from cinematic to vertical, and the option to specify image-only output. In a post, Google provided several tips for …

Read more

New Qwen3-VL model aims to understand and act in the digital world

The QwenTeam has released a new series of open-source vision-language models called Qwen3-VL. According to the team’s official announcement, the models are designed not just to see images and videos but to understand context, reason about events, and perform actions. The flagship model, Qwen3-VL-235B-A22B, is available in two versions. The developers claim the “Instruct” version …

Read more

Qwen3-Omni is an open-source model for text, image, audio, and video

The Chinese technology company Alibaba has released Qwen3-Omni, a new generative AI model that can process a combination of text, images, audio, and video. The model is notable for its “omni-modal” capabilities and its open-source license, positioning it as a direct competitor to proprietary models from U.S. tech companies like OpenAI and Google. According to …

Read more

Google rolls out conversational AI photo editing to more Android users

Google is making its conversational photo editing feature available to all eligible Android users in the U.S. The tool, powered by Google’s Gemini AI, allows people to edit photos by describing changes with voice or text commands. According to an official company post, this avoids the need to switch between different tools or adjust sliders. …

Read more

Google provides tips for maximizing Gemini’s improved image generation

Google DeepMind explains in a new post how to use the improved image generation in Gemini to its full potential. Product Manager Naina Raisinghani shared specific prompting strategies to achieve better results with the updated model. The company recommends including six key elements in prompts: subject, composition, action, location, style, and editing instructions. Users should …

Read more

Google wows with new AI image model focused on editing and consistency

Google has released an updated AI model, Gemini 2.5 Flash Image, designed to provide users with more control over image generation and editing. The model, which was anonymously tested on the crowdsourced evaluation platform LMArena under the codename “nano-banana,” is now integrated into the consumer-facing Gemini app and available to developers through the Gemini API, …

Read more

Alibaba’s new AI model edits images with text commands

Alibaba’s Qwen Team has released a new open-source AI model named Qwen-Image Edit that allows users to modify images using simple text instructions. The tool is capable of performing a wide range of complex editing tasks that challenge established software like Adobe Photoshop. According to an article by Carl Franzen for VentureBeat, users can upload …

Read more

The best AI image generators for professional use compared

Not long ago, generating an image with AI was mostly a technological novelty. It was a nice party trick with limited practical use. As of 2025, that has changed. AI image generation has evolved into a practical tool for marketing professionals. From unique visuals for social media campaigns to website assets and conceptual product shots: …

Read more

Google makes Imagen 4 text-to-image models generally available

Google has released its Imagen 4 family of text-to-image models, making them generally available in the Gemini API and Google AI Studio. According to a post on the Google Developers blog, the release includes the new Imagen 4 Fast model, which is optimized for speed and high-volume tasks. This model is priced at $0.02 per …

Read more

Alibaba’s new open source AI model aims to master text in images

Alibaba’s Qwen Team has released a new open source AI image generator called Qwen-Image. According to its developers, the model specializes in creating images with accurately rendered text in both English and Chinese, a common challenge for many AI systems. Writing for VentureBeat, journalist Carl Franzen notes this capability allows for creating content like posters, …

Read more