What makes ChatGPT’s new image generator so special?

ChatGPT’s new image generation isn’t just an upgrade—it’s a major shift in how AI creates visuals. The result: More accurate images, better handling of complex scenes, and legible, usable text in the image itself. That’s a big deal if you work in design, content creation, marketing, or any other visual field. While other image generators … Read more

OpenAI brings image generation to a new level

OpenAI has launched native image generation capabilities directly within ChatGPT, powered by its multimodal model GPT-4o. This new feature, called “Images in ChatGPT,” is now available to users across Plus, Pro, Team, and Free subscription tiers, with Enterprise, Edu, and API access coming soon. Unlike the previous DALL-E 3 image generator, which was a separate … Read more

Reve Image 1.0 is a promising new AI image generator

Reve AI has released Reve Image 1.0, a new text-to-image generation model that currently ranks first in image generation quality according to third-party evaluator Artificial Analysis. As reported by Carl Franzen in VentureBeat, the model excels at prompt adherence, aesthetics, and typography – outperforming competitors like Midjourney v6.1 and Google’s Imagen 3. The Palo Alto-based … Read more

Stable Virtual Camera creates videos from 2D images

Stability AI has released Stable Virtual Camera, a new multi-view diffusion model that transforms 2D images into videos. According to Ana Guillen, the research preview enables users to generate immersive videos with realistic depth and perspective without complex reconstruction or scene-specific optimization. The model can create videos from a single image or up to 32 … Read more

Google’s Gemini model used to remove image watermarks

A recent discovery shows that Google’s Gemini 2.0 Flash AI model can remove watermarks from images, including those from Getty Images and other stock photo providers. According to reporting by Kyle Wiggers for TechCrunch, users on social media platforms have been sharing examples of this controversial capability. Unlike some competing AI models such as Anthropic’s … Read more

Google introduces native image generation in Gemini 2.0 Flash

Google has announced the release of native image generation capabilities in its Gemini 2.0 Flash model, now available for developer experimentation through Google AI Studio and the Gemini API. This marks a significant advancement as Google becomes the first major U.S. tech company to integrate multimodal image generation directly within a model for consumer use. … Read more

Report shows shifts in AI model popularity across text, image, video

Poe, a platform for exploring and comparing AI models, has released its “Early 2025 AI Ecosystem Trends” report revealing significant shifts in user preferences across text, image, and video generation models. According to the report, OpenAI and Anthropic dominate text generation with approximately 85% of message share, while newcomers like DeepSeek and Google’s Gemini are … Read more

Microsoft brings Copilot app to Mac with new features

Microsoft has launched a native Copilot app for macOS users in the US, UK, and Canada. According to Tom Warren from The Verge, the app provides access to Microsoft’s web-based AI assistant, allowing users to generate images and text or upload images. The Mac version includes dark mode support and can be activated with Command … Read more

Alibaba’s video and image AI model Wan 2.1 now open source

Alibaba Group has made its video and image generation AI model Wan 2.1 publicly available as open source. Reuters reports that four variants of the model are now accessible globally through Alibaba Cloud’s ModelScope and HuggingFace platforms for academic, research, and commercial use. The most powerful variants can process up to 14 billion parameters, enabling … Read more

Napkin AI creates graphics using specialized AI agent teams

A new AI-powered design tool called Napkin AI is changing how professionals create graphics by employing multiple specialized AI agents. According to an article by Matt Marshall in VentureBeat, the tool can generate customizable graphics from text input within five seconds. The system uses different AI agents to handle specific design tasks, similar to how … Read more