OpenAI launches o3 and o4-mini with enhanced reasoning and visual capabilities

OpenAI has released two new AI models, o3 and o4-mini, designed to advance reasoning capabilities and introduce novel features like “thinking with images.” These models represent the company’s latest development in its o-series, coming just days after the release of GPT-4.1. The models’ most distinctive feature is their ability to not just recognize images but …

Canva unveils Visual Suite 2.0 with AI and spreadsheet features

Canva has announced its largest product update yet, Visual Suite 2.0, during its Canva Create 2025 event. The new suite introduces several AI-powered features aimed at bridging the gap between productivity and creativity. A centerpiece of the update is Canva Sheets, a new spreadsheet tool that incorporates AI capabilities like Magic Insights and Magic Formulas …

Krea platform unifies AI tools for visual creatives

Krea’s platform targets visual creatives and integrates multiple generative AI models in one interface, helping designers navigate the overwhelming landscape of AI tools. As reported by Ingrid Lunden in TechCrunch, the San Francisco-based startup has raised $83 million across several funding rounds, including a recent $47 million Series B led by Bain Capital Ventures. …

Midjourney releases V7 Alpha with voice prompting and draft mode

Midjourney has released V7 Alpha, its first new AI image generation model in nearly a year, featuring voice prompting capabilities and a faster draft mode. The launch comes a week after OpenAI debuted a new image generator in ChatGPT that quickly gained popularity. According to Midjourney CEO David Holz, V7 represents a “totally different architecture” …

What makes ChatGPT’s new image generator so special?

ChatGPT’s new image generation isn’t just an upgrade; it’s a major shift in how AI creates visuals. The result: more accurate images, better handling of complex scenes, and legible, usable text in the image itself. That’s a big deal if you work in design, content creation, marketing, or any other visual field. While other image generators …

OpenAI brings image generation to a new level

OpenAI has launched native image generation capabilities directly within ChatGPT, powered by its multimodal model GPT-4o. This new feature, called “Images in ChatGPT,” is now available to users across Plus, Pro, Team, and Free subscription tiers, with Enterprise, Edu, and API access coming soon. Unlike the previous DALL-E 3 image generator, which was a separate …

Reve Image 1.0 is a promising new AI image generator

Reve AI has released Reve Image 1.0, a new text-to-image generation model that currently ranks first in image generation quality according to third-party evaluator Artificial Analysis. As reported by Carl Franzen in VentureBeat, the model excels at prompt adherence, aesthetics, and typography – outperforming competitors like Midjourney v6.1 and Google’s Imagen 3. The Palo Alto-based …

Stable Virtual Camera creates videos from 2D images

Stability AI has released Stable Virtual Camera, a new multi-view diffusion model that transforms 2D images into videos. According to Ana Guillen, the research preview enables users to generate immersive videos with realistic depth and perspective without complex reconstruction or scene-specific optimization. The model can create videos from a single image or up to 32 …

Google’s Gemini model used to remove image watermarks

A recent discovery shows that Google’s Gemini 2.0 Flash AI model can remove watermarks from images, including those from Getty Images and other stock photo providers. According to reporting by Kyle Wiggers for TechCrunch, users on social media platforms have been sharing examples of this controversial capability. Unlike some competing AI models such as Anthropic’s …

Google introduces native image generation in Gemini 2.0 Flash

Google has announced the release of native image generation capabilities in its Gemini 2.0 Flash model, now available for developer experimentation through Google AI Studio and the Gemini API. This marks a significant advancement as Google becomes the first major U.S. tech company to integrate multimodal image generation directly within a model for consumer use. …
