Stable Virtual Camera creates videos from 2D images

Stability AI has released Stable Virtual Camera, a new multi-view diffusion model that transforms 2D images into videos. According to Ana Guillen, the research preview enables users to generate immersive videos with realistic depth and perspective without complex reconstruction or scene-specific optimization. The model can create videos from a single image or up to 32 …

Read more

Google’s Gemini model used to remove image watermarks

A recent discovery shows that Google’s Gemini 2.0 Flash AI model can remove watermarks from images, including those from Getty Images and other stock photo providers. According to reporting by Kyle Wiggers for TechCrunch, users on social media platforms have been sharing examples of this controversial capability. Unlike some competing AI models such as Anthropic’s …

Read more

Google introduces native image generation in Gemini 2.0 Flash

Google has announced the release of native image generation capabilities in its Gemini 2.0 Flash model, now available for developer experimentation through Google AI Studio and the Gemini API. This marks a significant advancement as Google becomes the first major U.S. tech company to integrate multimodal image generation directly within a model for consumer use. …

Read more

Report shows shifts in AI model popularity across text, image, video

Poe, a platform for exploring and comparing AI models, has released its “Early 2025 AI Ecosystem Trends” report revealing significant shifts in user preferences across text, image, and video generation models. According to the report, OpenAI and Anthropic dominate text generation with approximately 85% of message share, while newcomers like DeepSeek and Google’s Gemini are …

Read more

Microsoft brings Copilot app to Mac with new features

Microsoft has launched a native Copilot app for macOS users in the US, UK, and Canada. According to Tom Warren from The Verge, the app provides access to Microsoft’s web-based AI assistant, allowing users to generate images and text or upload images. The Mac version includes dark mode support and can be activated with Command …

Read more

Alibaba’s video and image AI model Wan 2.1 now open source

Alibaba Group has made its video and image generation AI model Wan 2.1 publicly available as open source. Reuters reports that four variants of the model are now accessible globally through Alibaba Cloud’s ModelScope and HuggingFace platforms for academic, research, and commercial use. The most powerful variants can process up to 14 billion parameters, enabling …

Read more

Napkin AI creates graphics using specialized AI agent teams

A new AI-powered design tool called Napkin AI is changing how professionals create graphics by employing multiple specialized AI agents. According to an article by Matt Marshall in VentureBeat, the tool can generate customizable graphics from text input within five seconds. The system uses different AI agents to handle specific design tasks, similar to how …

Read more

How to: Train an AI image generator on your likeness

A new affordable method to train AI image models on personal photos allows users to generate custom AI images for approximately $3. The technique, detailed in a blog post by software developer Cory Zue, combines the Flux image model with LoRA (Low-Rank Adaptation) training technology on the Replicate platform. The process requires users to upload …

Read more

Generate images with Dall-E on your iPhone for free

Did you know Apple has integrated OpenAI’s Dall-E image generator into its Messages app? According to Tim Hardwick from MacRumors, the feature allows iPhone users to create AI-generated images directly within message conversations. The functionality is available on iPhone 15 Pro and iPhone 16 models with Apple Intelligence enabled. Users can activate the feature by …

Read more

DeepSeek Janus Pro image generator challenges established competitors

Chinese AI company DeepSeek has released a new family of AI models called Janus-Pro, with capabilities in both image analysis and creation. The models, ranging from 1 billion to 7 billion parameters, are available for download on the Hugging Face platform under an MIT license, allowing unrestricted commercial use. According to DeepSeek, the largest model …

Read more