Amazon launches Nova family of AI models for text, image and video generation

Amazon Web Services has introduced Nova, a new family of artificial intelligence models designed for text, image and video generation. The announcement was made by CEO Andy Jassy at the AWS re:Invent conference. The Nova family consists of four text-generating models: Micro, Lite, Pro, and Premier. Micro, Lite, and Pro are immediately available to AWS …

Read more

Luma AI expands with new video and image features

Luma AI has significantly upgraded its Dream Machine AI platform with new video creation capabilities and an intuitive interface accessible via web and mobile apps. According to reporter Carl Franzen, the platform now includes Luma Photon, a new image foundation model that enables advanced text-to-image and video generation features. The system serves over 25 million …

Read more

LLaVA-o1 brings structured reasoning to visual language processing

Chinese researchers have developed LLaVA-o1, an open-source vision language model that introduces a four-stage reasoning process for analyzing images and text. As reported by Ben Dickson for VentureBeat, the model breaks down complex tasks into summary, caption, reasoning, and conclusion phases. The system, built on Llama-3.2-11B-Vision-Instruct and trained on 100,000 image-question-answer pairs, employs a novel …

Read more

Article reviews AI tools for content creation and social media management

An article by HootSuite presents a comprehensive analysis of 18 AI-powered tools designed to help content creators and social media marketers streamline their workflow. Author Chloe West evaluates popular platforms including OwlyWriter, ChatGPT, Claude, and Midjourney, detailing their specific capabilities and limitations. The review covers both paid and free options, focusing on tools for text …

Read more

Mistral AI launches enhanced language model and ChatGPT competitor

French AI startup Mistral has unveiled Pixtral Large, a new 124-billion-parameter language model, alongside major updates to its Le Chat platform, reports Carl Franzen. The new model features advanced multimodal capabilities, including image processing and optical character recognition, while maintaining a significant context window of 128,000 tokens. The model is available for research purposes through …

Read more

OmniGen: First unified model for image generation

Researchers have introduced OmniGen, the first diffusion model capable of unifying various image generation tasks within a single framework. Unlike existing models like Stable Diffusion, OmniGen does not require additional modules to handle different control conditions, according to the authors Shitao Xiao, Yueze Wang, Junjie Zhou, Huaying Yuan, et al. The model can perform text-to-image …

Read more

Moondream raises $4.5M for compact yet powerful AI vision-language model

Moondream, a startup backed by Felicis Ventures, Microsoft’s M12 GitHub Fund, and Ascend, has emerged from stealth with $4.5 million in pre-seed funding. According to VentureBeat’s Michael Nuñez, the company has developed an open-source vision-language model that boasts 1.6 billion parameters but matches the performance of models four times its size. The model, which can …

Read more

First features of Apple Intelligence launched, reviews are mixed

Apple has released iOS 18.1, iPadOS 18.1, and macOS Sequoia 15.1, introducing the first set of Apple Intelligence features. These AI-powered enhancements are available on select devices equipped with A17 Pro, M1, or later chips. Users can opt into Apple Intelligence after downloading the update and will be added to a short waitlist to prepare …

Read more

Google Photos will soon show AI edits

Google Photos will soon show whether images have been edited using artificial intelligence. As Chris Welch reports in his article for The Verge, the AI information will be visible in the “AI info” section of the image details starting next week. The label will apply to edits made with tools like Magic Editor, Magic Eraser, …

Read more

Midjourney now with image editor

AI image generator Midjourney has introduced a new AI image editor that allows users to directly edit and style uploaded images. As Carl Franzen reports for VentureBeat, users can use the new “Edit” feature to turn vintage photos into anime-style images, for example, or turn hand drawings into full-fledged works of art in a matter …

Read more