Stability AI releases enterprise audio model with major speed increase

Stability AI has launched Stable Audio 2.5, a generative audio model built specifically for enterprise use. The company claims a new technique allows the model to produce high-quality audio in just eight computational steps, down from 50. Sean Michael Kerner reports for VentureBeat that this breakthrough can cut production time from weeks to minutes. According …

Read more

Anthropic’s Claude now creates and edits files directly

Anthropic announced that its AI assistant, Claude, can now create and edit files like Excel spreadsheets, documents, PowerPoint presentations, and PDFs. Users can describe their needs or upload data to receive finished files directly within the chat interface, moving beyond simple text responses. According to an official post by Anthropic, this feature allows Claude to …

Read more

Google clarifies usage limits for its Gemini AI models

Google has published specific usage limits for its AI model Gemini, providing new clarity for users on its free and paid plans. The update to its Help Center article replaces previous vague statements about potential caps on prompts and features. As The Verge reports, users now have a clear understanding of their daily and monthly …

Read more

Roblox introduces new generative AI tools for creators

The gaming platform Roblox is launching several new AI tools to simplify development. Aisha Malik reports for TechCrunch that these updates were announced at the Roblox Developers Conference. One tool allows creators to generate fully functional 3D objects, like drivable cars, from a simple text prompt. The company is also introducing real-time voice chat translation …

Read more

Google Photos lets US users animate images for free with Veo 3

Google Photos now allows users in the US to transform still images into four-second videos for free. The Verge reports that the feature uses Google’s Veo 3 AI model and is accessible via the “Create” tab. Users can choose between “Subtle movements” or an “I’m feeling lucky” option to animate their pictures. According to Google …

Read more

OpenAI makes ChatGPT Projects available to free users

OpenAI has expanded its ChatGPT Projects feature to free users after previously restricting it to paid subscribers. The feature allows users to organize AI conversations around specific topics and set custom instructions for responses. Ian Carlos Campbell from Engadget reports that free users can now upload five files per project, while Plus subscribers get 25 …

Read more

Switzerland launches open-source AI model Apertus as transparent ChatGPT alternative

Switzerland has entered the artificial intelligence race with Apertus, a new open-source Large Language Model developed by leading Swiss universities. The model aims to provide a transparent alternative to commercial AI systems like ChatGPT and Meta’s Llama. Apertus, meaning “open” in Latin, was created by the Swiss Federal Institute of Technology Lausanne (EPFL), ETH Zurich, …

Read more

Tencent releases AI model that creates 3D-like videos from single photos

Chinese tech giant Tencent has launched HunyuanWorld-Voyager, an AI model that transforms static images into navigable 3D-like video sequences. Benj Edwards reports about the announcement for Ars Technica. The system generates 49-frame video clips lasting roughly two seconds from a single photograph. Users can define camera movements such as forward, backward, and turning motions to …

Read more

AI startup Nous Research releases unrestricted chatbot that outperforms major competitors

Nous Research has quietly launched Hermes 4, a family of AI language models that the company claims matches leading commercial systems while removing most content restrictions. Michael Nuñez reports about the release for VentureBeat. Unlike ChatGPT or Claude, Hermes 4 responds to nearly any request without safety guardrails that have become standard in commercial AI …

Read more

Google provides tips for maximizing Gemini’s improved image generation

Google DeepMind explains in a new post how to use the improved image generation in Gemini to its full potential. Product Manager Naina Raisinghani shared specific prompting strategies to achieve better results with the updated model. The company recommends including six key elements in prompts: subject, composition, action, location, style, and editing instructions. Users should …

Read more