Google releases AI-powered video creation app for work

Google has announced that Google Vids, a new AI-powered video creation app for work, is now generally available in select Google Workspace editions. According to the company’s announcement, Vids is designed to help teams in customer service, learning and development, project ops, and marketing create engaging videos more easily. The app uses generative AI capabilities to …

Midship uses AI to extract usable data from unstructured documents

Midship has developed an AI-powered tool that extracts specific fields and tables from unstructured documents like PDFs and images. According to the founders in a Hacker News post, Midship combines OCR with language models to convert documents into clean, structured data, going beyond simple markdown output. The tool is aimed at both non-technical users via …

ByteDance’s X-Portrait 2 turns photos into realistic videos

ByteDance, the Chinese company behind TikTok, has revealed its X-Portrait 2 AI system, which can transform still photographs into convincing video performances. The technology, trained on TikTok’s vast database of user-generated videos, captures nuanced facial expressions and movements with unprecedented realism, as demonstrated by its ability to mirror iconic movie scenes. According to an article …

Nous Research launches Nous Chat, its first user-facing AI chatbot

Nous Research, an AI research group, has launched Nous Chat, a user-facing chatbot that provides access to its large language model, Hermes 3-70B. According to an article by Carl Franzen, the chatbot offers a familiar interface similar to ChatGPT and allows users to interact with the model without needing to run the code themselves. While …

Mistral AI launches multilingual content moderation API to tackle harmful content

Mistral AI, a French artificial intelligence startup, has released a new content moderation API capable of detecting harmful content across nine categories in 11 languages. The API, powered by Mistral’s fine-tuned Ministral 8B model, offers both raw text and conversational content analysis, as reported by Michael Nuñez for VentureBeat. This launch positions Mistral to compete …

Microsoft unveils Magentic-One, an open-source framework for managing multi-agent AI systems

Microsoft has released Magentic-One, a new open-source infrastructure that enables a single AI model to manage multiple helper agents working together to complete complex, multi-step tasks in various scenarios. According to a paper by Microsoft researchers, Magentic-One is a generalist agentic system that can “fully realize the long-held vision of agentic systems that can enhance …

OmniGen: First unified model for image generation

Researchers have introduced OmniGen, the first diffusion model capable of unifying various image generation tasks within a single framework. Unlike existing models such as Stable Diffusion, OmniGen does not require additional modules to handle different control conditions, according to the authors Shitao Xiao, Yueze Wang, Junjie Zhou, Huaying Yuan, et al. The model can perform text-to-image …

Hugging Face releases compact language models for smartphones and edge devices

Hugging Face has released SmolLM2, a new family of compact language models designed to run on smartphones and edge devices with limited processing power and memory. The models, released under the Apache 2.0 license, come in three sizes up to 1.7B parameters and achieve impressive performance on key benchmarks, outperforming larger models like Meta’s Llama …

Runway unveils powerful 3D camera controls for Gen-3 Alpha Turbo AI video

Runway, a New York City-based AI startup, has launched advanced camera controls for its Gen-3 Alpha Turbo video generation model. According to an article by Carl Franzen, these new features allow users to zoom in and out of AI-generated scenes while preserving character forms and settings, creating a realistic 3D world. The camera controls provide …

Patronus AI launches API to prevent AI hallucinations in real time

Patronus AI, a San Francisco startup, has launched a self-serve API that detects and prevents AI failures, such as hallucinations and unsafe responses, in real time. According to CEO Anand Kannappan in an interview with VentureBeat, the platform introduces several innovations, including “judge evaluators” that allow companies to create custom rules in plain English and Lynx, …