Microsoft Florence-2 specializes in image processing

Microsoft has unveiled Florence-2, a versatile AI model that can handle various image processing tasks with a single, unified approach. Available under an MIT license, the model appears to outperform larger specialized models in areas such as image annotation and object recognition, despite its compact size, and could help companies save on investments in separate …

Meta releases several new AI models

Meta is releasing a series of new AI models for audio, text, and watermarking. It is also making two sizes of its multimodal Chameleon model available for research. These models can be used for tasks that require both visual and textual understanding, such as image annotation.

Create social media videos with this AI platform

Augie Studio introduces a new AI platform for creating social media videos easily and at scale. The platform includes features such as AI-powered script, voice-over, and image creation, as well as editing tools to customize videos.

Genspark is a new AI-powered search engine

Genspark is a new AI-powered search engine that uses generative AI to create summaries of search results, similar to Google’s AI Overviews or Arc Search, but claims to achieve higher quality through specialized models.

Add sound effects to your videos with this tool

ElevenLabs has released a new tool that allows video creators to quickly and easily add sound effects to their clips. The app analyzes uploaded videos and suggests different sound effects that can be integrated directly into the videos via an interface.

New AI models for video: Luma Dream Machine and Runway Gen-3 Alpha

Luma AI has introduced “Dream Machine”, a new AI system for video generation. Unlike comparable systems such as OpenAI’s “Sora”, Dream Machine is free for anyone to use. Users can create 5-second video clips simply by entering text. However, the quality of the results is not always convincing. The startup itself …

New version and features for ChatGPT alternative Claude

Anthropic’s new language model Claude 3.5 Sonnet is causing a stir in the AI community. It reportedly outperforms previous models such as GPT-4 in benchmark tests and impresses users with its performance. It can handle complex tasks such as game or web development. Despite weaknesses in simple cognitive tasks, Claude 3.5 Sonnet shows the pace …

Adapter

Imagine a universal toolbox that contains many different tools but is too large and cumbersome for some jobs. To do those jobs efficiently, you can use small, specialized attachments called adapters. These adapters attach to the general-purpose tool and extend its function. For example, you can attach a screwdriver adapter to a drill …

New sources of better AI training data

Large Language Models (LLMs) are no longer trained solely on data from the Internet. In the past, LLMs were based on the vast data pool of the Internet, but this approach has reached its limits. To advance LLMs, companies like OpenAI are turning to new types of data: targeted annotation and filtering improve the quality …
