Image generator Stable Diffusion 3 Medium works on PCs with limited performance

Stability AI has released Stable Diffusion 3 Medium, a smaller version of its image generation model that can run on PCs with as little as 5GB of VRAM. According to Stability AI, the model offers comparable quality to the larger version and could therefore be an attractive option for users with limited resources.

Microsoft Florence-2 is specialized on image processing

Microsoft has unveiled Florence-2, a versatile AI model that can handle various image processing tasks with a single, unified approach. Available under an MIT license, the model appears to outperform larger specialized models in areas such as image annotation and object recognition, despite its compact size, and could help companies save on investments in separate …

Read more

Stability AI release Stable Audio Open

Stability AI releases “Stable Audio Open,” a new AI model for the free creation of sounds and pieces of music up to 47 seconds in length. However, due to the training material, it is limited to English descriptions and Western music styles.

Nvidia Inference Microservices accelerate development

Nvidia introduces NIM (Nvidia Inference Microservices), a new technology that supposedly enables developers to deliver AI applications in minutes instead of weeks. These microservices provide optimized models as containers that can be deployed in clouds, data centers, or on workstations. The goal is to enable organizations to build generative AI applications for co-piloting, chatbots, and …

Read more

Cohere Aya 23 is multilingual

Cohere for AI releases the Aya 23 multilingual AI models with support for 23 languages and open weights. The models outperform its predecessor, Aya 101, and other open models on a variety of tasks, enabling researchers and practitioners to further develop the multilingual models and applications.

Perplexica is an open source AI search engine

Perplexica is an open source search engine with AI support, similar to Perplexity. It provides answers with references. Once installed, it can use local language models such as Llama 3 or Mixtral.

Falcon 2 11B announced

The Technology Innovation Institute (TII) in Abu Dhabi has released Falcon 2 11B, a new, powerful AI model that is freely available and multilingual. Falcon 2 11B outperforms comparable models such as Meta’s Llama 3.

OpenVoice is an AI for voice cloning

OpenVoice allows users to realistically clone voices in different languages and accents, and even control emotions and speaking styles. The latest version, OpenVoice V2, offers improved audio quality, native support for multiple languages, and is available free for commercial use. Source: Hacker News

Arctic’s Snowflake aims at enterprise tasks

Snowflake introduces Arctic, a new open language model designed specifically for complex enterprise tasks such as generating SQL queries and code or following instructions.

Apple releases OpenELM AI models

Apple releases OpenELM, a set of small, freely available AI models that can run directly on devices like laptops or smartphones and perform tasks such as text generation efficiently. While not industry-leading in performance, OpenELM seems to provide a solid foundation for future research and development in on-device AI.