Chinese models lead Hugging Face ranking

Hugging Face’s new ranking of the best freely avaiable language models shows that Chinese models currently lead the way. Alibaba’s Qwen models dominate the top spots in the ranking, which is based on more challenging tests than its predecessor. Skills such as knowledge recall, inferring from long texts, complex mathematics, and following instructions are assessed.

Study: Open weights is not the same as open source

Many AI models that power chatbots advertise themselves as “open source,” but do not fully release the code and training data. A new study shows that many large companies describe their models as “open weights”, meaning that researchers can use them, but have no access to the underlying data and can’t make fundamental changes to …

Read more

Image generator Stable Diffusion 3 Medium works on PCs with limited performance

Stability AI has released Stable Diffusion 3 Medium, a smaller version of its image generation model that can run on PCs with as little as 5GB of VRAM. According to Stability AI, the model offers comparable quality to the larger version and could therefore be an attractive option for users with limited resources.

Microsoft Florence-2 is specialized on image processing

Microsoft has unveiled Florence-2, a versatile AI model that can handle various image processing tasks with a single, unified approach. Available under an MIT license, the model appears to outperform larger specialized models in areas such as image annotation and object recognition, despite its compact size, and could help companies save on investments in separate …

Read more

Stability AI release Stable Audio Open

Stability AI releases “Stable Audio Open,” a new AI model for the free creation of sounds and pieces of music up to 47 seconds in length. However, due to the training material, it is limited to English descriptions and Western music styles.

Nvidia Inference Microservices accelerate development

Nvidia introduces NIM (Nvidia Inference Microservices), a new technology that supposedly enables developers to deliver AI applications in minutes instead of weeks. These microservices provide optimized models as containers that can be deployed in clouds, data centers, or on workstations. The goal is to enable organizations to build generative AI applications for co-piloting, chatbots, and …

Read more

Cohere Aya 23 is multilingual

Cohere for AI releases the Aya 23 multilingual AI models with support for 23 languages and open weights. The models outperform its predecessor, Aya 101, and other open models on a variety of tasks, enabling researchers and practitioners to further develop the multilingual models and applications.

Perplexica is an open source AI search engine

Perplexica is an open source search engine with AI support, similar to Perplexity. It provides answers with references. Once installed, it can use local language models such as Llama 3 or Mixtral.

Falcon 2 11B announced

The Technology Innovation Institute (TII) in Abu Dhabi has released Falcon 2 11B, a new, powerful AI model that is freely available and multilingual. Falcon 2 11B outperforms comparable models such as Meta’s Llama 3.

OpenVoice is an AI for voice cloning

OpenVoice allows users to realistically clone voices in different languages and accents, and even control emotions and speaking styles. The latest version, OpenVoice V2, offers improved audio quality, native support for multiple languages, and is available free for commercial use. Source: Hacker News