Alibaba extends Qwen AI model to process one million tokens

Alibaba Cloud has launched an upgraded version of its Qwen2.5-Turbo AI model that can now process contexts of up to one million tokens, equivalent to approximately 1.5 million Chinese characters or 10 full-length novels. The improved model achieves 93.1 points on the RULER long text evaluation benchmark, surpassing GPT-4’s score of 91.6. According to Alibaba, …

Read more

Mistral AI launches enhanced language model and ChatGPT competitor

French AI startup Mistral has unveiled Pixtral Large, a new 124-billion-parameter language model, alongside major updates to its Le Chat platform, reports Carl Franzen. The new model features advanced multimodal capabilities, including image processing and optical character recognition, while maintaining a significant context window of 128,000 tokens. The model is available for research purposes through …

Read more

Hugging Face releases compact language models for smartphones and edge devices

Hugging Face has released SmolLM2, a new family of compact language models designed to run on smartphones and edge devices with limited processing power and memory. The models, released under the Apache 2.0 license, come in three sizes up to 1.7B parameters and achieve impressive performance on key benchmarks, outperforming larger models like Meta’s Llama …

Read more

Microsoft adds AI-powered text editing to Notepad

Microsoft is introducing an AI-powered text editing feature called Rewrite to its classic Notepad application, allowing users to rephrase sentences, adjust tone, and modify content length. According to the Windows Insider Blog, the feature is currently rolling out in preview to Windows Insiders and requires signing in to a Microsoft account. Emma Roth from The …

Read more

Claude AI chatbot launches desktop apps and dictation

Anthropic has released desktop apps for Mac and Windows for its AI chatbot Claude, bringing Claude’s capabilities to users’ preferred work environments, TechCrunch reports. The apps are available in public beta for both free and premium users. Anthropic also introduced a dictation tool allowing users to upload voice messages up to 10 minutes long for …

Read more

Anthropic releases Claude 3.5 Haiku with increased prices

Anthropic has released its newest and smallest AI model, Claude 3.5 Haiku, which outperforms the previous flagship model, Claude 3 Opus, on various benchmarks at a lower cost, according to the company. However, Anthropic has increased the pricing for Claude 3.5 Haiku to reflect its enhanced capabilities, with input tokens now costing $1 per million …

Read more

Moondream raises $4.5M for compact yet powerful AI vision-language model

Moondream, a startup backed by Felicis Ventures, Microsoft’s M12 GitHub Fund, and Ascend, has emerged from stealth with $4.5 million in pre-seed funding. According to VentureBeat’s Michael Nuñez, the company has developed an open-source vision-language model that boasts 1.6 billion parameters but matches the performance of models four times its size. The model, which can …

Read more

First features of Apple Intelligence launched, reviews are mixed

Apple has released iOS 18.1, iPadOS 18.1, and macOS Sequoia 15.1, introducing the first set of Apple Intelligence features. These AI-powered enhancements are available on select devices equipped with A17 Pro, M1, or later chips. Users can opt into Apple Intelligence after downloading the update and will be added to a short waitlist to prepare …

Read more

Cohere Aya Expanse is a highly performant multilingual model family

Cohere For AI has released Aya Expanse, a family of highly performant multilingual models that, according to Cohere, excel across 23 languages and outperform other leading open-weights models. The models, available in 8 and 32 billion parameters, are part of Cohere’s commitment to multilingual research and expanding high-quality coverage of languages in large language models …

Read more

Meta releases AI models for mobile devices

Meta Platforms has released quantized versions of its Llama 3.2 1B and 3B models, which the company says offer reduced memory requirements, faster on-device inference, accuracy, and portability. The models were developed in close collaboration with Qualcomm and MediaTek and are available on SoCs with Arm CPUs. According to Meta, the average model size has …

Read more