Mistral unveils small models for laptops and smartphones

French AI company Mistral has introduced new generative AI models for laptops and smartphones. Known as “Les Ministraux,” the models are optimized for various applications such as text generation or collaboration with more powerful models. Kyle Wiggers reports for TechCrunch that two variants are available: Ministral 3B and Ministral 8B, both with a context window …

Read more

Zamba2-7B is especially efficient

Zyphra has released Zamba2-7B, a new small language model supposedly outperforming competitors like Mistral, Google’s Gemma, and Meta’s Llama3 in quality and performance. According to the Zyphra team, Zamba2-7B is ideal for consumer devices, GPUs, and enterprise applications. It boasts 25% faster time to first token, 20% more tokens per second, and reduced memory usage …

Read more

What is the best version of ChatGPT?

A Reddit post titled “Which is the best version of chatgpt4” sparked a discussion about the various models of ChatGPT. The original poster inquired about the most accurate version, leading to various responses. Users highlighted the strengths of different models: “4o” is considered best for research, image generation, and creative writing, while “o1 mini” excels …

Read more

Palmyra X 004 is the David of AI models

Writer has launched its new AI language model, Palmyra X 004, which seemingly excels in function calling and workflow execution for businesses. Michael Nuñez reports for VentureBeat that the model outperforms competitors like OpenAI, Anthropic, Google, and Meta by nearly 20% on Berkeley’s Tool Calling Leaderboard, achieving a score of 78.76%. Palmyra X 004 achieves …

Read more

Gemini 1.5 Flash-8B available

Google has released a new version of its AI model Gemini. Gemini 1.5 Flash-8B is now available for production use, as announced on the Google Developers Blog. Compared to its predecessor, the model offers 50% lower prices, twice the request limits, and lower latency for short inputs. Developers can access Gemini 1.5 Flash-8B for free …

Read more

OpenAI adds Canvas to ChatGPT

OpenAI introduces a new feature called “Canvas” for ChatGPT. The new interface allows users to edit text and code directly next to the chat window. Canvas is initially available for ChatGPT Plus and Teams users, later also for Enterprise and Edu customers. The function aims to simplify writing and programming with AI assistance. With Canvas, …

Read more

Nvidia surprises with powerful, open AI models

Nvidia has released a powerful open-source AI model that rivals proprietary systems from industry leaders like OpenAI and Google. The model, called NVLM 1.0, demonstrates exceptional performance in vision and language tasks while also enhancing text-only capabilities. Michael Nuñez reports on this development for VentureBeat. The main model, NVLM-D-72B, with 72 billion parameters, can process …

Read more

Mostly AI helps companies to train AI without privacy concerns

Mostly AI has introduced a new feature for generating synthetic texts. The tool allows companies to use confidential data such as emails and conversations for AI training without privacy concerns. As reported by Shubham Sharma, the platform generates a version of proprietary information free from personally identifiable data. This enables businesses to train and optimize …

Read more

MIT spin-off Liquid AI shows its highly efficient models

A MIT spin-off called Liquid AI has unveiled new AI models that are not based on the usual transformer architecture. According to the company, these “Liquid Foundation Models” (LFMs) already outperform comparable transformer-based models in performance and efficiency. Liquid AI announced this in a statement. Instead of transformers, the developers used approaches from the theory …

Read more

Meta Llama 3.2 is here

Meta has today released the new version of its AI model series: Llama 3.2, which for the first time includes vision models that can process both images and text. The larger versions with 11 and 90 billion parameters should be able to compete with closed systems like Claude 3 Haiku for image processing. Also new …

Read more