H2O.ai’s new models specialize on documents

H2O.ai has unveiled two new vision-language models, H2OVL Mississippi-2B and H2OVL Mississippi-0.8B, aimed at enhancing document analysis and optical character recognition (OCR) tasks. Despite their smaller size, these models demonstrate competitive performance against larger models from major tech companies, with the 0.8B model excelling in the OCRBench Text Recognition task. CEO Sri Ambati emphasized that …

Read more

ChatGPT gets a Windows app

OpenAI has launched a Windows desktop application for ChatGPT, enhancing its integration into daily workflows and productivity. This release follows a similar app for macOS and is currently available to select subscribers, VentureBeat reports. The app allows users to access ChatGPT easily via a keyboard shortcut, aiming to streamline user experience without the need for …

Read more

Differential Transformer could improve text AIs

Microsoft and Tsinghua University have developed a new AI architecture called “Differential Transformer” that improves the performance of large language models. Furu Wei from Microsoft Research told VentureBeat that the new method amplifies attention to relevant contexts and filters out noise. This is designed to reduce problems such as the “lost-in-the-middle” phenomenon and hallucinations in …

Read more

Nvidia releases powerful and open AI model

Nvidia has introduced a new AI model, Llama-3.1-Nemotron-70B-Instruct, which outperforms existing models from OpenAI and others, continuing a significant shift in its AI strategy. The model, available on Hugging Face, achieved impressive benchmark scores, positioning Nvidia as a competitive player in AI language understanding and generation. This development showcases Nvidia’s transition from a GPU manufacturer …

Read more

Mistral unveils small models for laptops and smartphones

French AI company Mistral has introduced new generative AI models for laptops and smartphones. Known as “Les Ministraux,” the models are optimized for various applications such as text generation or collaboration with more powerful models. Kyle Wiggers reports for TechCrunch that two variants are available: Ministral 3B and Ministral 8B, both with a context window …

Read more

Zamba2-7B is especially efficient

Zyphra has released Zamba2-7B, a new small language model supposedly outperforming competitors like Mistral, Google’s Gemma, and Meta’s Llama3 in quality and performance. According to the Zyphra team, Zamba2-7B is ideal for consumer devices, GPUs, and enterprise applications. It boasts 25% faster time to first token, 20% more tokens per second, and reduced memory usage …

Read more

What is the best version of ChatGPT?

A Reddit post titled “Which is the best version of chatgpt4” sparked a discussion about the various models of ChatGPT. The original poster inquired about the most accurate version, leading to various responses. Users highlighted the strengths of different models: “4o” is considered best for research, image generation, and creative writing, while “o1 mini” excels …

Read more

Palmyra X 004 is the David of AI models

Writer has launched its new AI language model, Palmyra X 004, which seemingly excels in function calling and workflow execution for businesses. Michael Nuñez reports for VentureBeat that the model outperforms competitors like OpenAI, Anthropic, Google, and Meta by nearly 20% on Berkeley’s Tool Calling Leaderboard, achieving a score of 78.76%. Palmyra X 004 achieves …

Read more

Gemini 1.5 Flash-8B available

Google has released a new version of its AI model Gemini. Gemini 1.5 Flash-8B is now available for production use, as announced on the Google Developers Blog. Compared to its predecessor, the model offers 50% lower prices, twice the request limits, and lower latency for short inputs. Developers can access Gemini 1.5 Flash-8B for free …

Read more

OpenAI adds Canvas to ChatGPT

OpenAI introduces a new feature called “Canvas” for ChatGPT. The new interface allows users to edit text and code directly next to the chat window. Canvas is initially available for ChatGPT Plus and Teams users, later also for Enterprise and Edu customers. The function aims to simplify writing and programming with AI assistance. With Canvas, …

Read more