Meta’s Llama 3.3 70B model brings GPT-4 level AI to high-end laptops

Meta has released Llama 3.3 70B, a new large language model that achieves GPT-4 level performance while running on high-end consumer laptops. The breakthrough was documented by developer Simon Willison, who tested the model on a 64 GB MacBook Pro M2 and found capabilities comparable to those of much larger models such as Meta’s own Llama 3.1 405B. The model …
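
The coverage doesn’t spell out why a 70B-parameter model now fits on a 64 GB machine, but the weight-size arithmetic is straightforward. A back-of-envelope sketch (the precision levels are illustrative, not figures from the article):

```python
# Approximate weight memory for a 70B-parameter model at common precisions.
# Illustrative only: real usage also needs KV cache and runtime overhead.
PARAMS = 70e9

def weights_gb(bits_per_param: float) -> float:
    """Total weight size in GB (1 GB = 1e9 bytes)."""
    return PARAMS * bits_per_param / 8 / 1e9

for label, bits in [("fp16", 16), ("int8", 8), ("4-bit", 4)]:
    print(f"{label:>5}: ~{weights_gb(bits):.0f} GB")
```

At 4 bits per weight the model comes to roughly 35 GB, which leaves a 64 GB machine room for the OS and inference overhead; at fp16 (~140 GB) it simply doesn’t fit.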

Read more

New AI model from Hugging Face promises efficient image processing

Hugging Face has introduced SmolVLM, a new vision-language AI model that processes both images and text while using significantly less computing power than comparable solutions. As reported by Michael Nuñez, the model requires only 5.02 GB of GPU RAM, compared to competitors that need up to 13.70 GB. The system uses advanced compression technology to …
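
Taking the reported figures at face value, the gap is easy to put in relative terms:

```python
# Reported GPU RAM usage (figures from the article).
smolvlm_gb = 5.02
competitor_gb = 13.70

ratio = competitor_gb / smolvlm_gb                 # how many times smaller
savings_pct = (1 - smolvlm_gb / competitor_gb) * 100

print(f"SmolVLM uses ~{ratio:.1f}x less GPU RAM ({savings_pct:.0f}% less)")
```

That works out to roughly 2.7× less memory, a reduction of about 63%.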

Read more

Lightricks releases open-source AI video generation model

Israeli tech company Lightricks has launched LTX Video (LTXV), a new open-source AI model that generates five-second videos in just four seconds. As Michael Nuñez reports for VentureBeat, the company aims to challenge major tech firms by making its technology freely available. The model runs efficiently on consumer-grade hardware like Nvidia RTX 4090 GPUs while …

Read more

Adobe develops AI system for offline document processing on smartphones

Adobe has created SlimLM, an AI system that can analyze and process documents directly on smartphones without requiring internet connectivity. According to Michael Nuñez, the system was successfully tested on Samsung’s Galaxy S24, where it demonstrated capabilities in document analysis, summarization, and complex question answering. The technology operates with a compact model of 125 million …

Read more

Fastino launches CPU-based AI models for enterprise tasks

San Francisco-based startup Fastino has unveiled new task-optimized AI models that run efficiently on standard CPUs without requiring expensive GPUs. As reported by Sean Michael Kerner, the company has secured $7 million in pre-seed funding from investors including Microsoft’s venture fund M12 and Insight Partners. Fastino’s models differ from traditional large language models by …

Read more

Companies advised to start small with multimodal RAG implementation

Multimodal retrieval-augmented generation (RAG) systems are gaining traction as tools to process multiple data types, including text, images, and videos. According to Emilia David’s article on VentureBeat, embedding service providers recommend a cautious approach to implementation. Cohere, which recently updated its Embed 3 model, emphasizes the importance of thorough data preparation and initial testing …
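
The advice above is about process, but the retrieval loop it applies to is small. A minimal, dependency-free sketch of RAG-style retrieval over mixed text and image-caption entries; the bag-of-words `embed` here is a toy stand-in for a real multimodal embedding service such as Cohere’s Embed 3:

```python
import math
from collections import Counter

# Toy corpus: text snippets and image captions share one vector space,
# which is the key property a multimodal embedding model provides.
docs = [
    ("text",  "quarterly revenue grew ten percent"),
    ("image", "bar chart of quarterly revenue by region"),
    ("text",  "the office cafeteria menu for friday"),
]

def embed(s: str) -> Counter:
    # Stand-in embedding: bag of lowercase words. A real system would
    # call a multimodal embedding API here instead.
    return Counter(s.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, k: int = 2):
    q = embed(query)
    scored = sorted(docs, key=lambda d: cosine(q, embed(d[1])), reverse=True)
    return scored[:k]

print(retrieve("revenue chart"))
```

Because text and image entries live in one vector space, a text query can surface the chart as the best match, which is the whole point of going multimodal.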

Read more

Hugging Face releases compact language models for smartphones and edge devices

Hugging Face has released SmolLM2, a new family of compact language models designed to run on smartphones and edge devices with limited processing power and memory. The models, released under the Apache 2.0 license, come in three sizes up to 1.7B parameters and achieve impressive performance on key benchmarks, outperforming larger models like Meta’s Llama …

Read more

SambaNova and Hugging Face simplify AI chatbot deployment

SambaNova and Hugging Face have launched a new integration that allows developers to deploy AI chatbots with a single click, reportedly reducing deployment time from hours to minutes. According to Ahsen Khaliq, ML Growth Lead at Gradio, the process involves obtaining an access token from SambaNova Cloud’s API website and entering three lines of Python …

Read more

Speech-to-text: Moonshine is faster than, and as accurate as, OpenAI’s Whisper

Useful Sensors, an AI company focused on improving human-machine communication, has open-sourced Moonshine, a new speech-to-text model that aims to significantly reduce the latency of voice interfaces. According to founder Pete Warden, Moonshine returns results 1.7 times faster than OpenAI’s Whisper model while matching or exceeding its accuracy. The model’s variable-length input window allows it …
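
Part of the latency win comes from that variable-length window: Whisper’s encoder always processes a fixed 30-second window, padding shorter audio with silence, so short utterances waste most of the compute. The 30-second figure is Whisper’s documented window; the rest below is back-of-envelope:

```python
WHISPER_WINDOW_S = 30.0  # Whisper pads every input up to a fixed 30 s window

def padding_fraction(clip_s: float) -> float:
    """Fraction of the encoder input that is silence padding."""
    return max(0.0, 1.0 - clip_s / WHISPER_WINDOW_S)

for clip in (5, 10, 30):
    print(f"{clip:>2}s clip: {padding_fraction(clip):.0%} padding")
```

For a 5-second utterance, over 80% of Whisper’s encoder input is silence, which is exactly the overhead a variable-length window avoids.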

Read more

Meta releases AI models for mobile devices

Meta Platforms has released quantized versions of its Llama 3.2 1B and 3B models, which the company says offer reduced memory requirements and faster on-device inference while preserving accuracy and portability. The models were developed in close collaboration with Qualcomm and MediaTek and are available on SoCs with Arm CPUs. According to Meta, the average model size has …
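
Meta’s post doesn’t include example code, and its production recipe is more sophisticated than this, but a minimal symmetric int8 quantize/dequantize round trip shows what quantization buys: one byte per weight instead of four, at a small cost in precision (purely illustrative, not Meta’s method):

```python
# Minimal symmetric int8 quantize/dequantize sketch. Illustrative only:
# production schemes (per-channel scales, quantization-aware training)
# are more involved than this.
def quantize_int8(weights: list[float]) -> tuple[list[int], float]:
    scale = max(abs(w) for w in weights) / 127 or 1.0
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize(q: list[int], scale: float) -> list[float]:
    return [v * scale for v in q]

w = [0.02, -0.51, 0.33, 1.27]
q, scale = quantize_int8(w)          # 1 byte per weight instead of 4
restored = dequantize(q, scale)
max_err = max(abs(a - b) for a, b in zip(w, restored))
print(q, f"max round-trip error {max_err:.4f}")
```

Each weight drops from four bytes to one, and the worst-case round-trip error is bounded by half the scale (here about 0.005).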

Read more