Open Source | Page 9 of 16 | ✦ Smart Content Report

DeepSeek releases new reasoning models and introduces distilled versions

February 5, 2025January 20, 2025

Chinese AI company DeepSeek has announced the release of its new reasoning-focused language models DeepSeek-R1-Zero and DeepSeek-R1, along with six smaller distilled versions. The main models, built on DeepSeek’s V3 architecture, feature 671 billion total parameters with 37 billion activated parameters and a context length of 128,000 tokens. According to company statements, DeepSeek-R1 achieves performance …

MiniMax AI model has record-breaking 4 million token context

February 5, 2025January 20, 2025

Singapore-based AI company MiniMax has launched a new open-source language model that can process up to 4 million tokens at once, doubling the previous record. According to Carl Franzen’s report in VentureBeat, the MiniMax-01 series includes both text and visual capabilities. The model uses an innovative “Lightning Attention” architecture and mixture of experts framework with …

Diffbot launches new AI model with real-time fact checking

February 5, 2025January 20, 2025

Diffbot, a Silicon Valley company, has introduced a new AI model that combines AI capabilities with real-time fact verification. As reported by Michael Nuñez for VentureBeat, the system uses graph retrieval-augmented generation (GraphRAG) technology based on Meta’s Llama 3.3. The model connects to Diffbot’s Knowledge Graph, a database containing over one trillion facts that updates …

Microsoft releases Phi-4 AI model with open-source license

February 5, 2025January 8, 2025

Microsoft has made its Phi-4 AI model freely available as open-source software on the Hugging Face platform. As reported by Carl Franzen, the model was previously only accessible through Microsoft’s Azure AI Foundry platform. The 14-billion-parameter AI model has demonstrated strong capabilities in mathematical reasoning and language understanding tasks, outperforming larger models in specific benchmarks. …

Tested: DeepSeek-V3 matches top AI models at lower cost

February 5, 2025January 2, 2025

A detailed analysis published by Sunil Kumar Dash reveals that DeepSeek’s latest AI model achieves performance comparable to leading closed-source models while offering significant cost advantages. The model outperforms existing open-source alternatives in mathematics and reasoning tasks, according to extensive benchmark testing. The analysis demonstrates that DeepSeek-V3 surpasses GPT-4 and Claude 3.5 Sonnet in mathematical …

Open model DeepSeek-V3 performs similar to closed competition

February 5, 2025December 27, 2024

Chinese AI startup DeepSeek has launched DeepSeek-V3, a powerful new AI model that outperforms existing open-source alternatives. According to reporting by Shubham Sharma at VentureBeat, the model features 671 billion parameters but activates only 37 billion for each task through its mixture-of-experts architecture. The model was trained on 14.8 trillion diverse tokens and demonstrates superior …

IBM launches improved Granite 3.1 language models

February 5, 2025December 19, 2024

IBM has released a new version of its open-source large language models, Granite 3.1, featuring significant improvements in performance and capabilities. According to reporting by Sean Michael Kerner for VentureBeat, the new models offer extended context length and integrated hallucination detection. The Granite 8B Instruct model reportedly outperforms similar-sized competitors including Meta Llama 3.1 and …

Microsoft’s Phi-4 AI model achieves high performance with fewer resources

February 5, 2025December 13, 2024

Microsoft has introduced a new AI model that delivers superior mathematical reasoning capabilities while using significantly less computing power than larger competitors. According to Michael Nuñez’s report in VentureBeat, the 14-billion-parameter Phi-4 model outperforms larger systems like Google’s Gemini Pro 1.5. The model excels particularly in mathematical problem-solving, achieving top scores on standardized math competition …

NitroFusion creates instant images on basic hardware

February 5, 2025December 11, 2024

The University of Surrey’s Institute for People-Centred Artificial Intelligence (PAI) has unveiled NitroFusion, a revolutionary AI model that generates images in real-time as users type. The groundbreaking technology, developed by the university’s SketchX laboratory, operates on consumer-grade graphics cards, making it accessible to individual creators and small studios. Unlike existing image generation platforms that require …

ServiceNow releases open-source AI training accelerator

February 5, 2025December 11, 2024

ServiceNow has launched Fast-LLM, an open-source framework that speeds up artificial intelligence model training by 20%. As reported by Sean Michael Kerner for VentureBeat, the technology has already proven successful in training ServiceNow’s StarCoder 2 language model. Fast-LLM introduces two key innovations: “Breadth-First Pipeline Parallelism” for optimized computation ordering and improved memory management that reduces …