Nvidia releases free Parakeet-TDT-0.6B-V2 speech recognition model

Nvidia has launched a new open-source automatic speech recognition (ASR) model called Parakeet-TDT-0.6B-v2. According to VentureBeat reporter Carl Franzen, the model can transcribe 60 minutes of audio in just one second when running on Nvidia’s GPU hardware. The new model currently tops the Hugging Face Open ASR Leaderboard with a word error rate of only … Read more

Nvidia releases powerful Llama-3.1 Nemotron Ultra language model

Nvidia has launched Llama-3.1-Nemotron-Ultra-253B, a fully open-source language model that outperforms the larger DeepSeek R1 on several benchmarks despite having less than half the parameters. Carl Franzen of VentureBeat reports the model is now available on Hugging Face with open weights and training data. The 253-billion parameter model features a unique toggle for “reasoning on” … Read more

Nvidia unveils Llama Nemotron models to advance AI agents and reasoning capabilities

At the GPU Technology Conference (GTC) 2025, Nvidia announced a new family of AI models called Llama Nemotron designed to enhance reasoning capabilities for autonomous AI agents. These models are based on Meta’s open-source Llama models but have been refined through post-training optimization techniques to improve their performance in complex tasks such as multistep math, … Read more

Nvidia claims AI chip performance exceeds Moore’s Law pace

Nvidia CEO Jensen Huang has announced that his company’s AI chips are advancing at a rate surpassing Moore’s Law, the long-standing principle of computing progress. According to TechCrunch’s recent report, Huang states their latest data center superchip performs 30 times faster for AI tasks than previous generations. The CEO attributes this acceleration to Nvidia’s comprehensive … Read more

Nvidia introduces comprehensive AI agent and video analysis technology

Nvidia has announced a new suite of AI technologies focused on video analysis and agent-based automation, as revealed during CEO Jensen Huang’s CES 2025 keynote address. The company has launched several blueprints that enable developers to create AI agents capable of analyzing video content, processing documents, and automating various enterprise tasks. The new technology includes … Read more

Nvidia announces new AI models and technologies at CES 2025

Nvidia has unveiled multiple new AI initiatives at CES 2025, centered around their Nemotron model families and Cosmos World Foundation Models. The company’s CEO Jensen Huang presented these developments during his opening keynote, introducing AI models designed to advance both enterprise and consumer applications. The Nemotron family includes language and vision models available as NIM … Read more

Nvidia introduces “desktop AI supercomputer” Project Digits for $3,000

At CES 2025 in Las Vegas, Nvidia announced Project Digits, a compact desktop AI supercomputer aimed at researchers, data scientists, and students. The device, scheduled for release in May 2025 at a price point of $3,000, represents the company’s effort to bring powerful AI computing capabilities to individual desks. At the core of Project Digits … Read more

Nvidia introduces new RTX 50 series GPUs with enhanced AI capabilities

Nvidia has announced its new GeForce RTX 50 series graphics cards, featuring the Blackwell architecture and significant AI processing improvements. The flagship RTX 5090, priced at $1,999, incorporates 92 billion transistors and delivers 4,000 AI TOPS (trillion operations per second), representing a major advancement in AI-powered graphics processing. The new GPU series introduces several AI-focused … Read more

Nvidia acquires Run:ai and makes software open source

Nvidia has completed its acquisition of Run:ai, a software company specializing in GPU cloud management for artificial intelligence, as reported by Dean Takahashi. While the purchase price wasn’t officially disclosed, earlier reports valued the deal at $700 million. The company announced plans to make Run:ai’s software platform open source, potentially allowing it to support GPUs … Read more

Apple and Nvidia collaborate to accelerate LLM processing

Apple and Nvidia have announced the integration of Apple’s ReDrafter technology into Nvidia’s TensorRT-LLM framework, enabling faster processing of large language models (LLMs) on Nvidia GPUs. ReDrafter, an open-source speculative decoding approach developed by Apple, uses recurrent neural networks to predict future tokens during text generation, combined with beam search and tree attention algorithms. The … Read more