Apple and Nvidia collaborate to accelerate LLM processing

Apple and Nvidia have announced the integration of Apple’s ReDrafter technology into Nvidia’s TensorRT-LLM framework, enabling faster processing of large language models (LLMs) on Nvidia GPUs. ReDrafter, an open-source speculative decoding approach developed by Apple, uses recurrent neural networks to predict future tokens during text generation, combined with beam search and tree attention algorithms. The … Read more

Nvidia and DataStax launch storage-efficient AI retrieval system

Nvidia and DataStax have introduced a new AI technology that cuts data storage requirements for generative AI systems by a factor of 35. As reported by Michael Nuñez for VentureBeat, the Nvidia NeMo Retriever microservices, integrated with DataStax’s AI platform, enable faster and more accurate information retrieval across multiple languages. The technology has already shown impressive results … Read more

Nvidia unveils AI audio generation model Fugatto

Nvidia has introduced a new AI model called Fugatto that can generate and modify audio, including music, voice, and sound effects. As reported by Stephen Nellis for Reuters, the technology allows users to transform existing sounds, change voice accents, and create novel audio effects through text prompts. The model, whose name stands for Foundational Generative … Read more

Amazon developing AI chips to reduce reliance on Nvidia

Amazon is investing heavily in developing its own AI chips through Annapurna Labs, an Austin-based startup it acquired in 2015 for $350 million. The company aims to boost efficiency in its data centers and reduce costs for both itself and its AWS customers, the Financial Times reports. According to Dave Brown, vice-president of compute and … Read more

Nvidia surpasses Apple as most valuable company amid AI boom

Nvidia has become the world’s most valuable company, surpassing Apple with a market cap of $3.43 trillion as of November 5, 2024. According to Ryan Vlastelica reporting for Bloomberg, Nvidia’s dominance reflects the immense impact of artificial intelligence on Wall Street. The chipmaker is responsible for a quarter of the S&P 500’s 21% gain this … Read more

India’s major advances in AI

India is making great strides in building its own AI infrastructure and has already trained more than 100,000 AI developers. Nvidia CEO Jensen Huang said so at the Nvidia AI Summit in India, according to VentureBeat’s Dean Takahashi. The country now has more than 2,000 AI startups in the Nvidia Inception Program … Read more

Nvidia releases powerful and open AI model

Nvidia has introduced a new AI model, Llama-3.1-Nemotron-70B-Instruct, which outperforms existing models from OpenAI and others, continuing a significant shift in its AI strategy. The model, available on Hugging Face, achieved impressive benchmark scores, positioning Nvidia as a competitive player in AI language understanding and generation. This development showcases Nvidia’s transition from a GPU manufacturer … Read more

DataStax and Nvidia accelerate AI development for companies

DataStax has unveiled a new AI platform in collaboration with Nvidia, aimed at assisting enterprises with AI development. As reported by Sean Michael Kerner for VentureBeat, the platform combines DataStax’s database technology and visual AI orchestration tool Langflow with Nvidia’s enterprise AI components. According to DataStax, the new solution can reduce AI development time by … Read more

Rental prices for Nvidia GPUs collapse

The market for Nvidia’s H100 GPUs has swung from a shortage in 2023 to a glut in 2024. According to Eugene Cheah, rental prices for the powerful GPUs have dropped from $8 per hour to less than $2. Cheah attributes this to several factors. For example, more companies are relying on fine-tuning existing … Read more

Accenture forming Nvidia business group

Accenture is forming an Nvidia business group with 30,000 professionals to help enterprises adopt AI, reports Dean Takahashi. The goal is to train Accenture’s team to advise clients on process optimization and scaling AI solutions. Lan Guan, Chief AI Officer at Accenture, emphasizes the growing demand for generative AI. This new group expands the existing … Read more