Microsoft’s Phi-4 AI model achieves high performance with fewer resources

Microsoft has introduced a new AI model that delivers superior mathematical reasoning capabilities while using significantly less computing power than larger competitors. According to Michael Nuñez’s report in VentureBeat, the 14-billion-parameter Phi-4 model outperforms larger systems like Google’s Gemini Pro 1.5. The model excels particularly in mathematical problem-solving, achieving top scores on standardized math competition … Read more

ServiceNow releases open-source AI training accelerator

ServiceNow has launched Fast-LLM, an open-source framework that speeds up artificial intelligence model training by 20%. As reported by Sean Michael Kerner for VentureBeat, the technology has already proven successful in training ServiceNow’s StarCoder 2 language model. Fast-LLM introduces two key innovations: “Breadth-First Pipeline Parallelism” for optimized computation ordering and improved memory management that reduces … Read more

Meta’s Llama 3.3 70B model runs GPT-4 level AI on high-end laptops

Meta has released Llama 3.3 70B, a new large language model that achieves GPT-4 level performance while running on high-end consumer laptops. The breakthrough was documented by developer Simon Willison testing the model on a 64 GB MacBook Pro M2, demonstrating capabilities comparable to much larger models like Meta’s own Llama 3.1 405B. The model … Read more

AI coding tools show limitations despite productivity gains

A comprehensive analysis reveals that artificial intelligence-assisted coding tools, while boosting developer productivity, are not necessarily leading to better software quality. Software engineer Addy Osmani writes that AI tools can help developers achieve about 70% of a project quickly but struggle with the crucial final 30% that makes software production-ready. The report identifies that experienced … Read more

New AI architecture STAR reduces model cache size by 90 percent

MIT startup Liquid AI has developed a new AI framework called STAR (Synthesis of Tailored Architectures) that significantly improves upon traditional Transformer models. As reported by Carl Franzen for VentureBeat, the system uses evolutionary algorithms to automatically generate and optimize AI architectures. The STAR framework achieved a 90% reduction in cache size compared to traditional … Read more

Hume AI releases voice customization tool for developers

Hume AI has launched Voice Control, a new feature that enables developers to create custom AI voices by adjusting vocal characteristics through an interface with sliding controls. As reported by Carl Franzen for VentureBeat, the tool allows users to modify voices along ten different dimensions including assertiveness, confidence, and enthusiasm without requiring coding skills. The … Read more

Pinecone enhances vector database with new retrieval system

Pinecone has introduced significant updates to its vector database platform, including a new cascading retrieval system that combines dense and sparse vector capabilities. According to an article by Sean Michael Kerner in VentureBeat, the company claims these improvements can increase enterprise AI accuracy by up to 48%. The update features new reranking technologies, including Cohere’s … Read more

AWS announces major AI infrastructure and service updates at re:Invent 2024

Amazon Web Services (AWS) has unveiled several significant artificial intelligence developments at its re:Invent 2024 conference. The announcements focus on new hardware, software, and services designed to enhance AI capabilities for business customers. The company introduced the general availability of its Trainium2 chips, which AWS claims are four times faster than their predecessors. These chips … Read more

Anomalo launches unstructured data quality monitoring for AI systems

Anomalo has expanded its data quality platform to handle unstructured data monitoring for enterprise AI applications. As reported by Sean Michael Kerner for VentureBeat, the new solution aims to reduce AI deployment time by 30% through improved data quality control. The platform adds structured metadata to unstructured documents, helping organizations identify sensitive information and data … Read more

New AI model combines speech recognition with privacy protection

Israeli startup aiOla has released Whisper-NER, an open-source AI model that transcribes audio while automatically masking sensitive information. As reported by Carl Franzen for VentureBeat, the model builds upon OpenAI’s Whisper framework and combines automatic speech recognition with named entity recognition to protect private data during transcription. The tool can identify and obscure sensitive details … Read more