Google’s Gemma 3 models now run on consumer GPUs through quantization

Google has released new versions of its Gemma 3 AI models that can run on consumer-grade graphics cards through a technique called Quantization-Aware Training (QAT). This development makes powerful AI models accessible to users without high-end hardware. The company announced that QAT dramatically reduces memory requirements while maintaining high quality performance. Gemma 3’s largest 27B …

Read more

Kagi Assistant now available to all users with no price increase

Paid search engine Kagi has announced that its AI assistant feature is now available to all users across all subscription plans at no additional cost. According to Kagi’s announcement, the Assistant combines access to leading large language models (LLMs) with optional integration of Kagi Search results. The tool was previously exclusive to Ultimate subscribers but …

Read more

Google introduces Gemini 2.5 Flash with adjustable “thinking” capabilities

Google has released Gemini 2.5 Flash in preview, offering developers unprecedented control over the AI model’s reasoning capabilities. This new version allows users to toggle “thinking” on or off and set specific “thinking budgets” to balance quality, cost, and response time. The pricing structure reveals the cost impact of reasoning: input costs $0.15 per million …

Read more

Switching between AI models proves more complex than expected

Enterprise teams switching between large language models (LLMs) face numerous hidden challenges beyond simply changing API keys. According to an article by Lavanya Gupta, treating model migration as “plug-and-play” often leads to unexpected problems with output quality, costs, and performance. The report explores the complexities of moving between models like GPT-4o, Claude, and Gemini. Key …

Read more

Sam Altman discusses AI safety, ethics and recent OpenAI developments

In a wide-ranging interview at TED2025, OpenAI CEO Sam Altman addressed pressing questions about artificial intelligence safety, ethics, and the company’s future plans. The conversation, conducted by TED curator Chris Anderson, covered topics from OpenAI’s growth trajectory to the ethical implications of generative AI models. Altman revealed that ChatGPT has reached approximately 500 million weekly …

Read more

Guide: GPT-4.1 prompts require more precise instructions

OpenAI has released a comprehensive prompting guide for its new GPT-4.1 family of models, highlighting significant improvements in coding capabilities, instruction following, and long context handling compared to GPT-4o. According to the guide published by OpenAI, developers may need to migrate their prompts because GPT-4.1 follows instructions more literally than previous versions, which tended to …

Read more

ByteDance leverages data from a billion users to power AI ambitions

China’s ByteDance is transforming data from its popular apps TikTok, Douyin, and Toutiao into a competitive advantage in artificial intelligence, reports Meaghan Tobin of The New York Times. The company collects behavioral data from approximately 170 million U.S. TikTok users and around 1 billion users of its Chinese apps. This wealth of information has become …

Read more

Report: Google’s Gemini 2.5 outperforms competitors across AI benchmarks

Google is leading the AI race with its Gemini 2.5 Pro Experimental model, which currently ranks as the best performing AI model across multiple benchmarks. According to Alberto Romero in his newsletter The Algorithmic Bridge, Google now dominates on every AI front. The model tops leaderboards including LMArena, GPQA Diamond, and Humanity’s Last Exam, outperforming …

Read more

OpenAI launches o3 and o4-mini with enhanced reasoning and visual capabilities

OpenAI has released two new AI models, o3 and o4-mini, designed to advance reasoning capabilities and introduce novel features like “thinking with images.” These models represent the company’s latest development in its o-series, coming just days after the release of GPT-4.1. The models’ most distinctive feature is their ability to not just recognize images but …

Read more

ChatGPT becomes most downloaded app globally, beating Instagram and TikTok

ChatGPT surpassed Instagram and TikTok to become the world’s most downloaded non-game app in March 2024, according to app intelligence provider Appfigures. Sarah Perez of TechCrunch reports that the AI chatbot saw downloads increase by 28% from February, reaching 46 million new installations. This represents ChatGPT’s biggest month ever and its first time topping the …

Read more