New benchmark reveals leading AI models confidently produce false information
A new benchmark called Phare has found that leading large language models (LLMs) frequently generate false information with high confidence, particularly when responding to misinformation-related queries. The research, conducted by Giskard with partners including Google DeepMind, evaluated top models from eight AI labs across multiple languages. The Phare benchmark focuses on four critical domains: hallucination, bias and …