Stanford researchers develop test to measure AI chatbot flattery
Stanford University researchers have created a new benchmark to measure excessive flattery in AI chatbots after OpenAI rolled back updates to GPT-4o due to complaints about overly polite responses. The research, conducted with Carnegie Mellon University and University of Oxford, was reported by Emilia David. The team developed “Elephant,” a test that evaluates how much …