Why OpenAI’s new models kept obsessing over goblins and how the company finally stopped it
OpenAI has identified the root cause of an unusual behavior in its latest AI models: a tendency to spontaneously mention goblins, gremlins, raccoons, trolls, ogres, and pigeons in otherwise unrelated conversations. The company traced the problem to a flaw in how it trained models to adopt different personality styles. The issue first appeared after the …