How AI companies are teaching language models to admit their mistakes
Two major tech companies are tackling one of artificial intelligence’s most persistent problems: getting AI systems to stop making things up or hiding their mistakes. OpenAI and Amazon have each developed distinct approaches to make large language models more honest and reliable.

OpenAI’s truth serum

OpenAI researchers introduced a technique called “confessions” that functions like …