OpenAI Tries “Confessions” Feature to Help AIs Admit Their Mistakes
By
Olivia Grant
OpenAI introduces an experimental “confessions” system that rewards honest self-reporting of mistakes in GPT-5 Thinking models, aiming to improve AI safety and visibility of failures.