OpenAI explains why its models won't discuss goblins
OpenAI published a technical explainer after Wired revealed hidden instructions telling its coding model to avoid goblins, gremlins, raccoons, trolls, ogres, pigeons, and other creatures.
The startup calls it a "strange habit" the model developed on its own—likely from training data patterns—then addressed via filtering rather than retraining.