Wired→ original

OpenAI Banned Codex from Talking About Goblins and Pigeons in AI Agent Instructions

OpenAI's Codex system instructions contained an unusual string prohibiting the agent from discussing goblins, gremlins, pigeons, and other creatures unless…

AI-processed from Wired; edited by Hamidun News
OpenAI Banned Codex from Talking About Goblins and Pigeons in AI Agent Instructions
Source: Wired. Collage: Hamidun News.
◐ Listen to article

In the system instructions of Codex, OpenAI's new AI tool for programming, an unusual rule was found: the model is directly forbidden from mentioning goblins, gremlins, raccoons, trolls, ogres, pigeons and other creatures if it is not related to the user's request. Based on the reaction of developers and users, the rule appeared after a quite real bug in agent mode.

What was found in the instructions

The impetus was a string in the Codex CLI instructions — a command-line tool that uses a model to generate and edit code. The formulation leaves no room for imagination: the agent is instructed not to talk about "goblins, gremlins, raccoons, trolls, ogres, pigeons and other animals or creatures" unless it is "absolutely and unambiguously" related to the request. Moreover, as users noted, this restriction is repeated several times, meaning it is not a random insertion, but a consciously established behavioral rule.

At first glance, this looks like an internal team joke, but the context of the story is quite serious. OpenAI has just increased its stakes in programming: GPT-5.5 was released with improved coding capabilities, and competition with Anthropic and other players for the AI-tools-for-developers market has intensified sharply. Against this backdrop, any strangeness in the model's behavior ceases to be a harmless curiosity and turns into a product problem, especially if the agent not only works in chat but also manages real applications.

Where the goblins came from

It appears the ban did not come out of nowhere. After screenshots of the instruction were shared on X, users began recalling that OpenAI models in conjunction with OpenClaw indeed sometimes got stuck on such vocabulary. One developer wrote that his claw "suddenly became a goblin" after switching to Codex 5.5. Another noted that the agent constantly called bugs "goblins" and "gremlins". For a regular chatbot this would look like a strange manner of speech, but for a coding agent — like noise that interferes with understanding the output and trusting it.

"Now it's clear why my claw suddenly became a goblin with

Codex 5.5".

WIRED attributes this to how modern models behave inside agent frameworks. The base model predicts the next token and usually stays within task bounds, but in agent mode, memory, system instructions, a set of roles, and service prompts are added to the prompt. The longer and more complex such a chain becomes, the higher the chance that the model will latch onto repetitive metaphors, random associations, or inappropriate style. If the AI is managing a computer, answering emails, or processing purchases, even a minor verbal mutation becomes a signal that the loop needs strict constraints.

How a meme became a product

The story very quickly went beyond the engineering chat and turned into a meme. Users began posting generated scenes with goblins in data centers, and plugins with a game-like "goblin mode" even appeared for Codex. But more importantly: OpenAI employees actually confirmed that the ban is related to the model's actual behavior. Codex developer Nick Pash, in response to the discussion, wrote that this was "really one of the reasons". That is, the company is not just joking, but closing a specific class of behavioral failures.

  • Screenshots of the instruction quickly spread across social media
  • Users began sharing similar incidents in OpenClaw
  • Plugins and memes with "goblin mode" appeared around Codex
  • Sam Altman picked up the wave with a joke about "extra goblins" during GPT-6 training

A separate nuance is the role of OpenClaw. This tool allows you to connect almost any model to a computer, give it access to applications, and choose different personas for the assistant. OpenAI acquired OpenClaw in February shortly after the project's viral rise, so the behavior of models within such an interface is no longer external exotica for the company, but part of its own platform. The deeper OpenAI goes into autonomous agents, the more important it becomes to suppress not only dangerous errors, but also strange speech shifts.

What this means

The goblin story may seem like an anecdote, but in fact it shows something more important: AI-agent developers increasingly treat not only factual errors, but also the model's thinking style in complex scenarios. For the coding-assistant market, this is a signal that the battle is already being fought at the level of fine-tuning behavior. An agent's reliability today is determined not only by the quality of code it writes, but also by how predictably it communicates and acts under load.

ZK
Hamidun News
AI news without noise. Daily editorial selection from 400+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.

Want to stop reading about AI and start using it?

AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.

What do you think?
Loading comments…