OpenAI Admits Failure: Why New ChatGPT Became Dumber
Imagine this situation: you update an application that became your main work tool, and suddenly you notice it started performing poorly. It makes mistakes in…
AI-processed from Futurism; edited by Hamidun News
Imagine this situation: you update an application that became your main work tool, and suddenly you notice it started performing poorly. It makes mistakes in simple calculations, is lazy about writing complete code, and responds with template phrases where it previously showed flexibility. For a long time, OpenAI fans attributed this to the placebo effect or technology fatigue, but now the cards are on the table. Sam Altman officially admitted that the company made the new version of ChatGPT worse than the previous one. His phrase "I think we just fucked that up" sounds like a rare display of sincerity in a world where any bug is typically called a feature.
Let's recall the context. When GPT-4 was released, it seemed like magic. But with each subsequent update — from Turbo to 4o — the developer community cried out louder about degradation. The model began suffering from "neural laziness." Instead of writing a hundred lines of code, it would output ten and suggest the user complete the rest themselves. OpenAI tried to optimize computational costs and make responses faster, but in the process, it apparently threw out the baby with the bathwater. Speed increased, but intelligence declined.
Why did this happen now? The answer lies in incredible market pressure. OpenAI is no longer the only player on the field. When Claude 3.5 Sonnet from Anthropic started outperforming GPT-4o in coding and creativity tests, panic clearly erupted at Altman's headquarters. In a rush, updates are rolled out that should be "safer" and "faster," but AI training algorithms are a finicky thing. Excessive censorship and attempts to squeeze the model into rigid ethical frameworks often result in the neural network becoming cautious and ultimately delivering mediocre results.
Technically, this is explained by skewing in RLHF — reinforcement learning from human feedback. If human trainers reward brevity, the model becomes absurdly concise. If they demand safety, the model starts seeing threats even in a harmless question about how to cook soup. OpenAI, apparently, overtightened the screws in an attempt to make the product mainstream and corporate-safe. As a result, professionals who needed a powerful tool for work felt betrayed.
It's interesting that Altman admits this precisely now. This could be a subtle move before announcing something truly large-scale, such as GPT-5 or the Strawberry model. Admitting past mistakes is the best way to prepare the ground for selling a new "revolutionary" solution. Like, yes, the previous version was a failure, but look how we fixed everything in the new one. This is classic audience priming that we've already seen many times from Apple and other tech giants.
For the industry, this is an important lesson. It turns out that the scaling law doesn't work linearly. You can't just pour in more data and computing power to get a smarter system. Data quality and tuning precision matter much more. While OpenAI engages in self-flagellation, their competitors get an excellent chance to poach their loyal audience. After all, in the AI world, brand loyalty lasts only until another browser tab starts thinking faster.
The bottom line: OpenAI officially confirmed the regression of its models. Can the company regain the trust of professionals by releasing GPT-5, or is Altman's era of dominance coming to an end?
Want to stop reading about AI and start using it?
AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.