TNW→ original

Anthropic Releases Claude Opus 4.7 with Best Results in Coding and Agentic Tasks

Anthropic released Claude Opus 4.7 — its strongest publicly available model. The company claims SWE-bench Pro leadership with 64.3% versus 57.7% for GPT-5.4…

AI-processed from TNW; edited by Hamidun News
Anthropic Releases Claude Opus 4.7 with Best Results in Coding and Agentic Tasks
Source: TNW. Collage: Hamidun News.
◐ Listen to article

Anthropic has released Claude Opus 4.7 and is betting that the next stage of competition between AI models will be decided not only by the quality of responses, but also by the ability to stably execute long work chains. The new version is positioned as the company's strongest publicly available model: it writes and fixes code better, maintains multi-step context more confidently, and makes significantly fewer errors when working with tools.

For the market, this is an important moment also because it is not about an experimental showcase, but about a model that can be purchased and integrated into workflows right now. The main argument of the release is the results on SWE-bench Pro, one of the most notable benchmarks for evaluating models' ability to solve real engineering tasks. According to the company, Claude Opus 4.

7 scored 64.3%, while GPT-5.4 showed 57.

7%. For the market, this is an important signal: the focus is no longer on the abstract "intelligence" of the model, but on how well it can understand code bases, find bugs, suggest patches, and bring tasks to a working result. Such tests are watched especially carefully by teams that are implementing AI in development, support, and internal automation.

The second emphasis of Anthropic is agentic behavior. The company speaks of stronger coordination of multiple agents in scenarios that can last hours. It is about tasks where the model not only answers a single request, but plans steps, invokes tools, checks intermediate results, and continues work without constant human intervention.

It is precisely in this class of tasks that the difference between an impressive demo and a system that can be integrated into a real process is most clearly manifested: the longer the chain of actions, the more expensive errors become, loss of context, and incorrect tool calls. Compared to previous versions, Anthropic also reports a 14-percent improvement in multi-step agentic reasoning and three times fewer errors when working with tools. If these figures are confirmed in practice, this could be even more important than the difference in a single separate benchmark.

For corporate users, reliability is usually valued more than maximum peak quality: if the model less often "breaks" workflows, does not lose state, and more correctly invokes external services, it is easier to allow it to perform operations related to code, analytics, documents, and internal bots. The company separately points out a threefold increase in image resolution, which expands scenarios where the model can be used to read diagrams, interfaces, charts, and other visually rich materials. The pricing remained in the range that Anthropic already uses for senior models: 5 dollars per million input tokens and 25 dollars per million output tokens.

This does not look like an attempt to dump the market, but makes the release understandable for existing customers: the company sells not just another increment in quality, but a more reliable tool for complex work. For teams that count economics through completed tasks, not just through token price, this could be a strong argument: a more stable model requires fewer manual checks, repeated runs, and error corrections after failed calls. Against the backdrop of the race between Anthropic, OpenAI, Google, and other players, this move looks logical.

Right now, the winner is not the one who loudly announces "universal intelligence," but the one whose model better handles applied tasks: writes code, manages tools, endures long sessions, and delivers predictable results in combat conditions. The conclusion is simple: Claude Opus 4.7 is not a cosmetic update, but Anthropic's bid for leadership in the segment of models for development and agentic automation.

If the promised improvements match what teams see in production, pressure on competitors will intensify not because of pretty comparison tables, but because of a more practical question — which model is cheaper and safer to place at the center of a real workflow.

ZK
Hamidun News
AI news without noise. Daily editorial selection from 400+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.

Want to stop reading about AI and start using it?

AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.

What do you think?
Loading comments…