Конституция Anthropic: Claude теперь живет по правилам (и это важно)
Пока OpenAI гонится за чистой мощностью, Anthropic строит «этическую крепость». Компания раскрыла детали своей «Конституции» — набора принципов, которыми Claude
AI-processed from Habr AI; edited by Hamidun News
Imagine you're creating something potentially smarter than you and trying to explain to it what is "good" and what is "bad." This isn't a Hollywood blockbuster scenario, but the everyday reality of Anthropic. While the industry debates how many GPUs you need for happiness, Claude's creators decided to take a different path and write a real Constitution for their neural network. This isn't just a list of forbidden words or hard filters at the input. It's an attempt to embed in the code something like a conscience that should work even when developers aren't looking.
From the start, Anthropic positioned itself as a "safety lab" founded by OpenAI alumni who disagreed with Sam Altman on commercialization and ethics issues. Since then, Dario Amodei's company has methodically built an image as the most responsible player in the market. Publishing the Constitution's text isn't just a gesture of goodwill—it's a strategic move in the big game for influence over world governments. When regulators in Washington or Brussels ask how we'll control the future superintelligence, Anthropic already has a ready answer in the form of a document. They're not just training a model; they're trying to come to an agreement with it, creating clear frameworks for coexistence.
The approach is based on Constitutional AI technology. Unlike the familiar reinforcement learning from human feedback (RLHF) method, where real people spend hours labeling neural network responses, Anthropic uses the model itself for self-editing. The process looks almost philosophical: models are given a list of principles—from the UN Universal Declaration of Human Rights to Apple's data protection rules—and asked to criticize their own drafts for compliance with these norms. As a result, Claude learns not just to imitate human preferences but to follow the logic of the values embedded in it. This saves the company from having to maintain an army of low-wage moderators in developing countries, which is itself a strong ethical move.
Why does the industry need this? First, it's scalable. Human resources are limited, but a neural network can analyze its own errors infinitely. Second, it makes the model's behavior more predictable and transparent. If Claude refuses to answer a question, it does so not because of algorithm whim, but because a specific point in the Constitution forbids it from violating privacy or encouraging violence. This creates a foundation of trust that's critically important for the corporate sector. Business doesn't need an AI that can suddenly say something unpolitical or dangerous. They need a tool with clearly defined boundaries of responsibility.
However, behind the beautiful words about safety lies a fierce struggle for leadership. Dario Amodei in his recent essays directly hints that we're at a critical point in civilization's development, which he calls the "teenage period of technology." Anthropic is trying to seize from OpenAI the role of chief intellectual center that sets the rules of the game. If OpenAI is an aggressive startup that wants to give people a tool for everything, then Anthropic is a kind of "adult in the room" insisting that power without control is meaningless and dangerous.
Interestingly, the Constitution is continuously being refined. It's a living document that reflects current societal fears and hopes. It includes provisions on the inadmissibility of discrimination, the importance of providing objective information, and even how AI should behave in situations where human values conflict. It's reminiscent of an attempt to create an ideal digital citizen who always follows the letter of the law while being intelligent enough to understand its spirit. Ultimately, Anthropic's success will depend on whether users believe in this "artificial morality" or see it as just another layer of corporate censorship.
The key question: Will Anthropic's Constitution become a universal standard for all LLMs, or will we see a war of "corporate moralities" where each model has its own ideas about good and evil?
Want to stop reading about AI and start using it?
AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.