McKinsey outlines four steps to scale agentic AI with high-quality data

Q: What is the source?

Originally published on ZDNet AI. Hamidun News processes and adapts the material with AI.

Q: When was it published?

May 2, 2026. Reading time: 3 min.

McKinsey released a practical plan for companies that want to move agentic AI from pilots to operational scale. The firm estimates that fewer than 10% of…

Hamidun News Editorial

AI monitoring · ZDNet AI

May 2, 2026· 3 min

AI-processed from ZDNet AI; edited by Hamidun News

McKinsey outlines four steps to scale agentic AI with high-quality data — Source: ZDNet AI. Collage: Hamidun News.

◐ Listen to article

McKinsey has released recommendations for companies attempting to scale agentic AI but running into obstacles not with models, but with data. The main point is straightforward: without a common data foundation, clear workflows, and robust governance, agents never move beyond pilots into real business processes.

Why pilots stall

According to McKinsey, nearly two-thirds of companies have already experimented with AI agents, but fewer than 10% have managed to scale them to a level that delivers meaningful business value. In eight out of ten cases, data becomes the barrier: it sits in fragmented systems, has different contexts, fails unified quality checks, and is poorly suited for autonomous solutions. As long as humans manually stitch together sources and recheck results, the pilot can still work.

When the same actions must be done constantly and in real time, the structure begins to fall apart. The problem becomes more acute as autonomy increases. A single agent can sequentially visit multiple systems and make decisions based on fragmented information, while a group of specialized agents can pass errors to each other.

That is why McKinsey places at the center not the model itself, but the company's ability to provide agents with stable access to data, clear definitions, traceability, and access rules. Otherwise, automation looks impressive in a demo but breaks down under the first serious volume of operations.

Four initial steps

McKinsey proposes not to restructure the entire company at once, but to start with four interconnected steps that link strategy, architecture, and operating model. The logic is: first choose processes where autonomy actually pays off, then prepare the infrastructure, and only then scale up. This approach is needed to avoid spending months on expensive pilots with no repeatable effect and to prevent old data problems from carrying over into new agent infrastructure.

Select 1-2 high-impact workflows and assess them by value, feasibility, and strategic impact
Update data architecture layers so agents can safely exchange context
Move from one-off cleanups to continuous quality control of structured and unstructured data
Introduce an operating model and governance: roles, access rights, logs, policies, and human approval points

A particular emphasis is placed on executing all four steps in coordination. If a company chose the right use case but kept the old architecture, the agent will hit incompatible systems. If the architecture is already modern but lacks access rules and action logging, scaling will quickly become a risk to security, compliance, and business decision quality. McKinsey also recommends validating the approach on targeted pilots with clear metrics and immediately looking for data that can later be reused in adjacent workflows.

What foundation is needed

Under an agent-ready architecture, McKinsey understands not a new monolith but a set of modular layers. Data should enter the company once and then be used for analytics, machine learning, and generative AI, without separate parallel pipelines for each task. A semantic layer plays an important role: it describes what each entity means, how objects relate to each other, and what business rules apply.

In practice, this leads to ontologies, knowledge graphs, and data products with clear ownership, quality, and access interfaces. McKinsey separately emphasizes work with unstructured data—documents, images, correspondence, and ticket histories. For agents to reliably use such content, it must be tagged, classified, indexed via embeddings, and linked to the rest of the corporate data model.

For structured data, the priority is different: not periodic manual cleanup, but continuous quality monitoring, automated validation, anomaly detection, and lineage tracking. The same standards should apply to data created by agents themselves. The final layer is governance around the agent lifecycle.

The company must determine in advance what agents are allowed to do, what data they can access, where human confirmation is needed, and who is responsible for results. This includes credential provisioning, telemetry, action logs, performance monitoring, and automatic policy compliance checks. In such a setup, business teams own their workflows and domain models, while central data and AI teams own the common platforms, guardrails, and oversight.

What this means

The market is gradually shifting from the question "which model to choose" to "on what data and processes will it operate." For companies, this is bad news for quick demos, but good for those ready to build agentic AI as part of the business operating system rather than as an isolated experiment.

Hamidun News

AI news without noise. Daily editorial selection from 400+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.

Telegram channel RSS hamidun.com

Want to stop reading about AI and start using it?

AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.

🎓 Academy — 7 days free Free consultation