Amazon SageMaker and DVC: end-to-end traceability of ML models from data to prediction

AWS published a guide to end-to-end traceability for ML models using DVC, Amazon SageMaker AI, and MLflow Apps. It covers two patterns: lineage at the dataset level and at the individual-record level. Both are deployed in your AWS account through ready-made notebooks.

Khamidun Zhemal

AI monitoring · AWS Machine Learning Blog

Apr 22, 2026· 2 min

AI-processed from AWS Machine Learning Blog; edited by Hamidun News

Amazon SageMaker and DVC: end-to-end traceability of ML models from data to prediction — Source: AWS Machine Learning Blog. Collage: Hamidun News.

◐ Listen to article

AWS Machine Learning Blog published a detailed guide on building end-to-end ML model tracking — from raw data to final prediction — using three tools: DVC, Amazon SageMaker AI and Amazon SageMaker AI MLflow Apps. The problem that the material addresses is well-known to teams working in production: it's unclear which exact data was used to train a specific model, what transformations were applied to it, and how an individual record influenced the prediction. This is critical during audits, debugging data drift, and compliance with regulatory requirements.

DVC (Data Version Control) is an open-source tool that adds versioning of large files and datasets to Git. In combination with SageMaker, it allows you to capture the exact snapshot of data used in each training run. SageMaker MLflow Apps, in turn, stores metrics, parameters, and experiment artifacts — a standard MLflow server managed by AWS without the need to deploy infrastructure manually.

The authors describe two concrete patterns. The first is dataset-level lineage: the dataset version is recorded (via DVC tag), processing pipeline parameters, and a link to the trained model in MLflow. The second is record-level lineage: each specific record from the training set is tied to the model, which allows answering the question "which exactly examples shaped this prediction".

Both patterns are implemented as Jupyter notebooks ready to run in your AWS account. The architecture looks as follows: data is stored in S3 and versioned by DVC, experiment metadata (parameters, metrics, artifacts) — in MLflow App, and the connection between data version and model version is provided through custom tags and DVC artifacts registered in MLflow. The practical value of the approach is in reproducibility.

If in six months it turns out that the model behaves unexpectedly on a certain cohort of users, engineers will be able to precisely recover which dataset it was trained on, which records it included, and with what hyperparameters the training was launched. The material is aimed at ML engineers and MLOps teams already working in the AWS ecosystem. Ready-made notebooks significantly lower the entry barrier: there is no need to design the integration from scratch — it is enough to adapt the examples to your dataset and processing pipeline.

For companies where auditability of ML systems is important — especially in the financial sector, medicine, and government projects — such an approach becomes a de facto standard. AWS is consistently building a toolkit in which lineage stops being an optional "best practice" and becomes an embedded property of the pipeline.

Hamidun News

AI news without noise. Daily editorial selection from 50+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.

Telegram channel RSS hamidun.com

Want to stop reading about AI and start using it?

AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.

🎓 Academy — 7 days free Free consultation

Amazon SageMaker and DVC: end-to-end traceability of ML models from data to prediction

Want to stop reading about AI and start using it?

The AI world, distilled — once a week