Training

Supervised Learning

Supervised learning is a machine learning paradigm in which a model is trained on a labeled dataset of input-output pairs to learn a mapping function, then applied to predict outputs for unseen inputs; it underlies most production ML systems from spam filters to medical image classifiers.

Supervised learning is the dominant machine learning paradigm, in which a model is trained on a fixed dataset of (input, label) pairs to approximate a function mapping inputs to outputs. The model's parameters are adjusted to minimize a loss function measuring discrepancy between predictions and ground-truth labels, using gradient-based optimization such as stochastic gradient descent and its variants (Adam, AdamW). At inference time, the trained model is expected to generalize — producing accurate predictions on inputs not seen during training.

The paradigm covers two primary task types: classification, where the output is a discrete category (spam detection, object recognition across ImageNet's 1,000 classes, sentiment analysis), and regression, where the output is a continuous value (house price estimation, temperature forecasting). Architectures range from logistic regression and gradient-boosted decision trees — widely used on industry tabular data — to convolutional neural networks for images and Transformer models for language. The standard workflow since roughly 2018 is to pre-train a large model on a massive corpus using self-supervised objectives, then fine-tune on smaller task-specific labeled datasets, dramatically reducing annotation requirements.

Supervised learning's central constraint is the need for labeled data, which is expensive and domain-specific. Medical imaging annotation requires radiologist time; legal document classification requires attorney review. Active learning, data augmentation, and transfer learning have become standard techniques for managing this cost. Instruction-following language models have introduced supervised fine-tuning (SFT) as a critical alignment step: human-curated (prompt, completion) pairs teach a pre-trained model to respond helpfully to instructions before further alignment via RLHF or DPO.

As of 2026, supervised learning remains foundational across production AI systems. Fraud detection, speech recognition, clinical decision support, and the SFT stages of every major LLM pipeline all rely on it. Benchmark performance on canonical supervised tasks — ImageNet for vision, GLUE and SuperGLUE for NLP, SQuAD for reading comprehension — has approached or exceeded human-level accuracy on most metrics, shifting research emphasis toward out-of-distribution generalization, robustness to label noise, and learning from fewer labeled examples.

Example

Google trains its medical imaging classifier to detect diabetic retinopathy by exposing it to hundreds of thousands of retinal photographs labeled by ophthalmologists; after training, the model achieves specialist-level screening accuracy on new patient images it has never seen.

← Glossary

Supervised Learning

Example

Related terms