.

Active learning


Reading time: less than 1 minute

Active learning is a data labeling / annotation method in machine learning. Instead of labeling the data randomly or exhaustively, active learning uses a machine learning model to find difficult samples that need human labeling.

Here’s roughly how it works.

  1. Train an ML model with the currently labeled samples.
  2. Using the model, predict on all the unlabeled data.
  3. Pick the sample that the model is most uncertain about. For binary classification, this would be the sample that was closest to 0.5.
  4. Ask the human to label that sample to get the ground truth. Add this new sample to your training set.
  5. Go to 1 and repeat until there is enough data to train your model to sufficient accuracy.

https://distill.pub/2020/bayesian-optimization/

Citation

If you find this work useful, please cite it as:
@article{yaltirakli,
  title   = "Active learning",
  author  = "Yaltirakli, Gokberk",
  journal = "gkbrk.com",
  year    = "2025",
  url     = "https://www.gkbrk.com/active-learning"
}
Not using BibTeX? Click here for more citation styles.
IEEE Citation
Gokberk Yaltirakli, "Active learning", February, 2025. [Online]. Available: https://www.gkbrk.com/active-learning. [Accessed Feb. 22, 2025].
APA Style
Yaltirakli, G. (2025, February 22). Active learning. https://www.gkbrk.com/active-learning
Bluebook Style
Gokberk Yaltirakli, Active learning, GKBRK.COM (Feb. 22, 2025), https://www.gkbrk.com/active-learning

Comments

© 2025 Gokberk Yaltirakli