Unsupervised learning
What is unsupervised learning and how is it implemented? Learn more about it here.
Definition
Unsupervised learning is an ML method in which the algorithm is trained on unlabeled data - i.e. no predefined correct answers are provided. The system must discover structures, patterns or correlations in the input data itself. In contrast to supervised learning, there is no direct right or wrong from the trainer. Unsupervised learning often aims to find hidden clusters, groupings or data distributions or to reduce the dimensionality of the data (for visualization or pre-processing).
Methods and examples
Typical types of unsupervised learning tasks are
Advantages and challenges
Unsupervised learning is particularly useful when there is little or no labeled data – which is the case in many real-world cases, as label creation can be expensive. It can reveal hidden structures that were not previously thought of. However, evaluation is often more difficult: since there is no clear target, the human has to interpret the discovered structures and evaluate whether they make sense. Two people could interpret different cluster solutions differently. It can also happen that an algorithm finds trivial patterns or patterns that are irrelevant to the question – human expertise is important to validate results.