Deep learning
What is deep learning and how can it be used? Find out more here.
Definition
How deep learning works
In traditional (shallow) machine learning, features often had to be defined manually. For example, in image processing: extracting edges, textures etc., then feeding them to an algorithm. Deep learning automates this process. A multilayer neural network learns end-to-end: from raw input (pixels, raw audio, raw text) to output (e.g. class labels), all transformations are learned by the neurons. Earlier layers learn simple patterns (e.g. edges in images), middle layers learn combinations of these (e.g. shapes, contours) and late layers learn very abstract concepts (faces, objects). This hierarchical feature formation is the heart of deep learning.
It is important to note that deep networks need a lot of data and are trained with backpropagation to adjust the weights. The availability of large data sets (ImageNet, gigantic text corpora) and modern hardware (GPUs, TPUs) was a decisive enabler for deep learning.
Applications
Deep learning has achieved state-of-the-art results in numerous fields:
Meaning
Deep learning is often used almost synonymously with the current success story of AI. It has ended the "AI winter" and ushered in a new era in which machines achieve or surpass human performance in many benchmark tests. However, deep learning is not everything: it requires a lot of data and computing power, has problems with explainability and can be uncertain outside of its training distribution. Nevertheless, the paradigm of chasing data through many layers of processing to learn complex structures is currently the most successful learning method - and research is working on making it even more efficient, robust and understandable.