Maciej Lewenstein & Antonio Acín – Institut de Ciències Fotòniques (ICFO)
A team of ICFO researchers reports in PRL an entirely new anomaly detection method capable of training a system in very few iterations. Machine Learning (ML) has the main goal of analyzing and interpreting data structures and patterns in order to learn from them, reason and carry out a decisionmaking task that is completely independent from human reasoning and engagement. Even though this field of study started in the mid 1900s, recent developments in the area have revolutionized the way on how we can process and find correlations in complex data.
Contrary to supervised learning, unsupervised learning seeks to discover patterns or classify information in large data sets into categories without prior knowledge. That is, it does not have labeled outputs, which means that it basically infers the natural structure that a dataset may have and extracts categorized information from it. This learning has proved to be very efficient for identifying phases and phase transitions of many-body systems. In a study recently published in Physical Review Letters, ICFO researchers Korbinian Kottmann and Patrick Huembeli, led by ICREA Professors at ICFO Antonio Acín and Maciej Lewenstein report on a method that uses an unsupervised machine learning technique based on anomaly detection to automatically map out the phase diagram of a quantum many body system given unlabeled data.
The following example is very illustrative of what they have achieved. In machine learning the most common and known classification task example is to discriminate, for instance, images of cats and dogs. In the study, the method the researchers use anomaly detection, which handles the classification task of discriminating dogs and everything that is not a dog, approaching the system is an entirely different perspective. The idea is to train a special neural network called an autoencoder to efficiently compress and reproduce images of dogs. If the network is later fed with images of cats, the network does not know how to efficiently compress the features of the cat image and it is possible to tell from the higher reconstruction loss that it is not a dog.