The ATLAS Higgs Machine Learning Challenge

D. Rousseau 1 G. Cowan 2 C. Adam Bourdarios 1 Balázs Kégl 1 C. Germain-Renaud 3, 4 I. Guyon 5
3 TAO - Machine Learning and Optimisation
CNRS - Centre National de la Recherche Scientifique : UMR8623, Inria Saclay - Ile de France, UP11 - Université Paris-Sud - Paris 11, LRI - Laboratoire de Recherche en Informatique
Abstract : High Energy Physics has been using Machine Learning techniques (commonly known as Multivariate Analysis) since the 1990s with Artificial Neural Net and more recently with Boosted Decision Trees, Random Forest etc. Meanwhile, Machine Learning has become a full blown field of computer science. With the emergence of Big Data, data scientists are developing new Machine Learning algorithms to extract meaning from large heterogeneous data. HEP has exciting and difficult problems like the extraction of the Higgs boson signal, and at the same time data scientists have advanced algorithms: the goal of the HiggsML project was to bring the two together by a “challenge”: participants from all over the world and any scientific background could compete online to obtain the best Higgs to tau tau signal significance on a set of ATLAS fully simulated Monte Carlo signal and background. Instead of HEP physicists browsing through machine learning papers and trying to infer which new algorithms might be useful for HEP, then coding and tuning them, the challenge has brought realistic HEP data to the data scientists on the Kaggle platform, which is well known in the Machine Learning community. The challenge has been organized by the ATLAS collaboration associated to data scientists, in partnership with the Paris Saclay Center for Data Science, CERN and Google. The challenge ran from May to September 2014, drawing considerable attention. 1785 teams participated, making it the most popular challenge ever on the Kaggle platform. New Machine Learning techniques have been used by the participants with significantly better results than usual HEP tools. This presentation has two parts: the first one describes how a HEP problem was simplified (not too much!) and wrapped up into an online challenge, the second what was learned from the challenge, in terms of new Machine Learning algorithms and techniques which could have an impact on future HEP analysis.
Type de document :
Communication dans un congrès
21st International Conference on Computing in High Energy and Nuclear Physics – CHEP2015, Apr 2015, Okinawa, Japan
Liste complète des métadonnées
Contributeur : Sabine Starita <>
Soumis le : lundi 13 avril 2015 - 16:14:12
Dernière modification le : jeudi 5 avril 2018 - 12:30:12


  • HAL Id : in2p3-01141742, version 1


D. Rousseau, G. Cowan, C. Adam Bourdarios, Balázs Kégl, C. Germain-Renaud, et al.. The ATLAS Higgs Machine Learning Challenge. 21st International Conference on Computing in High Energy and Nuclear Physics – CHEP2015, Apr 2015, Okinawa, Japan. 〈in2p3-01141742〉



Consultations de la notice