The Higgs boson machine learning challenge has been set up to promote collaboration between high energy physicists and data scientists. The ATLAS experiment at CERN provided simulated data used by physicists to optimize the analysis of the Higgs boson. The Challenge is organized by a small group of ATLAS physicists and data scientists. It has been hosted by Kaggle at https://www.kaggle.com/c/higgs-boson; thechallenge data is now available on https://opendata.cern.ch/education/ATLAS.The original document provided the scientific and technical background for the Challenge. It has been minimally modified to serve as a permanentdocumentation for the corresponding dataset permanently available from opendata.cern.c