Skip to content

Unsupervised quark/gluon jet tagging with Poissonian Mixture Models

Topic
natural sciences
Categories
physics
Reading Time 3 min
Abstract

Ever wondered how we can classify jets at the Large Hadron Collider without relying on pre-existing labels? Discover how unsupervised learning is revolutionizing particle physics! Learn how a Poissonian model is changing the game.

Tags
natural-sciencesphysicsgluonjetmixturemodelspoissonianquark

Ever wondered how we can classify jets at the Large Hadron Collider without relying on pre-existing labels? Discover how unsupervised learning is revolutionizing particle physics! Learn how a Poissonian model is changing the game.



  1. What is the main objective of the research presented in the paper? The research aims to develop an unsupervised learning algorithm for classifying jets produced by quarks or gluons. This is crucial for new physics searches at high-energy colliders, such as the LHC.

  2. Why is unsupervised learning important for this task? Existing quark/gluon tagging methods rely heavily on Monte Carlo simulations, which can introduce theoretical and systematic uncertainties. An unsupervised approach minimizes these biases by learning directly from the data without relying on pre-existing labels or simulations.

  3. Which observable is used for jet classification and why? The research utilizes the Iterative SoftDrop Multiplicity (nSD) as the tagging observable. This choice is motivated by nSD’s IRC safety and its Poisson-like distribution at leading-logarithmic accuracy, which allows for the construction of a simple and interpretable probabilistic model.

  4. How does the unsupervised learning algorithm work? The algorithm models the nSD distribution as a mixture of two Poissonians, representing quark- and gluon-initiated jets. By employing techniques like Expectation-Maximization or Stochastic Variational Inference, it estimates the rates of these Poissonians (λq and λg) and their mixing proportions (πq and πg).

  5. How is the performance of the unsupervised classifier evaluated? Both supervised and unsupervised metrics are used for evaluation. Supervised metrics like accuracy, mistag rate, and AUC rely on the true labels from the Monte Carlo datasets, while unsupervised metrics like Hellinger distance and KL divergence assess the consistency between the learned model and the observed data distribution.

  6. How does the classifier handle detector effects? The study incorporates a simplified detector effect simulation by smearing the angular coordinates of jet constituents. Results indicate that the classifier’s accuracy remains robust against these effects, suggesting that the underlying assumptions of the model hold even with detector distortions.

  7. What are the limitations of the proposed method? The algorithm currently relies on the approximation that nSD follows a Poissonian distribution, which may not be perfectly accurate beyond leading-logarithmic order. Additionally, the performance may be affected by unbalanced datasets or contamination from heavy quarks, requiring further model extensions to address these complexities.

  8. What are the potential implications and future directions of this research? The development of an unsupervised, data-driven quark/gluon tagger offers a promising avenue for reducing biases in jet classification and enhancing the sensitivity of new physics searches. Future research could explore incorporating more refined theoretical models for nSD and extending the algorithm to handle more realistic detector simulations and complex data scenarios.


Understanding these findings helps advance our knowledge and inform better decisions. This research represents an important contribution to the field. For the full details, watch the video above and explore the linked resources.


#machinelearning #particlephysics #ai #lhc #unsupervisedlearning #datascience


unsupervised quark gluon jet tagging with poissonian mixture models