GPT-Based Fast Simulation of CLAS12 Detector Hits via Conditional Autoregressive Generation

· Source: Machine Learning · Field: Science & Research — Physical Sciences & Chemistry, Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

A GPT-style autoregressive transformer has been developed as a fast surrogate model for the calorimeter within the CLAS12 experiment at the Thomas Jefferson National Accelerator Facility. This model addresses the increasing demand for high-fidelity detector simulation in modern particle physics, where computational requirements often strain available resources. Inspired by large language models, the deep generative model is conditioned on incident momentum and autoregressively generates realistic detector hits across all nine calorimeter layers, represented as sequences of strip, ADC, and TDC tokens. It faithfully reproduces critical physics characteristics, including hit multiplicity, spatial distributions, energy deposits, and the electromagnetic calorimeter's energy-momentum response. The generator achieves inference rates exceeding 700 events per second on a single GPU, offering a substantial speedup compared to traditional Geant4-based simulations while preserving essential physics fidelity for high-luminosity experimental programs.

Key takeaway

For research scientists developing particle physics simulations, this GPT-style autoregressive transformer offers a compelling alternative to traditional Geant4 methods. You should consider integrating similar deep generative models to achieve substantial speedups, potentially exceeding 700 events per second on a single GPU, without compromising physics fidelity. This approach can significantly reduce computational bottlenecks in high-luminosity experimental programs, allowing for more extensive data analysis and faster iteration cycles.

Key insights

A GPT-style transformer rapidly simulates CLAS12 detector hits, maintaining physics fidelity with significant speedup over traditional methods.

Principles

Method

A GPT-style autoregressive transformer is conditioned on incident momentum to generate sequences of strip, ADC, and TDC tokens across nine calorimeter layers.

In practice

Topics

Best for: AI Scientist, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.