Parallelizing ATLAS Reconstruction and Simulation: Issues and Optimization Solutions for Scaling on Multi- and Many-CPU Platforms

C. Leggett; S. Binet; K. Jackson; D. Levinthal; M. Tatarkhanov; Y. Yao

doi:10.1088/1742-6596/331/4/042015

Conference paper, Year: 2011


C. Leggett

Role: Author

Lawrence Berkeley National Laboratory [Berkeley]

S. Binet

Role: Author

Laboratoire de l'Accélérateur Linéaire

K. Jackson

Role: Author

Lawrence Berkeley National Laboratory [Berkeley]

D. Levinthal

Role: Author

M. Tatarkhanov

Role: Author

Lawrence Berkeley National Laboratory [Berkeley]

Y. Yao

Role: Author

Lawrence Berkeley National Laboratory [Berkeley]

Abstract

Thermal limitations have forced CPU manufacturers to shift from simply increasing clock speeds to improve processor performance to producing chip designs with multi- and many-core architectures. Furthermore, the cores themselves can run multiple threads with a zero-overhead context switch, allowing low-level resource sharing (Intel Hyper-Threading). To maximize bandwidth and minimize memory latency, memory access has become non-uniform (NUMA). As manufacturers add more cores to each chip, a careful understanding of the underlying architecture is required in order to fully utilize the available resources. We present AthenaMP and the ATLAS event loop manager, the driver of the simulation and reconstruction engines, which have been rewritten to make use of multiple cores by means of event-based parallelism and final-stage I/O synchronization. However, initial studies on 8- and 16-core Intel architectures have shown marked non-linearities as parallel process counts increase, with as much as 30% reductions in event throughput in some scenarios. Since the Intel Nehalem architecture (both Gainestown and Westmere) will be the most common choice for the next round of hardware procurements, an understanding of these scaling issues is essential. Using hardware-based event counters and Intel's Performance Tuning Utility, we have studied the performance bottlenecks at the hardware level and discovered optimization schemes to maximize processor throughput. We have also produced optimization mechanisms, common to all large experiments, that address the extreme nature of today's HEP code, which, due to its size, places huge burdens on the memory infrastructure of today's processors.

Domains

Modeling and simulation; Performance and reliability [cs.PF]

Contributor: Sabine Starita

https://hal.in2p3.fr/in2p3-00657498

Submitted on: Friday, January 6, 2012, 16:52:46

Last modified on: Thursday, April 11, 2024, 13:18:11

Dates and versions

in2p3-00657498 , version 1 (06-01-2012)

Identifiers

HAL Id: in2p3-00657498, version 1
DOI: 10.1088/1742-6596/331/4/042015

Cite

C. Leggett, S. Binet, K. Jackson, D. Levinthal, M. Tatarkhanov, et al. Parallelizing ATLAS Reconstruction and Simulation: Issues and Optimization Solutions for Scaling on Multi- and Many-CPU Platforms. Conference on Computing in High Energy and Nuclear Physics 2010 (CHEP 2010), Oct 2010, Taipei, Taiwan. pp.042015, ⟨10.1088/1742-6596/331/4/042015⟩. ⟨in2p3-00657498⟩


Collections

IN2P3 LAL CNRS TDS-MACS UNIV-PARIS-SACLAY

