Multicore in production: advantages and limits of the multiprocess approach in the ATLAS experiment

Abstract : The shared memory architecture of multicore CPUs provides HEP developers with the opportunity to reduce the memory footprint of their applications by sharing memory pages between the cores in a processor. ATLAS pioneered the multi-process approach to parallelize HEP applications. Using Linux fork() and the Copy On Write mechanism we implemented a simple event task farm, which allowed us to achieve sharing of almost 80% of memory pages among event worker processes for certain types of reconstruction jobs with negligible CPU overhead. By leaving the task of managing shared memory pages to the operating system, we have been able to parallelize large reconstruction and simulation applications originally written to be run in a single thread of execution with little to no change to the application code. The process of validating AthenaMP for production took ten months of concentrated effort and is expected to continue for several more months. Besides validating the software itself, an important and time-consuming aspect of running multicore applications in production was to configure the ATLAS distributed production system to handle multicore jobs. This entailed defining multicore batch queues, where the unit resource is not a core, but a whole computing node; monitoring the output of many event workers; and adapting the job definition layer to handle computing resources with different event throughputs. We will present scalability and memory usage studies, based on data gathered both on dedicated hardware and at the CERN Computer Center.
docType_s :
Conference papers
Liliana Teodorescu, David Britton, Nigel Glover, Gudrun Heinrich, Jérôme Lauret, Axel Naumann, Thomas Speer, Pedro Teixeira-Dias. 14th International Workshop on Advanced Computing and Analysis Techniques in Physics Research (ACAT 2011), Sep 2011, Uxbridge, United Kingdom. IOP Publishing, 368, pp.012018, <10.1088/1742-6596/368/1/012018>


http://hal.in2p3.fr/in2p3-00714479
Contributor : Françoise Marechal <>
Submitted on : Wednesday, July 4, 2012 - 4:00:46 PM
Last modification on : Monday, October 13, 2014 - 3:00:43 PM

Identifiers

Collections

Citation

S. Binet, P. Calafiura, M. K. Jha, W. Lavrijsen, C. Leggett, et al.. Multicore in production: advantages and limits of the multiprocess approach in the ATLAS experiment. Liliana Teodorescu, David Britton, Nigel Glover, Gudrun Heinrich, Jérôme Lauret, Axel Naumann, Thomas Speer, Pedro Teixeira-Dias. 14th International Workshop on Advanced Computing and Analysis Techniques in Physics Research (ACAT 2011), Sep 2011, Uxbridge, United Kingdom. IOP Publishing, 368, pp.012018, <10.1088/1742-6596/368/1/012018>. <in2p3-00714479>

Export

Share

Metrics

Consultation de la notice

28