Conference papers

Multicore in production: advantages and limits of the multiprocess approach in the ATLAS experiment

Abstract : The shared memory architecture of multicore CPUs provides HEP developers with the opportunity to reduce the memory footprint of their applications by sharing memory pages between the cores in a processor. ATLAS pioneered the multi-process approach to parallelize HEP applications. Using Linux fork() and the copy-on-write mechanism, we implemented a simple event task farm that shares almost 80% of memory pages among event worker processes for certain types of reconstruction jobs, with negligible CPU overhead. By leaving the task of managing shared memory pages to the operating system, we have been able to parallelize large reconstruction and simulation applications, originally written to run in a single thread of execution, with little to no change to the application code. The process of validating AthenaMP for production took ten months of concentrated effort and is expected to continue for several more months. Besides validating the software itself, an important and time-consuming aspect of running multicore applications in production was configuring the ATLAS distributed production system to handle multicore jobs. This entailed defining multicore batch queues, where the unit resource is not a core but a whole computing node; monitoring the output of many event workers; and adapting the job definition layer to handle computing resources with different event throughputs. We will present scalability and memory usage studies, based on data gathered both on dedicated hardware and at the CERN Computer Center.
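
The fork-based event task farm described in the abstract can be illustrated with a minimal sketch. This is not AthenaMP code: `run_task_farm`, `process_event`, the calibration table, and the round-robin event assignment are all hypothetical names and choices made for illustration. The parent loads read-only data once, then forks workers that inherit it through copy-on-write pages, so the application code that processes an event is unchanged. Note that CPython's reference counting writes to object headers, so a Python process will share fewer pages than a C++ framework would; the sketch shows the control flow, not the memory savings reported above.

```python
import os
import pickle


def process_event(calibration, event):
    """Hypothetical single-threaded event code: calibrated sum of channel values."""
    return sum(calibration[channel] * value for channel, value in event)


def run_task_farm(calibration, events, n_workers):
    """Fork n_workers children; each handles a round-robin slice of the events.

    The calibration data is loaded before the fork, so the children read it
    from pages inherited from the parent instead of loading private copies.
    """
    children = []
    for rank in range(n_workers):
        read_fd, write_fd = os.pipe()
        pid = os.fork()
        if pid == 0:
            # Child (event worker): process its events, send results, exit.
            os.close(read_fd)
            status = 1
            try:
                results = [
                    (i, process_event(calibration, events[i]))
                    for i in range(rank, len(events), n_workers)
                ]
                with os.fdopen(write_fd, "wb") as out:
                    pickle.dump(results, out)
                status = 0
            finally:
                # Skip the parent's cleanup handlers in the child.
                os._exit(status)
        # Parent: keep only the read end of this worker's pipe.
        os.close(write_fd)
        children.append((pid, read_fd))

    # Parent: collect each worker's output and merge it in event order.
    merged = {}
    for pid, read_fd in children:
        with os.fdopen(read_fd, "rb") as src:
            merged.update(pickle.load(src))
        os.waitpid(pid, 0)
    return [merged[i] for i in range(len(events))]


if __name__ == "__main__":
    calibration = {0: 1.0, 1: 2.0}  # stands in for large read-only data
    events = [[(0, 1.0)], [(1, 1.0)], [(0, 2.0), (1, 0.5)]]
    print(run_task_farm(calibration, events, n_workers=2))
```

On Linux, the degree of page sharing between such workers can be inspected through the per-process entries in /proc/<pid>/smaps.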
Contributor : Sabine Starita
Submitted on : Wednesday, July 4, 2012 - 4:46:16 PM
Last modification on : Wednesday, September 16, 2020 - 4:26:43 PM






S. Binet, P. Calafiura, M. K. Jha, W. Lavrijsen, C. Leggett, et al. Multicore in production: advantages and limits of the multiprocess approach in the ATLAS experiment. 14th International Workshop on Advanced Computing and Analysis Techniques in Physics Research (ACAT 2011), Sep 2011, Uxbridge, United Kingdom. pp.012018, ⟨10.1088/1742-6596/368/1/012018⟩. ⟨in2p3-00714479⟩


