Conference Poster, Year: 2012

MPI support in the DIRAC Pilot Job Workload Management System

Abstract

Parallel job execution on the grid using MPI presents a number of challenges for the sites that provide this support. Multiple flavors of MPI libraries, shared working directories required by certain applications, and special batch system settings make MPI support difficult for site managers. Meanwhile, workload management systems based on pilot jobs have become ubiquitous, yet pilot frameworks have not supported MPI applications. This support was recently added to the DIRAC Project in the context of the GISELA Latin American Grid. Special services for dynamic allocation of virtual computer pools on grid sites were developed in order to deploy MPI rings that match the requirements of the jobs in the central task queue of the DIRAC Workload Management System. The required MPI software is installed automatically by the pilot agents using user-space file system techniques. The same technique is used to emulate shared working directories for the parallel MPI processes. This makes it possible to execute MPI jobs even on sites that do not officially support them. Reusing MPI rings constructed in this way for a series of parallel jobs dramatically improves their efficiency and turnaround.
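The ring-reuse idea in the abstract can be illustrated with a small, purely hypothetical sketch. It does not use or reproduce any DIRAC API: the names `MPIJob`, `MPIRing`, `find_reusable_ring` and `schedule` are invented for illustration, and the matching rule (same MPI flavor, enough slots, smallest fitting ring first) is an assumption rather than the policy described by the authors. Jobs are also assumed to run one after another, which is a simplification.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class MPIJob:
    """Hypothetical queued job: how many MPI processes it needs and which flavor."""
    job_id: int
    processors: int
    flavor: str  # e.g. "openmpi" or "mpich2"


@dataclass
class MPIRing:
    """Hypothetical deployed ring: a pool of slots running one MPI flavor."""
    ring_id: int
    size: int
    flavor: str
    history: List[int] = field(default_factory=list)  # job ids run on this ring


def find_reusable_ring(job: MPIJob, rings: List[MPIRing]) -> Optional[MPIRing]:
    """Return the smallest existing ring that satisfies the job, or None."""
    candidates = [
        r for r in rings
        if r.flavor == job.flavor and r.size >= job.processors
    ]
    return min(candidates, key=lambda r: r.size, default=None)


def schedule(queue: List[MPIJob], rings: List[MPIRing]) -> Dict[int, int]:
    """Map each job id to a ring id, deploying a new ring only when none fits.

    Assumption: jobs run sequentially, so every ring is idle again
    by the time the next job is considered.
    """
    assignment: Dict[int, int] = {}
    for job in queue:
        ring = find_reusable_ring(job, rings)
        if ring is None:
            # No suitable ring exists yet: deploy one sized for this job.
            ring = MPIRing(ring_id=len(rings) + 1,
                           size=job.processors,
                           flavor=job.flavor)
            rings.append(ring)
        ring.history.append(job.job_id)
        assignment[job.job_id] = ring.ring_id
    return assignment


queue = [
    MPIJob(1, 4, "openmpi"),
    MPIJob(2, 4, "openmpi"),
    MPIJob(3, 8, "openmpi"),
    MPIJob(4, 2, "mpich2"),
]
rings: List[MPIRing] = []
assignment = schedule(queue, rings)
```

In this toy run, job 2 reuses the ring deployed for job 1, so four jobs need only three ring deployments. The saving grows with the number of similar jobs in the queue, which is the effect the abstract attributes to ring reuse.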

Dates and versions

in2p3-00703341, version 1 (01-06-2012)

Identifiers

  • HAL Id: in2p3-00703341, version 1

Cite

A. Tsaregorodtsev, V. Hamar. MPI support in the DIRAC Pilot Job Workload Management System. MPI Support in the DIRAC Pilot Job Workload Management System, May 2012, New York, United States. ⟨in2p3-00703341⟩