159 articles – 2000 Notices  [english version]
HAL : inria-00597576, version 1

Voir la fiche détaillée  BibTeX,EndNote,...
Versions disponibles
Spherical harmonic transform on heterogeneous architectures using hybrid programming
Mikolaj Szydlarski1, Pierre Esterie2, Joel Falcou2, Laura Grigori1, R. Stompor3

Spherical Harmonic Transforms (SHT) are at the heart of many scientific and practical ap- plications ranging from climate modeling to cosmological observations. In many of these areas a new wave of exciting, cutting-edge science goals have been recently proposed calling for simulations and analyses of actual experimental or observational data at very high resolutions, accompanied by producing or processing unprecedented volumes of the data. Both these aspects pose formidable challenge for the currently existing implementations of the transforms. This paper describes a multi CPU-GPUs implementation of an inverse SHT, based on hybrid program- ming combining MPI and CUDA, and discusses its tests as motivated by these forthcoming applications. We present performance comparisons of the multi GPU version and a hybrid, MPI/OpenMP version of the same transform. We find that one NVIDIA Tesla S1070 can accelerate overall execution time of the SHT by as much as 3 times with respect to the MPI/OpenMP version executed on one quad-core processor (Intel Nehalem 2.93 GHz) and, owing to very good scalability of both versions, 128 Tesla cards perform as good as 256 twelve-core processor (AMD Opteron 2.1 GHz).
1 :  INRIA Saclay - Ile de France - GRAND-LARGE
2 :  LRI - Laboratoire de Recherche en Informatique
3 :  APC - UMR 7164 - AstroParticule et Cosmologie
Informatique/Calcul parallèle, distribué et partagé

Physique/Astrophysique/Cosmologie et astrophysique extra-galactique

Planète et Univers/Astrophysique/Cosmologie et astrophysique extra-galactique

Physique/Physique/Physique Atmosphérique et Océanique


Planète et Univers/Sciences de la Terre/Géophysique

Sciences de l'environnement/Milieux et Changements globaux
Spherical Harmonic Transforms – hybrid architectures – hybrid programming – OpenMP – CUDA – Multi-GPU – CMB
Liste des fichiers attachés à ce document :
RR-7635.pdf(572.1 KB)
RR-7635.ps(5.3 MB)