s'authentifier
version française rss feed
HAL : in2p3-00457039, version 1

Fiche détaillée  Récupérer au format
SuperComputing 2004 (SC2004), Pittsburgh : United States (2004)
RPC-V: Toward Fault-Tolerant RPC for Internet Connected Desktop Grids with Volatile Nodes
S. Djilali1, T. Herault1, O. Lodygensky1, 2, T. Morlier1, G. Fedak1, F. Cappello1
(2004)

RPC is one of the programming models envisioned for the Grid. In Internet connected Large Scale Grids such as Desktop Grids, nodes and networks failures are not rare events. This paper provides several contributions, examining the feasibility and limits of fault-tolerant RPC on these platforms. First, we characterize these Grids from their fundamental features and demonstrate that their applications scope should be safely restricted to stateless services. Second, we present a new fault-tolerant RPC protocol associating an original combination of three-tier architecture, passive replication and message logging. We describe RPC-V, an implementation of the proposed protocol within the XtremWeb Desktop Grid middleware. Third, we evaluate the performance of RPC-V and the impact of faults on the execution time, using a real life application on a Desktop Grid testbed assembling nodes in France and USA. We demonstrate that RPC-V allows the applications to continue their execution while key system components fail.
1 :  LRI - Laboratoire de Recherche en Informatique
2 :  LAL - Laboratoire de l'Accélérateur Linéaire
Informatique/Calcul parallèle, distribué et partagé

Informatique/Performance et fiabilité
Liste des fichiers attachés à ce document : 
PDF
RPCV-SC-2004.pdf(339.2 KB)