Paper published in a book (Scientific congresses and symposiums)
LBG-SQUARE - Fault Tolerant, Locality-Aware Co-allocation in P2P Grids
Dethier, Gérard; Briquet, Cyril; Marchot, Pierre et al.
2008. In Huang, Zhiyi; Xu, Zhiwei; Rountree, Nathan et al. (Eds.) Ninth International Conference On Parallel And Distributed Computing, Applications And Technologies : PDCAT 2008
Peer reviewed
 

Files


Full Text
dethier-pdcat08-postprint.pdf
Publisher postprint (386.57 kB)
Details



Keywords :
Distributed Computing; Fault Tolerance; Peer-to-Peer (P2P) Grid; Grid Computing
Abstract :
In this paper, the deployment and execution of Iterative Stencil applications on a P2P Grid middleware are investigated. So-called Iterative Stencil applications are composed of sets of heavily-communicating, long-running Tasks. They thus require co-allocation of multiple reliable resources for extended periods of time. P2P Grids are totally decentralized and provide on-demand, transparent access to edge resources, e.g. Internet-connected, non-dedicated desktop computers. A P2P Grid has the potential to provide access to a large number of resources at a fraction of the cost of a dedicated cluster. However, edge resources are heterogeneous in performance and intrinsically unreliable: Task execution failures are common due to resource preemption or resource failure. Furthermore, P2P Grid schedulers usually target sets of independent computational Tasks, i.e. so-called Bags of Tasks applications. It is therefore not trivial to deploy and run an Iterative Stencil application on a P2P Grid. Checkpointing is a common fault-tolerance mechanism in High Performance Distributed Computing, often based on a centralized architecture. Locality-aware co-allocation in P2P Grids has recently been investigated. Checkpointing and locality-aware co-allocation have yet to be integrated in P2P Grids. We propose to provide co-allocation through an existing middleware-level Bag of Tasks scheduling mechanism. We also introduce a layer of fault-tolerance for the Iterative Stencils that relies on a scalable, application-level, P2P checkpointing mechanism. Finally, LBG-SQUARE is described. This software results from the combination of a specific Iterative Stencil application (a Computational Fluid Dynamics simulation software called LaBoGrid) with a P2P Grid middleware (Lightweight Bartering Grid).
Disciplines :
Computer science
Author, co-author :
Dethier, Gérard ;  Université de Liège - ULiège > Département de chimie appliquée > Génie chimique - Opérations physiques unitaires
Briquet, Cyril ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Informatique (ingénierie du logiciel et algorithmique)
Marchot, Pierre ;  Université de Liège - ULiège > Département de chimie appliquée > Génie chimique - Systèmes polyphasiques
de Marneffe, Pierre-Arnoul ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Informatique (ingénierie du logiciel et algorithmique)
Language :
English
Title :
LBG-SQUARE - Fault Tolerant, Locality-Aware Co-allocation in P2P Grids
Publication date :
December 2008
Event name :
Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT'08)
Event organizer :
University of Otago
Event place :
Dunedin, New Zealand
Event date :
1–4 December 2008
Audience :
International
Main work title :
Ninth International Conference On Parallel And Distributed Computing, Applications And Technologies : PDCAT 2008
Editor :
Huang, Zhiyi
Xu, Zhiwei
Rountree, Nathan
Lefevre, Laurent
Shen, Hong
Hine, John
Pan, Yi
Publisher :
IEEE Computer Society, Los Alamitos, United States - California
ISBN/EAN :
0-7695-3443-5
Pages :
252-258
Peer reviewed :
Peer reviewed
Funders :
FWB - Fédération Wallonie-Bruxelles [BE]
Commentary :
©2009 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
Available on ORBi :
since 10 January 2009

