Reference : LBG-SQUARE - Fault Tolerant, Locality-Aware Co-allocation in P2P Grids
Scientific congresses and symposiums : Paper published in a book
Engineering, computing & technology : Computer science
LBG-SQUARE - Fault Tolerant, Locality-Aware Co-allocation in P2P Grids
Dethier, Gérard mailto [Université de Liège - ULg > Département de chimie appliquée > Génie chimique - Opérations physiques unitaires >]
Briquet, Cyril mailto [Université de Liège - ULg > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Informatique (ingénierie du logiciel et algorithmique) >]
Marchot, Pierre mailto [Université de Liège - ULg > Département de chimie appliquée > Génie chimique - Systèmes polyphasiques >]
de Marneffe, Pierre-Arnoul mailto [Université de Liège - ULg > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Informatique (ingénierie du logiciel et algorithmique) >]
Ninth International Conference On Parallel And Distributed Computing, Applications And Technologies : PDCAT 2008
Huang, Zhiyi
Xu, Zhiwei
Rountree, Nathan
Lefevre, Laurent
Shen, Hong
Hine, John
Pan, Yi
IEEE Computer Society
Los Alamitos
Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT'08)
1–4 of December, 2008
University of Otago
New Zealand
[en] Distributed Computing ; Fault Tolerance ; Peer-to-Peer (P2P) Grid ; Grid Computing
[en] In this paper, the deployment and execution of Iterative Stencil applications on a P2P Grid middleware are investigated. So-called Iterative Stencil applications are composed of sets of heavily-communicating, long-running Tasks. They thus require co-allocation of multiple reliable resources for extended periods of

P2P Grids are totally decentralized and provide on-demand, transparent access to edge resources, e.g. Internet-connected, non-dedicated desktop computers. A P2P Grid has the potential to provide access to a large number of resources at the fraction of the cost of a dedicated cluster. However, edge resources are heterogeneous in performance and intrinsically unreliable: Task execution failures are common due to resource preemption or resource failure. Furthermore, P2P Grid schedulers usually target sets of independent computational Tasks, i.e. so-called Bags of Tasks applications. It is therefore not trivial to deploy and run an Iterative Stencil application on a P2P Grid.

Checkpointing is a common fault-tolerance mechanism in High Performance Distributed Computing, often based on a centralized architecture. Locality-aware co-allocation in P2P Grids has been recently investigated. Checkpointing and locality-aware co-allocation yet have to be integrated in P2P Grids.

We propose to provide co-allocation through an existing middleware-level Bag of Tasks scheduling mechanism. We also introduce a layer of fault-tolerance for the Iterative Stencils that relies on a scalable, application-level, P2P checkpointing mechanism. Finally, LBG-SQUARE is described. This software results from the combination of a specific Iterative Stencil application (a Computational Fluid Dynamics simulation software called LaBoGrid) with a P2P Grid middleware (Lightweight Bartering Grid).
Communauté française de Belgique - CfB
©2009 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

File(s) associated to this reference

Fulltext file(s):

Open access
dethier-pdcat08-postprint.pdfPublisher postprint377.51 kBView/Open

Bookmark and Share SFX Query

All documents in ORBi are protected by a user license.