Data Mining on Desktop Grid Platforms

Abstract : Very large data volumes and high computation costs in data mining applications justify the use for them of Grid-level massive parallelism. The paper concerns Grid-oriented implementation of the DisDaMin (Distributed Data Mining) project, which proposes distributed knowledge discovery through parallelization of data mining tasks. DisDaMin solves data mining problems by using new distributed algorithms based on special clusterized data decomposition and asynchronous task processing, which match the Grid computing features. The DisDaMin algorithms are embedded inside the DG-ADAJ (Desktop-Grid Adaptative Application in Java) system, which is a middleware platform for Desktop Grid. It provides adaptive control of distributed applications written in Java for Grid or Desktop Grid. It allows an optimized distribution of applications on clusters of Java Virtual Machines, monitoring of application execution and dynamic on-line balancing of processing and communication. Simulations were performed to prove the efficiency of the proposed mechanisms. They were carried on using the French national project Grid'5000 (part of the CoreGrid project) and the DG-ADAJ.
Type de document :
Article dans une revue
Lecture notes in computer science, springer, 2008, Parallel Processing and Applied Mathematics, 4967, pp.912-921. 〈10.1007/978-3-540-68111-3_97〉
Liste complète des métadonnées

https://hal.inria.fr/hal-00845802
Contributeur : Richard Olejnik <>
Soumis le : mercredi 17 juillet 2013 - 18:05:53
Dernière modification le : jeudi 11 janvier 2018 - 06:24:24

Identifiants

Citation

Valerie Fiolet, Richard Olejnik, Eryk Laskowski, Łukasz Masko, Marek Tudruj, et al.. Data Mining on Desktop Grid Platforms. Lecture notes in computer science, springer, 2008, Parallel Processing and Applied Mathematics, 4967, pp.912-921. 〈10.1007/978-3-540-68111-3_97〉. 〈hal-00845802〉

Partager

Métriques

Consultations de la notice

218