Grid-based Approaches for Distributed Data Mining Applications

Abstract : The data mining field is an important source of large scale applications and datasets which are getting more and more common. In this paper, we present performance evaluations of basic large scale data mining applications on an experimental grid environment. We test the scalability of a new clustering algorithm and a grid-based frequent itemsets generation using a grid workflow. We also compare the performance analysis of the workflow execution to simulated models and give measurements of the overhead related to the workflow engine and the underlying grid environment.
Type de document :
Communication dans un congrès
DCABES'07, the sixth International Conference on Distributed Computing and Applications for Business, Engineering and Science, Aug 2007, Wuhan, China. 2007
Liste complète des métadonnées

https://hal.inria.fr/hal-00690498
Contributeur : Ist Rennes <>
Soumis le : lundi 23 avril 2012 - 16:56:58
Dernière modification le : lundi 20 juin 2016 - 14:10:32

Identifiants

  • HAL Id : hal-00690498, version 1

Collections

Citation

Lamine M. Aouad, Nhien An Le Khac, Tahar Kechadi. Grid-based Approaches for Distributed Data Mining Applications. DCABES'07, the sixth International Conference on Distributed Computing and Applications for Business, Engineering and Science, Aug 2007, Wuhan, China. 2007. 〈hal-00690498〉

Partager

Métriques

Consultations de la notice

31