R. Celeux, G. , E. Diday, G. Govaert, Y. Lechevallier et al., Classification Automatique des Données, 1989.

G. Costa, G. Manco, R. Ortale, and E. A. Tagarelli, A Tree-Based Approach to Clustering XML Documents by Structure, PKDD, pp.137-148, 2004.
DOI : 10.1007/978-3-540-30116-5_15

T. Dalamagas, T. Cheng, K. Winkel, and T. K. Sellis, Clustering XML Documents Using Structural Summaries, EDBT Workshops, pp.547-556, 2004.
DOI : 10.1007/978-3-540-30192-9_54

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.329.2012

L. Denoyer, Apprentissage et inférence statistique dans les bases de documents structurés : Application aux corpus de documents textuels, 2004.

L. Denoyer, J. Vittaut, P. Gallinari, S. Brunesseaux, and E. S. Brunesseaux, Structured multimedia document classification, Proceedings of the 2003 ACM symposium on Document engineering , DocEng '03, 2003.
DOI : 10.1145/958220.958249

URL : https://hal.archives-ouvertes.fr/hal-01357593

T. Despeyroux, Y. Lechevallier, B. Trousse, and A. Vercoustre, Experiments in clustering homogeneous xml documents to validate an existing typology, Proceedings of the 5th International Conference on Knowledge Management (I-Know), 2005.
URL : https://hal.archives-ouvertes.fr/inria-00000002

A. Doucet and H. Ahonen-myka, Naïve Clustering of a large XML Document Collection, INEX Workshop, pp.81-87, 2002.

F. D. Francesca, G. Gordano, R. Ortale, and E. A. Tagarelli, Distance-based Clustering of XML Documents, MGTS-2003 : Proceedings of the First International Workshop on Mining Graphs, Trees and Sequences ECML/PKDD'03 workshop proceedings, pp.75-78, 2003.

L. Hubert, P. Et, and . Arabie, Comparing partitions, Journal of Classification, vol.78, issue.1, pp.193-218, 1985.
DOI : 10.1007/BF01908075

Y. Jianwu, C. Et, and . Xiaoou, A semi-structured document model for text mining, J. Comput. Sci. Technol, vol.17, issue.5, pp.603-610, 2002.

B. Larsen, C. Et, and . Aone, Fast and effective text mining using linear-time document clustering, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '99, pp.16-22, 1999.
DOI : 10.1145/312129.312186

J. Liu, J. T. Wang, W. Hsu, and K. G. Herbert, XML Clustering by Principal Component Analysis, ICTAI, pp.658-662, 2004.

A. Nierman and H. V. Jagadish, Evaluating Structural Similarity in XML Documents, Proceedings of the Fifth International Workshop on the Web and Databases, 2002.

M. F. Porter, An algorithm for suffix stripping, Readings in information retrieval, pp.313-316, 1997.
DOI : 10.1108/eb046814

A. Termier, M. Rousset, and E. M. Sebag, TreeFinder: a first step towards XML data mining, 2002 IEEE International Conference on Data Mining, 2002. Proceedings., p.450, 2002.
DOI : 10.1109/ICDM.2002.1183987

R. Verde, F. A. De-carvalho, and E. Y. Lechevallier, A Dynamical Clustering Algorithm for Symbolic Objects, Tutorial on Symbolic Data Analysis, GfK1 Conference, 2001.

J. Yi, N. Et, and . Sundaresan, A classifier for semi-structured documents, Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '00, pp.340-344, 2000.
DOI : 10.1145/347090.347164

J. P. Yoon, V. Raghavan, V. Chakilam, and E. L. Kerschberg, BitCube: a three-dimensional bitmap indexing for XML documents, Proceedings Thirteenth International Conference on Scientific and Statistical Database Management. SSDBM 2001, pp.241-254, 2001.
DOI : 10.1109/SSDM.2001.938548

Y. Zhao, G. Et, and . Karypis, Criterion functions for document clustering : Experiments and analysis, 2001.