L. Candillier, I. Tellier, and F. Torre, Transforming XML Trees for Efficient Classification and Clustering, INEX 2005 Workshop on Mining XML documents, 2005.

L. Candillier, I. Tellier, F. Torre, and O. Bousquet, SSC: Statistical Subspace Clustering, 4th International Conference on Machine Learning and Data Mining in Pattern Recognition (MLDM'2005), volume LNAI 3587 of LNCS, pp.100-109, 2005.
DOI : 10.1007/11510888_11

URL : https://hal.archives-ouvertes.fr/inria-00536697

G. Costa, G. Manco, R. Ortale, and A. Tagarelli, A Tree-Based Approach to Clustering XML Documents by Structure, PKDD, pp.137-148, 2004.
DOI : 10.1007/978-3-540-30116-5_15

T. Dalamagas, T. Cheng, K. Winkel, and T. K. Sellis, Clustering XML Documents Using Structural Summaries, EDBT Workshops, pp.547-556, 2004.
DOI : 10.1007/978-3-540-30192-9_54

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.329.2012

L. Denoyer, Apprentissage et inférence statistique dans les bases de documents structurés : Application aux corpus de documents textuels, 2004.

L. Denoyer and P. Gallinari, Categorization and Clustering of XML documents using Structure and Content Information, INEX 2005 Preproceedings, 2005.

L. Denoyer, J. Vittaut, P. Gallinari, S. Brunesseaux, and S. Brunesseaux, Structured multimedia document classification, Proceedings of the 2003 ACM symposium on Document engineering, DocEng '03, pp.153-160, 2003.
DOI : 10.1145/958220.958249

URL : https://hal.archives-ouvertes.fr/hal-01357593

A. Doucet and H. Ahonen-Myka, Naïve Clustering of a large XML Document Collection, INEX Workshop, pp.81-87, 2002.

S. Flesca, G. Manco, E. Masciari, L. Pontieri, and A. Pugliese, Detecting Structural Similarities between XML Documents, WebDB, pp.55-60, 2002.

F. D. Francesca, G. Gordano, R. Ortale, and A. Tagarelli, Distance-based Clustering of XML Documents, MGTS-2003 : Proceedings of the First International Workshop on Mining Graphs, Trees and Sequences ECML/PKDD'03 workshop proceedings, pp.75-78, 2003.

D. Guillaume and F. Murtagh, Clustering of XML documents, Computer Physics Communications, vol.127, issue.2-3, pp.215-227, 2000.
DOI : 10.1016/S0010-4655(99)00511-1

L. Hubert and P. Arabie, Comparing partitions, Journal of Classification, vol.2, issue.1, pp.193-218, 1985.
DOI : 10.1007/BF01908075

Y. Jianwu and C. Xiaoou, A semi-structured document model for text mining, J. Comput. Sci. Technol., vol.17, issue.5, pp.603-610, 2002.

B. Larsen and C. Aone, Fast and effective text mining using linear-time document clustering, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '99, pp.16-22, 1999.
DOI : 10.1145/312129.312186

W. Lian, D. W. Cheung, N. Mamoulis, and S. Yiu, An efficient and scalable algorithm for clustering XML documents by structure, IEEE Transactions on Knowledge and Data Engineering, vol.16, issue.1, pp.82-96, 2004.
DOI : 10.1109/TKDE.2004.1264824

J. Liu, J. T. Wang, W. Hsu, and K. G. Herbert, XML Clustering by Principal Component Analysis, ICTAI, pp.658-662, 2004.

R. Nayak and S. Xu, XML documents clustering by structures with XCLS, INEX 2005 Workshop on Mining XML documents, 2005.

A. Nierman and H. V. Jagadish, Evaluating Structural Similarity in XML Documents, WebDB, pp.61-66, 2002.

M. F. Porter, An algorithm for suffix stripping, Readings in information retrieval, pp.313-316, 1997.

A. Termier, M. Rousset, and M. Sebag, TreeFinder: a first step towards XML data mining, Proceedings of the 2002 IEEE International Conference on Data Mining, p.450, 2002.
DOI : 10.1109/ICDM.2002.1183987

F. Trentini, M. Hagenbuchner, A. Sperduti, A. Tsoi, F. Scarselli et al., Clustering XML Documents using Self-Organizing Maps for Structures, INEX 2005 Workshop on Mining XML documents, 2005.

A. Vercoustre, M. Fegas, Y. Lechevallier, and T. Despeyroux, Classification de documents XML à partir d'une représentation linéaire des arbres de ces documents, Actes des 6èmes journées Extraction et Gestion des Connaissances, Revue des Nouvelles Technologies de l'Information (RNTI-E-6), pp.433-444, 2006.

J. Yi and N. Sundaresan, A classifier for semi-structured documents, Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '00, pp.340-344, 2000.
DOI : 10.1145/347090.347164

J. P. Yoon, V. Raghavan, V. Chakilam, and L. Kerschberg, BitCube: a three-dimensional bitmap indexing for XML documents, Proceedings Thirteenth International Conference on Scientific and Statistical Database Management. SSDBM 2001, pp.241-254, 2001.
DOI : 10.1109/SSDM.2001.938548

M. J. Zaki and C. C. Aggarwal, XRules, Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '03, pp.316-325, 2003.
DOI : 10.1145/956750.956787

Y. Zhao and G. Karypis, Criterion functions for document clustering: Experiments and analysis, 2001.