Hal will be stopped for maintenance from friday on june 10 at 4pm until monday june 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

Pre-Processing and Clustering Complex Data in E-Commerce Domain

Abstract : This paper presents our preprocessing and clustering method on a clickstream dataset issued from e-commerce domain. The main contributions of this article are double. First, after presenting the clickstream dataset, we show how we build a rich data warehouse based an advanced preprocessing method. We take into account the intersite aspects in the given e-commerce domain, which offers an interesting data structuration. A preliminary statistical analysis based on such complex data i.e. time period clickstreams is given, emphasing the importance of intersite user visits in such a context. Secondly, we describe our crossed-clustering method which is applied on data generated from our data warehouse. Our preliminary results are interesting and promising illustrating the benefits of our WUM methods, even if more investigations are needed on the same dataset.
Document type :
Conference papers
Complete list of metadata

Cited literature [2 references]  Display  Hide  Download

Contributor : Sergiu Chelcea Connect in order to contact the contributor
Submitted on : Wednesday, November 30, 2005 - 3:11:53 PM
Last modification on : Friday, February 4, 2022 - 3:13:44 AM
Long-term archiving on: : Monday, September 17, 2012 - 3:45:43 PM


  • HAL Id : inria-00000881, version 1



Sergiu Chelcea, Alzennyr da Silva, Yves Lechevallier, Doru Tanasa, Brigitte Trousse. Pre-Processing and Clustering Complex Data in E-Commerce Domain. Proceedings of the First International Workshop on Mining Complex Data 2005 (IEEE MCD'2005), held in conjunction with the Fifth IEEE International Conference on Data Mining (ICDM'05), Nov 2005, Houston, Texas. ⟨inria-00000881⟩



Record views


Files downloads