Skip to Main content Skip to Navigation
Conference papers

Pre-Processing and Clustering Complex Data in E-Commerce Domain

Abstract : This paper presents our preprocessing and clustering method on a clickstream dataset issued from e-commerce domain. The main contributions of this article are double. First, after presenting the clickstream dataset, we show how we build a rich data warehouse based an advanced preprocessing method. We take into account the intersite aspects in the given e-commerce domain, which offers an interesting data structuration. A preliminary statistical analysis based on such complex data i.e. time period clickstreams is given, emphasing the importance of intersite user visits in such a context. Secondly, we describe our crossed-clustering method which is applied on data generated from our data warehouse. Our preliminary results are interesting and promising illustrating the benefits of our WUM methods, even if more investigations are needed on the same dataset.
Document type :
Conference papers
Complete list of metadata

Cited literature [2 references]  Display  Hide  Download

https://hal.inria.fr/inria-00000881
Contributor : Sergiu Chelcea <>
Submitted on : Wednesday, November 30, 2005 - 3:11:53 PM
Last modification on : Friday, May 25, 2018 - 12:02:04 PM
Long-term archiving on: : Monday, September 17, 2012 - 3:45:43 PM

Identifiers

  • HAL Id : inria-00000881, version 1

Collections

Citation

Sergiu Chelcea, Alzennyr da Silva, Yves Lechevallier, Doru Tanasa, Brigitte Trousse. Pre-Processing and Clustering Complex Data in E-Commerce Domain. Proceedings of the First International Workshop on Mining Complex Data 2005 (IEEE MCD'2005), held in conjunction with the Fifth IEEE International Conference on Data Mining (ICDM'05), Nov 2005, Houston, Texas. ⟨inria-00000881⟩

Share

Metrics

Record views

286

Files downloads

1247