Skip to Main content Skip to Navigation
Conference papers

Web Usage Mining: Extracting Unexpected Periods from Web Logs

Florent Masseglia 1 Pascal Poncelet 2 Maguelonne Teisseire 2 Alice Marascu 1
1 AxIS - Usage-centered design, analysis and improvement of information systems
CRISAM - Inria Sophia Antipolis - Méditerranée , Inria Paris-Rocquencourt
2 TATOO - Fouille de données environnementales
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : Existing Web Usage Mining techniques are currently based on an arbitrary division of the data (e.g. “one log per month”) or guided by presumed results (e.g “what is the customers behaviour for the period of Christmas purchases?”). Those approaches have two main drawbacks. First, they depend on this arbitrary organization of the data. Second, they cannot automatically extract “seasons peaks” among the stored data. In this paper, we propose to perform a specific data mining process (and particularly to extract frequent behaviours) in order to automatically discover the densest periods. Our method extracts, among the whole set of possible combinations, the frequent sequential patterns related to the extracted periods. A period will be considered as dense if it contains at least one frequent sequential pattern for the set of users connected to the Web site in that period. Our experiments show that the extracted periods are relevant and our approach is able to extract both frequent sequential patterns and the associated dense periods.
Document type :
Conference papers
Complete list of metadata
Contributor : Alice-Maria Marascu Connect in order to contact the contributor
Submitted on : Saturday, March 6, 2010 - 3:43:19 PM
Last modification on : Friday, January 21, 2022 - 3:08:47 AM


  • HAL Id : inria-00461877, version 1



Florent Masseglia, Pascal Poncelet, Maguelonne Teisseire, Alice Marascu. Web Usage Mining: Extracting Unexpected Periods from Web Logs. IEEE 2nd Workshop on Temporal Data Mining (TDM'05). Held in conjunction with ICDM'05, Nov 2005, Houston, United States. ⟨inria-00461877⟩



Les métriques sont temporairement indisponibles