Web Usage Mining: Sequential Pattern Extraction with a Very Low Support - Inria - Institut national de recherche en sciences et technologies du numérique Access content directly
Conference Papers Year : 2004

Web Usage Mining: Sequential Pattern Extraction with a Very Low Support

Abstract

The goal of this work is to increase the relevance and the interestingness of patterns discovered by a Web Usage Mining process. Indeed, the sequential patterns extracted on web log files, unless they are found under constraints, often lack interest because of their obvious content. Our goal is to discover minority users' behaviors having a coherence which we want to be aware of (like hacking activities on the Web site or a users' activity limited to a specific part of the Web site). By means of a clustering method on the extracted sequential patterns, we propose a recursive division of the problem. The developed clustering method is based on patterns summaries and neural networks. Our experiments show that we obtain the targeted patterns whereas their extraction by means of a classical process is impossible because of a very weak support (down to 0.006%). The diversity of users' behaviors is so large that the minority ones are both numerous and difficult to locate

Dates and versions

hal-00950768 , version 1 (22-02-2014)

Identifiers

Cite

Florent Masseglia, Doru Tanasa, Brigitte Trousse. Web Usage Mining: Sequential Pattern Extraction with a Very Low Support. Advanced Web Technologies and Applications: 6th Asia-Pacific Web Conference, APWeb 2004, Apr 2004, Hangzhou, China. pp.513--522, ⟨10.1007/978-3-540-24655-8_56⟩. ⟨hal-00950768⟩
118 View
0 Download

Altmetric

Share

Gmail Facebook X LinkedIn More