Web Usage Mining: Sequential Pattern Extraction with a Very Low Support

Florent Masseglia 1 Doru Tanasa 1 Brigitte Trousse 1
1 AxIS - Usage-centered design, analysis and improvement of information systems
CRISAM - Inria Sophia Antipolis - Méditerranée , Inria Paris-Rocquencourt
Abstract : The goal of this work is to increase the relevance and the interestingness of patterns discovered by a Web Usage Mining process. Indeed, the sequential patterns extracted on web log files, unless they are found under constraints, often lack interest because of their obvious content. Our goal is to discover minority users' behaviors having a coherence which we want to be aware of (like hacking activities on the Web site or a users' activity limited to a specific part of the Web site). By means of a clustering method on the extracted sequential patterns, we propose a recursive division of the problem. The developed clustering method is based on patterns summaries and neural networks. Our experiments show that we obtain the targeted patterns whereas their extraction by means of a classical process is impossible because of a very weak support (down to 0.006%). The diversity of users' behaviors is so large that the minority ones are both numerous and difficult to locate
Type de document :
Communication dans un congrès
Jeffrey Xu Yu and Xuemin Lin and Hongjun Lu and Yanchun Zhang. Advanced Web Technologies and Applications: 6th Asia-Pacific Web Conference, APWeb 2004, Apr 2004, Hangzhou, China. Springer Verlag, 3007, pp.513--522, 2004, Lecture Notes in Computer Science. 〈10.1007/978-3-540-24655-8_56〉
Liste complète des métadonnées

https://hal.inria.fr/hal-00950768
Contributeur : Brigitte Trousse <>
Soumis le : samedi 22 février 2014 - 16:50:34
Dernière modification le : mercredi 21 novembre 2018 - 19:48:06

Identifiants

Collections

Citation

Florent Masseglia, Doru Tanasa, Brigitte Trousse. Web Usage Mining: Sequential Pattern Extraction with a Very Low Support. Jeffrey Xu Yu and Xuemin Lin and Hongjun Lu and Yanchun Zhang. Advanced Web Technologies and Applications: 6th Asia-Pacific Web Conference, APWeb 2004, Apr 2004, Hangzhou, China. Springer Verlag, 3007, pp.513--522, 2004, Lecture Notes in Computer Science. 〈10.1007/978-3-540-24655-8_56〉. 〈hal-00950768〉

Partager

Métriques

Consultations de la notice

177