Mining Data Streams for Frequent Sequences Extraction - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2005

Mining Data Streams for Frequent Sequences Extraction

Résumé

In recent years, emerging applications introduced new constraints for data mining methods. These constraints are particularly linked to new kinds of data that can be considered as complex data. One typical kind of such data is known as data streams. In a data stream processing, memory usage is restricted, new elements are generated continuously and have to be considered as fast as possible, no blocking operator can be performed and the data can be examined only once. At this time and to the best of our knowledge, no method has been proposed for mining sequential patterns in data streams. We argue that the main reason is the combinatory phenomenon related to sequential pattern mining. In this paper, we propose an algorithm based on sequences alignment for mining approximate sequential patterns in Web usage data streams. To meet the constraint of one scan, a greedy clustering algorithm associated to an alignment method are proposed. We will show that our proposal is able to extract relevant sequences with very low thresholds.
Fichier non déposé

Dates et versions

inria-00461876 , version 1 (06-03-2010)

Identifiants

  • HAL Id : inria-00461876 , version 1

Citer

Alice Marascu, Florent Masseglia. Mining Data Streams for Frequent Sequences Extraction. IEEE first Workshop on Mining Complex Data (MCD'05). Held in conjunction with ICDM'05, Nov 2005, Houston, United States. ⟨inria-00461876⟩
101 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More