Time Aware Mining of Itemsets

Bashar Saleh 1 Florent Masseglia 1
1 AxIS - Usage-centered design, analysis and improvement of information systems
CRISAM - Inria Sophia Antipolis - Méditerranée , Inria Paris-Rocquencourt
Abstract : Frequent behavioural pattern mining is a very important topic of knowledge discovery, intended to extract correlations between items recorded in large databases or Web acces logs. However, those databases are usually considered as a whole and hence, itemsets are extracted over the entire set of records. Our claim is that possible periods, hidden within the structure of the data and containing compact itemsets, may exist. These periods, as well as the itemsets they contain, might not be found by traditional data mining methods due to their very weak support. Furthermore, these periods might be lost depending on an arbitrary division of the data. The goal of our work is to find itemsets that are frequent over a specific period but would not be extracted by traditional methods since their support is very low over the whole dataset. In this paper, we introduce the definition of solid itemsets, which represent a coherent and compact behavior over a specific period, and we propose SIM, an algorithm for their extraction. This work may find many applications in sensitive domains such as fraud or intrusion detection.
Document type :
Conference papers
Complete list of metadatas

Cited literature [9 references]  Display  Hide  Download

https://hal.inria.fr/inria-00359182
Contributor : Florent Masseglia <>
Submitted on : Friday, February 6, 2009 - 11:16:29 AM
Last modification on : Saturday, February 23, 2019 - 7:06:02 PM
Long-term archiving on : Tuesday, June 8, 2010 - 9:57:34 PM

File

time08.pdf
Files produced by the author(s)

Identifiers

Collections

Citation

Bashar Saleh, Florent Masseglia. Time Aware Mining of Itemsets. TIME, Jun 2008, Montreal, Canada. pp.93-97, ⟨10.1109/TIME.2008.12⟩. ⟨inria-00359182⟩

Share

Metrics

Record views

180

Files downloads

242