Mining Rare Association Rules

Laszlo Szathmary 1 Sandy Maumus 1 Amedeo Napoli
1 ORPAILLEUR - Knowledge representation, reasonning
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : In this paper, we address the problem of generating relevant rare association rules. In the literature, this problem has not yet been studied in detail, although rare association rules can also contain important information just as frequent association rules do. Our work is motivated by the long-standing open question of devising an efficient algorithm for finding rules with low support and very high confidence. In order to find such rules using conventional frequent itemset mining algorithms like Apriori, the minimum support must be set very low, which drastically increases the runtime of the algorithm. Moreover, when minimum support is set very low, Apriori produces a huge number of frequent itemsets. This is also known as the "rare item problem". For this long-existing problem we propose a solution. A particularly relevant field for rare itemsets and rare association rules is medical diagnosis. For example it may be that in a large group of patients diagnosed with the same sickness, a few patients exhibit unusual symptoms. It is important for the doctor to take this fact into consideration.
Document type :
Reports
Complete list of metadatas

Cited literature [24 references]  Display  Hide  Download

https://hal.inria.fr/inria-00102909
Contributor : Laszlo Szathmary <>
Submitted on : Friday, October 6, 2006 - 11:25:13 PM
Last modification on : Thursday, January 11, 2018 - 6:19:53 AM
Long-term archiving on : Friday, November 25, 2016 - 12:11:27 PM

File

 Restricted access
To satisfy the distribution rights of the publisher, the document is embargoed until : jamais

Please log in to resquest access to the document

Identifiers

  • HAL Id : inria-00102909, version 1

Collections

Citation

Laszlo Szathmary, Sandy Maumus, Amedeo Napoli. Mining Rare Association Rules. [Research Report] 2006, pp.19. ⟨inria-00102909⟩

Share

Metrics

Record views

283