A New Probabilistic Measure of Interestingness for Association Rules, Based on the Likelihood of the Link

Israël-César Lerman 1 Jérôme Azé 2
1 SYMBIOSE - Biological systems and models, bioinformatics and sequences
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract: The interestingness measures for pattern associations proposed in the data mining literature depend only on the relative frequencies observed in 2×2 contingency tables. They can be called “absolute measures”. The underlying scale of such a measure makes statistical decisions difficult. In this paper we present the foundations and the construction of a probabilistic interestingness measure that we call the likelihood of the link index. This index makes it possible to capture surprising association rules. Indeed, its underlying principle can be related to that of information theory, but at a relational level. Two facets are developed for this index: symmetrical and asymmetrical. Two stages are needed to build it. The first is “local” and concerns the two boolean attributes to be compared. The second is a discriminant extension of the resulting probabilistic index, which measures an association rule in the context of a relevant set of association rules. Our construction is situated within the framework of the indices proposed in the data mining literature, and new measures have been derived from it. Finally, we designed experiments to assess the relevance of our statistical approach, which had previously been validated theoretically.
Document type:
Book chapter
Guillet, F. and Hamilton, H. Quality Measures in Data Mining. Studies in Computational Intelligence, Springer, pp.207-236, 2007

https://hal.inria.fr/inria-00180117
Contributor: Israel-César Lerman
Submitted on: Wednesday, October 17, 2007 - 16:55:41
Last modified on: Wednesday, May 16, 2018 - 11:23:05

Identifiers

  • HAL Id: inria-00180117, version 1

Citation

Israël-César Lerman, Jérôme Azé. A New Probabilistic Measure of Interestingness for Association Rules, Based on the Likelihood of the Link. Guillet, F. and Hamilton, H. Quality Measures in Data Mining. Studies in Computational Intelligence, Springer, pp.207-236, 2007. 〈inria-00180117〉
