HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

How Far Association Rules and Statistical Indices help Structure Terminology?

Hacène Cherfi 1 Yannick Toussaint 1
1 ORPAILLEUR - Knowledge representation, reasonning
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : Automatic or semi-automatic structuring of terminology extracted from large corpora still remain a bottleneck issue for managing the fast growing textual sources. This paper aims at defining a methodology to tackle this point using a text mining process for association rules extraction. We show the ability of the rules to enhance the quality of the terminology by filtering the ambiguous, noisy terms of a domain of speciality. However, the mining process often generates a huge number of rules. This issue leads us to raise the question of how can we find a subset of rules that constitutes a valid relational structure according to the knowledge domain. We use statistical indices to rank the rules that are more capable of reflecting the complex semantic relations between terms. We also study how far some rules can help the expert with identifying synonymy/hypernymy relations or with filtering terms.
Document type :
Conference papers
Complete list of metadata

Contributor : Publications Loria Connect in order to contact the contributor
Submitted on : Tuesday, September 26, 2006 - 2:50:25 PM
Last modification on : Friday, February 26, 2021 - 3:28:05 PM


  • HAL Id : inria-00100767, version 1



Hacène Cherfi, Yannick Toussaint. How Far Association Rules and Statistical Indices help Structure Terminology?. Workshop of ECAI2002: Natural Language Processing and Machine Learning for Ontology Engineering OLT'02, In conjunction with ECAI 2002: 15th European Conference on Artificial Intelligence, Jul 2002, Lyon, France, pp.5-9. ⟨inria-00100767⟩



Record views