Skip to Main content Skip to Navigation

Mesurer la cohésion sémantique dans les corpus de documents

Guy Melançon 1, 2 Benjamin Renoust 1, 2, 3 Marie-Luce Viaud 3 
1 GRAVITE - Graph Visualization and Interactive Exploration
Université Sciences et Technologies - Bordeaux 1, Inria Bordeaux - Sud-Ouest, École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB), CNRS - Centre National de la Recherche Scientifique : UMR
Abstract : Exploring document collections remains a focus of research. This task can be tackled using various techniques, typically ranking documents according to a relevance index or grouping documents based on various clustering algorithms. The task complexity produces results of varying quality that inevitably carry noise. Users must be careful when interpreting document relevance or groupings. We address this problem by computing cohesion measures for a group of documents con rming/in rming whether it can be trusted to form a semantically cohesive unit. The index is inspired from past work in social network analysis (SNA) and illustrates how document exploration can bene t from SNA techniques.
Complete list of metadata

Cited literature [23 references]  Display  Hide  Download
Contributor : Guy Melançon Connect in order to contact the contributor
Submitted on : Friday, September 28, 2012 - 9:03:25 PM
Last modification on : Saturday, June 25, 2022 - 8:29:56 PM
Long-term archiving on: : Friday, December 16, 2016 - 5:58:10 PM


Files produced by the author(s)


  • HAL Id : hal-00736724, version 1


Guy Melançon, Benjamin Renoust, Marie-Luce Viaud. Mesurer la cohésion sémantique dans les corpus de documents. [Research Report] RR-8075, INRIA. 2012, pp.21. ⟨hal-00736724⟩



Record views


Files downloads