Mesurer la cohésion sémantique dans les corpus de documents

Guy Melançon 1, 2 Benjamin Renoust 1, 2, 3 Marie-Luce Viaud 3
1 GRAVITE - Graph Visualization and Interactive Exploration
Université Sciences et Technologies - Bordeaux 1, Inria Bordeaux - Sud-Ouest, École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB), CNRS - Centre National de la Recherche Scientifique : UMR
Abstract : Exploring document collections remains a focus of research. This task can be tackled using various techniques, typically ranking documents according to a relevance index or grouping documents based on various clustering algorithms. The task complexity produces results of varying quality that inevitably carry noise. Users must be careful when interpreting document relevance or groupings. We address this problem by computing cohesion measures for a group of documents con rming/in rming whether it can be trusted to form a semantically cohesive unit. The index is inspired from past work in social network analysis (SNA) and illustrates how document exploration can bene t from SNA techniques.
Complete list of metadatas

Cited literature [23 references]  Display  Hide  Download

https://hal.inria.fr/hal-00736724
Contributor : Guy Melançon <>
Submitted on : Friday, September 28, 2012 - 9:03:25 PM
Last modification on : Wednesday, October 9, 2019 - 11:44:04 AM
Long-term archiving on : Friday, December 16, 2016 - 5:58:10 PM

File

RR-8075.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00736724, version 1

Citation

Guy Melançon, Benjamin Renoust, Marie-Luce Viaud. Mesurer la cohésion sémantique dans les corpus de documents. [Research Report] RR-8075, INRIA. 2012, pp.21. ⟨hal-00736724⟩

Share

Metrics

Record views

430

Files downloads

294