Conference papers

Discovering Cross-Language Links in Wikipedia through Semantic Relatedness

Abstract : Wikipedia is a large multilingual collection of interlinked articles, used and contributed to by millions of users over the Internet, that provides editions in up to 283 languages. Two articles in different language versions of Wikipedia may have information on exactly the same concept, in which case they are often connected through a cross-language link. However, many cross-language links are either missing or incorrect, and this negatively affects both the readers of Wikipedia and multilingual information retrieval applications. In this paper, we propose WIKICL, an algorithm for discovering cross-language links using the semantic relatedness of two articles derived from the Wikipedia graph structure. Our evaluation shows that we achieve comparable, and in some cases better, results than previous methods with much less computational time.
Document type : Conference papers
https://hal.inria.fr/hal-00787452
Contributor : Chantal Reynaud
Submitted on : Tuesday, February 12, 2013 - 10:59:24 AM
Last modification on : Sunday, June 26, 2022 - 11:58:15 AM

Identifiers

  • HAL Id : hal-00787452, version 1

Citation

A. Penta, Gianluca Quercini, Chantal Reynaud, Nigel Shadbolt. Discovering Cross-Language Links in Wikipedia through Semantic Relatedness. ECAI, Aug 2012, Montpellier, France. ⟨hal-00787452⟩
