Concept Based Representations as Complement of Bag of Words in Information Retrieval

Abstract : Information Retrieval models, which do not represent texts merely as collections of the words they contain, but rather as collections of the concepts they contain through synonym sets or latent dimensions, are known as Bag-of-Concepts (BoC) representations. In this paper we use random indexing, which uses co-occurrence information among words to generate semantic context vectors and then represent the documents and queries as BoC. In addition, we use a novel representation, Holographic Reduced Representation, previously proposed in cognitive models, which can encode relations between words. We show that these representations can be successfully used in information retrieval, can associate terms, and when they are combined with the traditional vector space model, they improve effectiveness, in terms of mean average precision.
Document type :
Conference papers
Complete list of metadatas

Cited literature [25 references]  Display  Hide  Download

https://hal.inria.fr/hal-01060663
Contributor : Hal Ifip <>
Submitted on : Friday, November 17, 2017 - 3:56:59 PM
Last modification on : Thursday, February 7, 2019 - 4:52:59 PM
Long-term archiving on : Sunday, February 18, 2018 - 4:45:40 PM

File

CarrilloL10.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Citation

Maya Carrillo, Aurelio López-López. Concept Based Representations as Complement of Bag of Words in Information Retrieval. 6th IFIP WG 12.5 International Conference on Artificial Intelligence Applications and Innovations (AIAI), Oct 2010, Larnaca, Cyprus. pp.154-161, ⟨10.1007/978-3-642-16239-8_22⟩. ⟨hal-01060663⟩

Share

Metrics

Record views

391

Files downloads

149