Skip to Main content Skip to Navigation
Conference papers

A Word Embedding based Method for Question Retrieval in Community Question Answering

Abstract : Community Question Answering (cQA) continues to gain momentum owing to the unceasing rise of user-generated content that dominates the web. CQA are platforms that enable people with different backgrounds to share knowledge by freely asking and answering each other. In this paper, we focus on question retrieval which is deemed to be a key task in cQA. It aims at finding similar archived questions given a new query, assuming that the answers to the similar questions should also answer the new one. This is known to be a challenging task due to the ver-boseness in natural language and the word mismatch between the questions. Most traditional methods measure the similarity between questions based on the bag-of-words (BOWs) representation capturing no semantics between words. In this paper , we rely on word representation to capture the words semantic information in language vector space. Questions are then ranked using cosine similarity based on the vector-based word representation for each question. Experiments conducted on large-scale cQA data show that our method gives promising results.
Document type :
Conference papers
Complete list of metadata

Cited literature [26 references]  Display  Hide  Download
Contributor : Kamel Smaïli Connect in order to contact the contributor
Submitted on : Saturday, December 9, 2017 - 5:41:18 PM
Last modification on : Wednesday, November 3, 2021 - 7:57:43 AM


Files produced by the author(s)


  • HAL Id : hal-01660005, version 1



Nouha Othman, Rim Faiz, Kamel Smaili. A Word Embedding based Method for Question Retrieval in Community Question Answering. ICNLSSP 2017 - International Conference on Natural Language, Signal and Speech Processing, ISGA, Dec 2017, Casablanca, Morocco. ⟨hal-01660005⟩



Record views


Files downloads