Combining Statistical Information and Semantic Similarity for Short Text Feature Extension - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2016

Combining Statistical Information and Semantic Similarity for Short Text Feature Extension

Résumé

A short text feature extension method combining statistical information and semantic similarity is proposed,Firstly, After defining the contribution of word, mutual information, an associated word-pairs set is generated by comparing the value of mutual information with threshold, then it is taken as the query words set to search for HowNet. For each word-pairs, senses are found in knowledge base HowNet, and semantic similarity of query word-pairs are calculated. Common sememe satisfied condition is added into the original term vector as extended feature, otherwise, semantic relationship is computed and the corresponding sememe is expanded into feature set. The above process is repeated, an extended feature set is finally obtained. Experimental results show the effectiveness of our method.
Fichier principal
Vignette du fichier
433802_1_En_21_Chapter.pdf (680.4 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01614984 , version 1 (11-10-2017)

Licence

Paternité

Identifiants

Citer

Xiaohong Li, Yun Su, Huifang Ma, Lin Cao. Combining Statistical Information and Semantic Similarity for Short Text Feature Extension. 9th International Conference on Intelligent Information Processing (IIP), Nov 2016, Melbourne, VIC, Australia. pp.205-210, ⟨10.1007/978-3-319-48390-0_21⟩. ⟨hal-01614984⟩
187 Consultations
100 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More