Combining Statistical Information and Semantic Similarity for Short Text Feature Extension

Abstract: A short text feature extension method combining statistical information and semantic similarity is proposed. First, after the contribution of a word and mutual information are defined, a set of associated word pairs is generated by comparing mutual information values with a threshold; this set is then used as the query word set for searching HowNet. For each word pair, the senses are looked up in the HowNet knowledge base, and the semantic similarity of the query word pair is calculated. If a common sememe satisfies the condition, it is added to the original term vector as an extended feature; otherwise, the semantic relationship is computed and the corresponding sememe is added to the feature set. This process is repeated, and an extended feature set is finally obtained. Experimental results show the effectiveness of our method.
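
The first step described in the abstract, selecting associated word pairs by thresholding mutual information, can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not give the paper's definitions of word contribution or mutual information, so the sketch substitutes standard pointwise mutual information computed from document-level co-occurrence counts. The function name `associated_word_pairs`, the toy corpus, and the threshold value are hypothetical, and the HowNet lookup is not modeled.

```python
import math
from collections import Counter
from itertools import combinations


def associated_word_pairs(docs, threshold):
    """Return {(w1, w2): pmi} for word pairs whose PMI exceeds threshold.

    Assumption: PMI is estimated from document frequencies, i.e.
    pmi = log2(P(w1, w2) / (P(w1) * P(w2))), which may differ from the
    mutual information definition used in the paper.
    """
    n_docs = len(docs)
    word_df = Counter()  # number of documents containing each word
    pair_df = Counter()  # number of documents containing both words
    for doc in docs:
        vocab = sorted(set(doc))  # sorted so each pair has one canonical order
        word_df.update(vocab)
        pair_df.update(combinations(vocab, 2))

    pairs = {}
    for (w1, w2), n12 in pair_df.items():
        pmi = math.log2(n12 * n_docs / (word_df[w1] * word_df[w2]))
        if pmi > threshold:
            pairs[(w1, w2)] = pmi
    return pairs


# Toy corpus of tokenized short texts (hypothetical data).
docs = [
    ["stock", "market", "rise"],
    ["stock", "market", "fall"],
    ["football", "match", "win"],
    ["football", "match", "lose"],
]
pairs = associated_word_pairs(docs, threshold=0.5)
```

In this sketch, the keys of `pairs` would play the role of the query word set; words that never co-occur, such as "stock" and "football", produce no pair at all.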
Document type:
Conference paper
9th International Conference on Intelligent Information Processing (IIP), Nov 2016, Melbourne, VIC, Australia. IFIP Advances in Information and Communication Technology, AICT-486, pp.205-210, 2016, Intelligent Information Processing VIII. 〈10.1007/978-3-319-48390-0_21〉

https://hal.inria.fr/hal-01614984
Contributor: Hal Ifip
Submitted on: Wednesday, October 11, 2017 - 16:57:32
Last modified on: Wednesday, October 11, 2017 - 17:00:33
Document(s) archived on: Friday, January 12, 2018 - 15:48:44

File

Restricted access
File visible on: 2019-01-01

License

Distributed under a Creative Commons Attribution 4.0 International License

Citation

Xiaohong Li, Yun Su, Huifang Ma, Lin Cao. Combining Statistical Information and Semantic Similarity for Short Text Feature Extension. 9th International Conference on Intelligent Information Processing (IIP), Nov 2016, Melbourne, VIC, Australia. IFIP Advances in Information and Communication Technology, AICT-486, pp.205-210, 2016, Intelligent Information Processing VIII. 〈10.1007/978-3-319-48390-0_21〉. 〈hal-01614984〉
