Elimination Method Study of Ambiguous Words in Chinese Automatic Indexing - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2014

Elimination Method Study of Ambiguous Words in Chinese Automatic Indexing

Résumé

Faced with huge amounts of information to realize the accurate retrieval under the network environment, the first step is indexing words cannot appear ambiguity word. Because Chinese’s the basic unit is Chinese characters, Chinese characters form words, Word is divided into monosyllabic word and compound word, and there’s no space between Chinese keywords and there are a lot of ambiguous concept. Therefore a lot of ambiguity in the indexing process will be produced. The result detected information of irrelevant or mistakenly identified. The paper focuses on a method to eliminating the crossed meanings ambiguous words in the automatic indexing. The paper puts forward a method to eliminating ambiguous words combined algorithm of exhaustive method and disambiguation rules. Experiments show that it can avoid a great lot segmenting ambiguities with better segmenting results.
Fichier principal
Vignette du fichier
978-3-642-54341-8_9_Chapter.pdf (4 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01220817 , version 1 (27-10-2015)

Licence

Paternité

Identifiants

Citer

Wang Dan, Yang Xiaorong, Zhang Jie. Elimination Method Study of Ambiguous Words in Chinese Automatic Indexing. 7th International Conference on Computer and Computing Technologies in Agriculture (CCTA), Sep 2013, Beijing, China. pp.79-88, ⟨10.1007/978-3-642-54341-8_9⟩. ⟨hal-01220817⟩
84 Consultations
97 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More