Multi-Category Support Vector Machines for Identifying Arabic Topics - Inria - Institut national de recherche en sciences et technologies du numérique Access content directly
Conference Papers Year : 2009

Multi-Category Support Vector Machines for Identifying Arabic Topics

Abstract

It is known that Support Vector Machines were designed for binary classification. Nevertless, it would be fruitful to extend this operation to what is called Multi-category classification. That is why, Multi-category Support Nector Machines (MSVM) become nowadays the current subject of several serious researches, aiming to achieve high levels of multi-category classification tasks. This technique has been assessed recently recently in some fields as text categorization, Cancer classification, etc. We should notify that experiments which have been realized until now using MSVM are limited to small data sets, since its computation is more expensive. In this paper, we are interested in the use of this method, for the first time in topic identification. The experiments conducted concern topic identification of Arabic language. The corpora are extracted from ALWATAN newspaper. Achieved results lead to an improvement of MSVM performance i comparison to the baseline SVM method. Nevertheless, SVM still outperforms MSVM when using larger sizes of the vocabulary.
Fichier principal
Vignette du fichier
CiclingpaperAbbasSmaili.pdf (1.58 Mo) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

inria-00403102 , version 1 (09-07-2009)

Identifiers

  • HAL Id : inria-00403102 , version 1

Cite

Mourad Abbas, Kamel Smaïli, Daoud Berkani. Multi-Category Support Vector Machines for Identifying Arabic Topics. 10th International Conference on Intelligent Text Processing and Computational Linguistics - CICLing 2009, Mar 2009, Mexico, Mexico. ⟨inria-00403102⟩
169 View
65 Download

Share

Gmail Facebook X LinkedIn More