Dealing with highly imbalanced textual data gathered into similar classes - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2013

Dealing with highly imbalanced textual data gathered into similar classes

Résumé

This paper deals with a new feature selection and feature contrasting approach for classification of highly imbalanced textual data with a high degree of similarity between associated classes. An example of such classification context is illustrated by the task of classifying bibliographic references into a patent classification scheme. This task represents one of the domains of investigation of the QUAERO project, with the final goal of helping experts to evaluate upcoming patents through the use of related research.
Fichier non déposé

Dates et versions

hal-00939036 , version 1 (30-01-2014)

Identifiants

Citer

Jean-Charles Lamirel. Dealing with highly imbalanced textual data gathered into similar classes. IJCNN - 2013 International Joint Conference on Neural Networks, Aug 2013, Dallas, United States. ⟨10.1109/IJCNN.2013.6707044⟩. ⟨hal-00939036⟩
138 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More