Dealing with highly imbalanced textual data gathered into similar classes

Jean-Charles Lamirel

doi:10.1109/IJCNN.2013.6707044

Communication Dans Un Congrès Année : 2013

Dealing with highly imbalanced textual data gathered into similar classes

(1)

Jean-Charles Lamirel

Fonction : Auteur
PersonId : 8202
IdHAL : jean-charles-lamirel

Natural Language Processing : representations, inference and semantics

Résumé

This paper deals with a new feature selection and feature contrasting approach for classification of highly imbalanced textual data with a high degree of similarity between associated classes. An example of such classification context is illustrated by the task of classifying bibliographic references into a patent classification scheme. This task represents one of the domains of investigation of the QUAERO project, with the final goal of helping experts to evaluate upcoming patents through the use of related research.

Domaines

Réseau de neurones [cs.NE]

Jean-Charles Lamirel : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00939036

Soumis le : jeudi 30 janvier 2014-07:15:11

Dernière modification le : lundi 11 septembre 2023-17:41:18

Dates et versions

hal-00939036 , version 1 (30-01-2014)

Identifiants

HAL Id : hal-00939036 , version 1
DOI : 10.1109/IJCNN.2013.6707044

Citer

Jean-Charles Lamirel. Dealing with highly imbalanced textual data gathered into similar classes. IJCNN - 2013 International Joint Conference on Neural Networks, Aug 2013, Dallas, United States. ⟨10.1109/IJCNN.2013.6707044⟩. ⟨hal-00939036⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE LORIA LORIA-NLPKD

138 Consultations

0 Téléchargements

Dealing with highly imbalanced textual data gathered into similar classes

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager