Skip to Main content Skip to Navigation
New interface
Conference papers

Dealing with highly imbalanced textual data gathered into similar classes

Jean-Charles Lamirel 1 
1 SYNALP - Natural Language Processing : representations, inference and semantics
LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This paper deals with a new feature selection and feature contrasting approach for classification of highly imbalanced textual data with a high degree of similarity between associated classes. An example of such classification context is illustrated by the task of classifying bibliographic references into a patent classification scheme. This task represents one of the domains of investigation of the QUAERO project, with the final goal of helping experts to evaluate upcoming patents through the use of related research.
Document type :
Conference papers
Complete list of metadata
Contributor : Jean-Charles Lamirel Connect in order to contact the contributor
Submitted on : Thursday, January 30, 2014 - 7:15:11 AM
Last modification on : Saturday, October 16, 2021 - 11:26:06 AM




Jean-Charles Lamirel. Dealing with highly imbalanced textual data gathered into similar classes. IJCNN - 2013 International Joint Conference on Neural Networks, Aug 2013, Dallas, United States. ⟨10.1109/IJCNN.2013.6707044⟩. ⟨hal-00939036⟩



Record views