Sequential approaches for learning datum-wise sparse representations

Gabriel Dulac-Arnold; Ludovic Denoyer; Philippe Preux; Patrick Gallinari

doi:10.1007/s10994-012-5306-7

Journal article, Machine Learning, Year: 2012


Gabriel Dulac-Arnold

Role: Author
PersonId: 905215

Machine Learning and Information Retrieval

Ludovic Denoyer

Role: Author
PersonId: 9178
IdHAL: ludovic-denoyer
ORCID: 0000-0002-7348-788X
IdRef: 089291255

Machine Learning and Information Retrieval

Philippe Preux

Role: Author
PersonId: 5488
IdHAL: preux-philippe
IdRef: 059896353

Sequential Learning

Patrick Gallinari

Role: Author
PersonId: 751615
IdHAL: patrick-gallinari
ORCID: 0000-0001-9060-9001
IdRef: 070709076

Machine Learning and Information Retrieval

Abstract

In supervised classification, data representation is usually considered at the dataset level: one looks for the "best" representation of the data, assuming it to be the same for every datum in the data space. We propose a different approach in which the representations used for classification are tailored to each datum. One immediate goal is to obtain sparse datum-wise representations: our approach learns to build a representation specific to each datum that contains only a small subset of the features, thus allowing classification to be fast and efficient. This representation is obtained through a sequential decision process that chooses which features to acquire before classifying a particular point; this process is learned through algorithms based on Reinforcement Learning. The proposed method performs well on a collection of medium-sized sparse classification problems. It offers an alternative to global sparsity approaches, and is a natural framework for sequential classification problems. The method extends easily to a whole family of sparsity-related problems that would otherwise require developing specific solutions. This is the case in particular for cost-sensitive and limited-budget classification, where feature acquisition is costly and is often performed sequentially. Finally, our approach can handle the non-differentiable loss functions and combinatorial optimization encountered in more complex feature selection problems.
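
The sequential decision process described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. Everything in it is an assumption made for illustration: the names `run_episode`, `toy_policy`, `toy_classifier` and `CLASSIFY`, the reward shape (a penalty `lam` per acquired feature and a penalty of 1 for a wrong label), and the hand-written stopping rule that stands in for a policy learned by Reinforcement Learning.

```python
from typing import Callable, Dict, List, Sequence, Tuple

CLASSIFY = -1  # hypothetical action id meaning "stop acquiring and classify"

Acquired = Dict[int, float]  # feature index -> observed value


def run_episode(
    x: Sequence[float],
    label: int,
    policy: Callable[[Acquired, int], int],
    classifier: Callable[[Acquired], int],
    lam: float = 0.1,
) -> Tuple[int, List[int], float]:
    """Acquire features of one datum until the policy decides to classify.

    Assumed reward shape (illustrative only): -lam per acquired feature,
    and -1 if the final prediction differs from the label.
    """
    acquired: Acquired = {}
    total_reward = 0.0
    while True:
        action = policy(acquired, len(x))
        if action == CLASSIFY:
            prediction = classifier(acquired)
            if prediction != label:
                total_reward -= 1.0
            return prediction, sorted(acquired), total_reward
        if action in acquired or not 0 <= action < len(x):
            raise ValueError(f"invalid feature index: {action}")
        acquired[action] = x[action]  # reveal one more feature of this datum
        total_reward -= lam


def toy_classifier(acquired: Acquired) -> int:
    """Toy rule: predict +1 if the acquired values sum to a positive number."""
    return 1 if sum(acquired.values()) > 0 else -1


def toy_policy(acquired: Acquired, n_features: int, margin: float = 1.0) -> int:
    """Hand-written stand-in for a learned policy.

    Stops as soon as the partial sum is far from zero, otherwise asks for
    the lowest-index feature that has not been acquired yet.
    """
    confident = bool(acquired) and abs(sum(acquired.values())) >= margin
    if confident or len(acquired) == n_features:
        return CLASSIFY
    return next(i for i in range(n_features) if i not in acquired)


# Two data points end up with different, datum-specific representations.
easy = run_episode([2.0, -0.3, 0.1], 1, toy_policy, toy_classifier)
hard = run_episode([0.2, -0.9, 0.4], -1, toy_policy, toy_classifier)
print("easy datum uses features", easy[1])
print("hard datum uses features", hard[1])
```

In this toy setting the first datum is classified after a single feature, while the second requires all three. That is the datum-wise sparsity the abstract refers to: the number of acquired features depends on the point being classified rather than being fixed for the whole dataset.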

Keywords

supervised classification; reinforcement learning; data representation

Domains

Machine Learning [cs.LG]

Main file

versionPublieeMLJ.pdf (2.88 MB)

Origin: Publisher files allowed on an open archive


https://inria.hal.science/hal-00747724

Submitted on: Thursday, November 8, 2012, 15:25:43

Last modified on: Tuesday, April 11, 2023, 15:16:28

Long-term archiving on: Saturday, February 9, 2013, 03:41:36

Dates and versions

hal-00747724, version 1 (8 November 2012)

Identifiers

HAL Id: hal-00747724, version 1
DOI: 10.1007/s10994-012-5306-7

Cite

Gabriel Dulac-Arnold, Ludovic Denoyer, Philippe Preux, Patrick Gallinari. Sequential approaches for learning datum-wise sparse representations. Machine Learning, 2012, 89 (1-2), pp.87-122. ⟨10.1007/s10994-012-5306-7⟩. ⟨hal-00747724⟩


Collections

UPMC UNIV-LILLE3 CNRS INRIA LAGIS LIP6 INRIA2 SORBONNE-UNIVERSITE SU-SCIENCES

453 views

350 downloads
