Journal articles

Resampling methods for parameter-free and robust feature selection with mutual information

Abstract : Combining the mutual information criterion with a forward feature selection strategy offers a good trade-off between the optimality of the selected feature subset and computation time. However, it requires setting the parameter(s) of the mutual information estimator and deciding when to halt the forward procedure. Both choices are difficult because the estimation of the mutual information becomes less and less reliable as the dimensionality of the subset increases. This paper proposes using resampling methods, namely K-fold cross-validation and the permutation test, to address both issues. The resampling methods provide information about the variance of the estimator, which can then be used to set the parameter automatically and to compute a threshold for stopping the forward procedure. The procedure is illustrated on a synthetic dataset as well as on real-world examples.
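The stopping rule described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a simple histogram (plug-in) estimator in place of the estimator used in the paper, keeps the estimator parameter (`bins`) fixed instead of tuning it by K-fold resampling, and builds the permutation null by shuffling only the candidate feature. All function names (`discretize`, `mutual_information`, `joint_code`, `forward_select`) are hypothetical.

```python
import numpy as np

def discretize(col, bins):
    # Equal-frequency binning: interior quantiles serve as bin edges.
    edges = np.quantile(col, np.linspace(0, 1, bins + 1)[1:-1])
    return np.digitize(col, edges)  # integer codes in 0..bins-1

def mutual_information(codes_x, codes_y):
    # Plug-in MI estimate (in nats) between two integer-coded variables.
    _, x_idx = np.unique(codes_x, return_inverse=True)
    _, y_idx = np.unique(codes_y, return_inverse=True)
    joint = np.zeros((x_idx.max() + 1, y_idx.max() + 1))
    np.add.at(joint, (x_idx, y_idx), 1.0)
    joint /= joint.sum()
    px = joint.sum(axis=1, keepdims=True)
    py = joint.sum(axis=0, keepdims=True)
    nz = joint > 0
    return float(np.sum(joint[nz] * np.log(joint[nz] / (px @ py)[nz])))

def joint_code(D, cols, bins):
    # Encode a subset of discretized columns as a single integer code.
    code = np.zeros(D.shape[0], dtype=np.int64)
    for c in cols:
        code = code * bins + D[:, c]
    return code

def forward_select(X, y, bins=4, n_perm=50, alpha=0.05, seed=0):
    rng = np.random.default_rng(seed)
    D = np.column_stack([discretize(X[:, j], bins) for j in range(X.shape[1])])
    dy = discretize(y, bins)
    selected, remaining = [], list(range(X.shape[1]))
    while remaining:
        # Score each candidate by the MI between (selected + candidate) and y.
        scores = {j: mutual_information(joint_code(D, selected + [j], bins), dy)
                  for j in remaining}
        best = max(scores, key=scores.get)
        # Assumed null: shuffle only the candidate column, so the null MI
        # carries the same dimension-dependent bias as the actual estimate.
        base = joint_code(D, selected, bins)
        null = [mutual_information(base * bins + rng.permutation(D[:, best]), dy)
                for _ in range(n_perm)]
        # Stop when the best candidate does not exceed the permutation threshold.
        if scores[best] <= np.quantile(null, 1 - alpha):
            break
        selected.append(best)
        remaining.remove(best)
    return selected

# Synthetic check: only features 0 and 1 influence y.
rng = np.random.default_rng(42)
X = rng.normal(size=(1000, 5))
y = 2.0 * X[:, 0] + X[:, 1] + 0.1 * rng.normal(size=1000)
selected = forward_select(X, y)
print(selected)
```

Shuffling only the candidate column is a design choice of this sketch: the threshold then rises together with the estimator's bias as the subset grows, which is the difficulty the abstract points to.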

Contributor : Fabrice Rossi
Submitted on : Sunday, September 23, 2007 - 3:35:36 PM
Last modification on : Thursday, February 3, 2022 - 11:16:44 AM
Long-term archiving on : Thursday, April 8, 2010 - 8:52:39 PM


Files produced by the author(s)




Damien François, Fabrice Rossi, Vincent Wertz, Michel Verleysen. Resampling methods for parameter-free and robust feature selection with mutual information. Neurocomputing, Elsevier, 2007, 70 (7-9), pp.1276-1288. ⟨10.1016/j.neucom.2006.11.019⟩. ⟨inria-00174298⟩


