Heat Map based Feature Ranker: In Depth Comparison with Popular Methods

Abstract : The new era of technology allows us to gather more data than ever before, complex data emerge and a lot of noise can be found among high dimensional datasets. In order to discard useless features and help build more generalized models, feature selection seeks a reduced subset of features that improve the performance of the learning algorithm. The evaluation of features and their interactions are an expensive process, hence the need for heuristics. In this work, we present HeatMap Based Feature Ranker, an algorithm to estimate feature importance purely based on its interaction with other variables. A compression mechanism reduces evaluation space up to 66% without compromising efficacy. Our experiments show that our proposal is very competitive against popular algorithms, producing stable results across different types of data. We also show how noise reduction through feature selection aids data visualization using emergent self-organizing maps.
Document type :
Journal articles
Complete list of metadatas

Cited literature [39 references]  Display  Hide  Download

https://hal.inria.fr/hal-01848544
Contributor : Christian Raymond <>
Submitted on : Tuesday, July 24, 2018 - 5:51:52 PM
Last modification on : Friday, September 13, 2019 - 9:48:07 AM
Long-term archiving on : Thursday, October 25, 2018 - 4:34:33 PM

File

IDA218.pdf
Files produced by the author(s)

Identifiers

Citation

Carlos Huertas, Reyes Juárez-Ramírez, Christian Raymond. Heat Map based Feature Ranker: In Depth Comparison with Popular Methods. Intelligent Data Analysis, IOS Press, In press, 22 (5), pp.1009-1037. ⟨10.3233/IDA-173481⟩. ⟨hal-01848544⟩

Share

Metrics

Record views

163

Files downloads

375