Empirical Evaluation of the Impact of Data Pre-Processing on the Performance of Predictive SHM of Jet Engines

Abstract: We evaluate the impact of data pre-processing on the performance of predictive Structural Health Monitoring (SHM) algorithms in a real case study involving dozens of jet engines. To this end, we design a simple, robust four-step framework: 1) outlier removal, 2) range scaling, 3) variable selection (either by "manually" evaluating variable correlations or by quantifying variable importance via random forests) and 4) evaluation of the predictive performance of a single selected binary classifier (random forests). The results run counter to intuition and to the literature, since pre-processing the raw data decreases predictive performance in half of the cases analyzed. The isolated influences of the pre-processing techniques rank in this order: selecting important variables through random forests has the highest positive impact, followed closely by variable scaling and, to a lesser extent, outlier removal, while "manual" variable selection via the correlation matrix exerts a slightly negative impact on predictive performance. The influence of combining pre-processing techniques is in line with the isolated influence of each technique. However, a detailed evaluation should be carried out for every application, since these results might be due to the high data quality of aerospace engines or to the characteristics of random forests.
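
To make the first three steps of the framework concrete, here is a minimal sketch in plain Python. It is not the author's implementation: the abstract does not state the outlier rule, the scaling range, or the correlation cutoff, so the median/MAD rule with `k=3.5`, the [0, 1] range, the `threshold=0.9` cutoff, the function names, and the toy data are all assumptions made for illustration. Step 4 (training and scoring a random forest classifier) and the random-forest-based variable selection are omitted, since they need a machine learning library.

```python
import math
from statistics import mean, median

def remove_outliers(rows, k=3.5):
    """Step 1 (assumed rule): drop rows holding a value more than
    k median absolute deviations (MAD) away from its column median."""
    cols = list(zip(*rows))
    meds = [median(c) for c in cols]
    mads = [median([abs(v - m) for v in c]) for c, m in zip(cols, meds)]
    return [
        row for row in rows
        # A column with MAD == 0 cannot flag outliers, so it is skipped.
        if all(mad == 0 or abs(v - m) <= k * mad
               for v, m, mad in zip(row, meds, mads))
    ]

def range_scale(rows):
    """Step 2: min-max scale every column to [0, 1]; constant columns map to 0."""
    cols = list(zip(*rows))
    lows = [min(c) for c in cols]
    spans = [max(c) - min(c) for c in cols]
    return [
        [(v - lo) / sp if sp else 0.0 for v, lo, sp in zip(row, lows, spans)]
        for row in rows
    ]

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0

def select_uncorrelated(rows, threshold=0.9):
    """Step 3, "manual" variant (assumed rule): keep a column only if its
    absolute correlation with every already kept column is below threshold."""
    cols = list(zip(*rows))
    kept = []
    for j, col in enumerate(cols):
        if all(abs(pearson(col, cols[i])) < threshold for i in kept):
            kept.append(j)
    return kept

# Hypothetical toy data: column 1 is a linear function of column 0,
# and the last row is an obvious outlier.
raw = [
    [1.0, 3.0, 5.0],
    [2.0, 5.0, 1.0],
    [3.0, 7.0, 4.0],
    [4.0, 9.0, 2.0],
    [5.0, 11.0, 3.0],
    [100.0, 201.0, 3.0],
]

clean = remove_outliers(raw)
scaled = range_scale(clean)
kept = select_uncorrelated(scaled)
print(len(clean), "rows kept; selected column indices:", kept)
```

Each step is a separate function so that any subset can be switched on or off, which mirrors the comparison of isolated and combined pre-processing techniques described in the abstract.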
Document type:
Conference paper
Le Cam, Vincent and Mevel, Laurent and Schoefs, Franck. EWSHM - 7th European Workshop on Structural Health Monitoring, Jul 2014, Nantes, France. 2014
Cited literature: 12 references

https://hal.inria.fr/hal-01020463
Contributor: Anne Jaigu
Submitted on: Tuesday, 8 July 2014 - 10:14:31
Last modified on: Tuesday, 8 July 2014 - 13:55:51
Document(s) archived on: Wednesday, 8 October 2014 - 12:16:06

File

0344.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01020463, version 1

Citation

Jean-Loup Loyer. Empirical Evaluation of the Impact of Data Pre-Processing on the Performance of Predictive SHM of Jet Engines. Le Cam, Vincent and Mevel, Laurent and Schoefs, Franck. EWSHM - 7th European Workshop on Structural Health Monitoring, Jul 2014, Nantes, France. 2014. 〈hal-01020463〉

Metrics

Record views

119

File downloads

172