A sliced inverse regression approach for data stream - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2012

A sliced inverse regression approach for data stream

Résumé

In this article, we focus on data arriving sequentially by block in a stream. A semiparametric regression model involving a common EDR (Effective Dimension Reduction) direction is assumed in each block. Our goal is to estimate this direction at each arrival of a new block. A simple direct approach consists in pooling all the observed blocks and estimate the EDR direction by the SIR (Sliced Inverse Regression) method. But some disadvantages appear in practice such as the storage of the blocks and the running time for high dimensional data. To overcome these drawbacks, we propose an adaptive SIR estimator of based on the SIR approach for a stratified population developed by Chavent et al. (2011). The proposed approach is faster both from computational complexity and running time points of view, and provides data storage benefits. We show the consistency of our estimator at the root-n rate and give its asymptotic distribution. We propose an extension to multiple indices model. We also provide a graphical tool in order to detect if a drift occurs in the EDR direction or if some aberrant blocks appear in the data stream. In a simulation study, we illustrate the good numerical behavior of our estimator. One important advantage of this approach is its adaptability to changes in the underlying model. Finally we apply it on real data concerning the estimation of Mars surface physical properties.
Fichier principal
Vignette du fichier
article-SIRdatastream-soumis-fevrier2012.pdf (1.78 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00688609 , version 1 (18-04-2012)
hal-00688609 , version 2 (02-10-2012)

Identifiants

  • HAL Id : hal-00688609 , version 1

Citer

Marie Chavent, Stéphane Girard, Vanessa Kuentz, Benoit Liquet, Thi Mong Ngoc Nguyen, et al.. A sliced inverse regression approach for data stream. 2012. ⟨hal-00688609v1⟩
361 Consultations
185 Téléchargements

Partager

Gmail Facebook X LinkedIn More