Detecting Outliers in High-Dimensional Neuroimaging Datasets with Robust Covariance Estimators

Abstract : Medical imaging datasets often contain deviant observations, the so-called outliers, due to acquisition or preprocessing artifacts or resulting from large intrinsic inter-subject variability. These can undermine the statistical procedures used in group studies as the latter assume that the cohorts are composed of homogeneous samples with anatomical or functional features clustered around a central mode. The effects of outlying subjects can be mitigated by detecting and removing them with explicit statistical control. With the emergence of large medical imaging databases, exhaustive data screening is no longer possible, and automated outlier detection methods are currently gaining interest. The datasets used in medical imaging are often high-dimensional and strongly correlated. The outlier detection procedure should therefore rely on high-dimensional statistical multivariate models. However, state-of-the-art procedures are not well-suited for such high-dimensional settings. In this work, we introduce regularization in the MCD framework and investigate different regularization schemes. We carry out extensive simulations to provide backing for practical choices in absence of ground truth knowledge. We demonstrate on functional neuroimaging datasets that outlier detection can be performed with small sample sizes and improves group studies.
Complete list of metadatas

Cited literature [38 references]  Display  Hide  Download
Contributor : Virgile Fritsch <>
Submitted on : Thursday, May 24, 2012 - 7:35:30 PM
Last modification on : Thursday, March 7, 2019 - 3:34:14 PM
Long-term archiving on : Saturday, August 25, 2012 - 2:45:24 AM


Files produced by the author(s)




Virgile Fritsch, Gaël Varoquaux, Benjamin Thyreau, Jean-Baptiste Poline, Bertrand Thirion. Detecting Outliers in High-Dimensional Neuroimaging Datasets with Robust Covariance Estimators. Medical Image Analysis, Elsevier, 2012, 16, pp.1359-1370. ⟨10.1016/⟩. ⟨hal-00701225⟩



Record views


Files downloads