Sparse conditional logistic regression for analyzing large-scale matched data from epidemiological studies: a simple algorithm
Journal article, BMC Bioinformatics, Year: 2015

Abstract

This paper considers the problem of estimation and variable selection for large, high-dimensional data (a high number of predictors p and a large sample size N, without excluding the possibility that N < p) resulting from an individually matched case-control study. We develop a simple algorithm that adapts the Lasso and related methods to the conditional logistic regression model. Our proposal relies on simplifying the calculations involved in the likelihood function. The proposed algorithm then iteratively solves reweighted Lasso problems using cyclical coordinate descent, computed along a regularization path. This method can handle large problems and deal with sparse features efficiently. We discuss its benefits and drawbacks with respect to existing implementations. We also illustrate the value and use of these techniques on a pharmacoepidemiological study of medication use and traffic safety.
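The abstract's recipe (a simplified likelihood, then reweighted Lasso problems solved by cyclical coordinate descent along a regularization path) can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions: matching is 1:1, in which case the conditional likelihood of a pair is known to reduce to 1 / (1 + exp(-d'β)), where d is the case-minus-control covariate difference; the data are dense NumPy arrays rather than sparse features; and the names `simulate_pairs`, `fit_pairs_lasso` and `lasso_path` are hypothetical.

```python
import numpy as np

def simulate_pairs(n, p, beta_true, seed=0):
    """Toy 1:1 matched pairs; returns case-minus-control differences (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    x1, x2 = rng.normal(size=(n, p)), rng.normal(size=(n, p))
    prob_first = 1.0 / (1.0 + np.exp(-(x1 - x2) @ beta_true))
    first_is_case = rng.random(n) < prob_first
    return np.where(first_is_case[:, None], x1 - x2, x2 - x1)

def soft_threshold(a, lam):
    return np.sign(a) * max(abs(a) - lam, 0.0)

def fit_pairs_lasso(D, lam, beta=None, n_outer=50, n_inner=100, tol=1e-8):
    """Minimise mean(log(1 + exp(-D @ beta))) + lam * ||beta||_1 (assumes 1:1 matching)."""
    n, p = D.shape
    beta = np.zeros(p) if beta is None else beta.copy()
    for _ in range(n_outer):
        beta_old = beta.copy()
        eta = D @ beta
        prob = 1.0 / (1.0 + np.exp(-eta))
        # Quadratic approximation: weights and working response (every "response" is 1).
        w = np.maximum(prob * (1.0 - prob), 1e-5)
        z = eta + (1.0 - prob) / w
        r = z - eta  # working residual, kept in sync with beta below
        for _ in range(n_inner):
            max_change = 0.0
            for j in range(p):  # cyclical coordinate descent on the weighted Lasso
                dj = D[:, j]
                denom = np.dot(w, dj * dj) / n
                if denom == 0.0:
                    continue
                rho = np.dot(w * dj, r) / n + denom * beta[j]
                delta = soft_threshold(rho, lam) / denom - beta[j]
                if delta != 0.0:
                    beta[j] += delta
                    r -= delta * dj
                    max_change = max(max_change, abs(delta))
            if max_change < tol:
                break
        if np.max(np.abs(beta - beta_old)) < tol:
            break
    return beta

def lasso_path(D, n_lambda=20, ratio=0.01):
    """Fit along a decreasing grid of penalties, warm-starting each fit from the previous one."""
    n, p = D.shape
    lam_max = 0.5 * np.max(np.abs(D.sum(axis=0))) / n  # smallest penalty giving beta = 0
    lambdas = lam_max * np.geomspace(1.0, ratio, n_lambda)
    betas = np.zeros((n_lambda, p))
    beta = np.zeros(p)
    for k, lam in enumerate(lambdas):
        beta = fit_pairs_lasso(D, lam, beta)
        betas[k] = beta
    return lambdas, betas

# Usage on simulated data: two truly active predictors out of ten.
beta_true = np.zeros(10)
beta_true[:2] = [1.5, -1.0]
D = simulate_pairs(400, 10, beta_true)
lambdas, betas = lasso_path(D)
print("non-zero coefficients along the path:", (betas != 0).sum(axis=1))
```

Warm starts are what make the path cheap in this sketch: each fit begins from the previous solution, so only a few reweighting steps are typically needed per penalty value. Matching schemes other than 1:1 would need a different likelihood and are not covered here.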
Main file
1471-2105-16-S6-S1-1.pdf (4 KB)
Origin: Publication funded by an institution

Dates and versions

hal-01217312, version 1 (19-10-2015)

Cite

Marta Avalos, Hélène Pouyes, Yves Grandvalet, Ludivine Orriols, Emmanuel Lagarde. Sparse conditional logistic regression for analyzing large-scale matched data from epidemiological studies: a simple algorithm. BMC Bioinformatics, 2015, 16 (Suppl 6), pp.S1. ⟨10.1186/1471-2105-16-S6-S1⟩. ⟨hal-01217312⟩