Journal article, Journal of Machine Learning Research, 2022

Implicit differentiation for fast hyperparameter selection in non-smooth convex learning

Abstract

Finding the optimal hyperparameters of a model can be cast as a bilevel optimization problem, typically solved using zero-order techniques. In this work we study first-order methods when the inner optimization problem is convex but non-smooth. We show that the forward-mode differentiation of proximal gradient descent and proximal coordinate descent yields sequences of Jacobians converging toward the exact Jacobian. Using implicit differentiation, we show it is possible to leverage the non-smoothness of the inner problem to speed up the computation. Finally, we provide a bound on the error made on the hypergradient when the inner optimization problem is solved approximately. Results on regression and classification problems reveal computational benefits for hyperparameter optimization, especially when multiple hyperparameters are required.
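As a rough illustration of the two differentiation schemes mentioned in the abstract, below is a minimal sketch for the Lasso, a non-smooth inner problem, with a single regularization hyperparameter lam. The function names (soft_threshold, forward_mode_ista, implicit_hypergrad) are illustrative and not taken from the paper or its code; the sketch only assumes standard proximal gradient (ISTA) updates and the usual Lasso optimality conditions restricted to the active set.

    import numpy as np

    def soft_threshold(z, t):
        """Proximal operator of t * ||.||_1."""
        return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

    def forward_mode_ista(X, y, lam, n_iter=500):
        """Run proximal gradient descent (ISTA) on the Lasso
            min_b  ||y - X b||^2 / (2 n) + lam * ||b||_1
        while propagating the Jacobian d beta / d lam by forward-mode
        (iterative) differentiation of each update."""
        n, p = X.shape
        L = np.linalg.norm(X, ord=2) ** 2 / n   # Lipschitz constant of the smooth part
        beta = np.zeros(p)
        jac = np.zeros(p)                        # current estimate of d beta / d lam
        for _ in range(n_iter):
            grad = X.T @ (X @ beta - y) / n
            beta = soft_threshold(beta - grad / L, lam / L)
            support = beta != 0
            # Chain rule through soft-thresholding: the Jacobian is zero outside
            # the support, and d ST(z, t)/dt = -sign(beta) on the support (t = lam / L).
            jac = support * (jac - X.T @ (X @ jac) / (n * L)) - support * np.sign(beta) / L
        return beta, jac

    def implicit_hypergrad(X, y, X_val, y_val, beta, lam):
        """Implicit differentiation restricted to the support of beta:
        the Lasso optimality conditions on the active set S give
        d beta_S / d lam = -n (X_S^T X_S)^{-1} sign(beta_S), and zero elsewhere."""
        n = X.shape[0]
        S = beta != 0
        jac = np.zeros_like(beta)
        if S.any():
            XS = X[:, S]
            jac[S] = np.linalg.solve(XS.T @ XS / n, -np.sign(beta[S]))
        # Hypergradient of the validation loss ||y_val - X_val beta||^2 / (2 n_val).
        grad_val = X_val.T @ (X_val @ beta - y_val) / X_val.shape[0]
        return jac @ grad_val

    # Usage sketch, assuming train/validation splits are available:
    # beta, jac = forward_mode_ista(X_train, y_train, lam)
    # hg_forward = jac @ (X_val.T @ (X_val @ beta - y_val)) / X_val.shape[0]
    # hg_implicit = implicit_hypergrad(X_train, y_train, X_val, y_val, beta, lam)

The second routine hints at why non-smoothness can be an advantage: since the Jacobian is supported on the active set of the Lasso solution, the linear system to solve involves only the (typically few) active features, which is the kind of speed-up the abstract refers to.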
Main file: journal.pdf (1.41 MB). Origin: Files produced by the author(s).

Dates and versions

hal-03228663, version 1 (18-05-2021)
hal-03228663, version 2 (18-10-2022)

Identifiers

  • HAL Id: hal-03228663, version 2

Cite

Quentin Bertrand, Quentin Klopfenstein, Mathurin Massias, Mathieu Blondel, Samuel Vaiter, et al. Implicit differentiation for fast hyperparameter selection in non-smooth convex learning. Journal of Machine Learning Research, 2022, 23 (149), pp. 1-48. ⟨hal-03228663v2⟩