hal-00642998, version 1
Algorithms for Hyper-Parameter Optimization
James Bergstra 1R. Bardenet 2, 3Yoshua Bengio 4Balázs Kégl
2, 3, 5
25th Annual Conference on Neural Information Processing Systems (NIPS 2011) (2011)
Résumé : Several recent advances to the state of the art in image classification benchmarks have come from better configurations of existing techniques rather than novel ap- proaches to feature learning. Traditionally, hyper-parameter optimization has been the job of humans because they can be very efficient in regimes where only a few trials are possible. Presently, computer clusters and GPU processors make it pos- sible to run more trials and we show that algorithmic approaches can find better results. We present hyper-parameter optimization results on tasks of training neu- ral networks and deep belief networks (DBNs). We optimize hyper-parameters using random search and two new greedy sequential methods based on the ex- pected improvement criterion. Random search has been shown to be sufficiently efficient for learning neural networks for several datasets, but we show it is unreli- able for training DBNs. The sequential algorithms are applied to the most difficult DBN learning problems from [1] and find significantly better results than the best previously reported. This work contributes novel techniques for making response surface models P(y|x) in which many elements of hyper-parameter assignment (x) are known to be irrelevant given particular values of other elements.
- 1 : The Rowland Institute
- Harvard university (Cambridge, USA)
- 2 : Laboratoire de Recherche en Informatique (LRI)
- CNRS : UMR8623 – Université Paris XI - Paris Sud
- 3 : TAO (INRIA Saclay - Ile de France)
- INRIA – CNRS : UMR8623 – Université Paris XI - Paris Sud
- 4 : Département d'Informatique et de Recherche Opérationnelle [Montreal] (DIRO)
- Université de Montréal
- 5 : Laboratoire de l'Accélérateur Linéaire (LAL)
- CNRS : UMR8607 – IN2P3 – Université Paris XI - Paris Sud
- Domaine : Informatique/Apprentissage
- Référence interne : LAL 11-308
- hal-00642998, version 1
- http://hal.inria.fr/hal-00642998
- oai:hal.inria.fr:hal-00642998
- Contributeur : Balázs Kégl
- Soumis le : Dimanche 20 Novembre 2011, 22:39:52
- Dernière modification le : Mardi 30 Octobre 2012, 17:06:04






Documents associés
Exporter