Fast DNN training based on auxiliary function technique

Dung Tran; Nobutaka Ono; Emmanuel M. Vincent

Conference paper, Year: 2015


Dung Tran

Role: Author
PersonId : 953494

Analysis, perception and recognition of speech

Nobutaka Ono

Role: Author

National Institute of Informatics

Emmanuel M. Vincent

Role: Author
PersonId : 1256
IdHAL : emmanuelv
ORCID : 0000-0002-0183-7289
IdRef : 089360176

Analysis, perception and recognition of speech

Abstract

Deep neural networks (DNNs) are typically optimized with stochastic gradient descent (SGD) using either a fixed learning rate or an adaptive learning rate approach such as ADAGRAD. In this paper, we introduce a new learning rule for neural networks based on an auxiliary function technique that requires no parameter tuning. Instead of minimizing the objective function directly, a quadratic auxiliary function with a closed-form optimum is introduced recursively, layer by layer. We prove that the new learning rule decreases the objective monotonically. Our experiments show that the proposed algorithm converges faster and to a better local minimum than SGD. In addition, we propose a combination of the proposed learning rule and ADAGRAD that further accelerates convergence. Experimental evaluation on the MNIST database shows the benefit of the proposed approach in terms of digit recognition accuracy.
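To make the auxiliary function idea concrete, the following is a minimal sketch on a much simpler problem than the one treated in the paper: plain logistic regression rather than a multi-layer network. It is not the authors' layer-wise construction. It only illustrates the general principle that a quadratic upper bound with a closed-form minimizer yields a monotonically decreasing objective with no learning rate to tune. The toy data XS and YS, the function names, and the trace-based curvature bound are all assumptions made for this illustration.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def dot(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))


def loss(w, xs, ys):
    """Mean logistic loss for labels in {0, 1}."""
    total = 0.0
    for x, y in zip(xs, ys):
        z = dot(w, x)
        # log(1 + exp(z)) - y * z, written in a stable form
        total += max(z, 0.0) + math.log1p(math.exp(-abs(z))) - y * z
    return total / len(xs)


def gradient(w, xs, ys):
    """Gradient of the mean logistic loss."""
    g = [0.0] * len(w)
    for x, y in zip(xs, ys):
        err = sigmoid(dot(w, x)) - y
        for j, xj in enumerate(x):
            g[j] += err * xj
    return [gj / len(xs) for gj in g]


def curvature_bound(xs):
    """Scalar L such that the Hessian is bounded above by L * I.

    Uses s(1 - s) <= 1/4 and lambda_max(X^T X) <= trace(X^T X).
    """
    return sum(dot(x, x) for x in xs) / (4.0 * len(xs))


def auxiliary_step(w, xs, ys, L):
    """Closed-form minimizer of the quadratic auxiliary function
    Q(v | w) = f(w) + g^T (v - w) + (L / 2) * ||v - w||^2.
    """
    g = gradient(w, xs, ys)
    return [wi - gi / L for wi, gi in zip(w, g)]


def train(xs, ys, n_iter=50):
    w = [0.0] * len(xs[0])
    L = curvature_bound(xs)
    history = [loss(w, xs, ys)]
    for _ in range(n_iter):
        w = auxiliary_step(w, xs, ys, L)
        history.append(loss(w, xs, ys))
    return w, history


# Hypothetical toy data: first feature is a constant bias term.
XS = [[1.0, 0.5], [1.0, 1.5], [1.0, -1.0], [1.0, -2.0], [1.0, 0.2], [1.0, -0.3]]
YS = [1, 1, 0, 0, 0, 1]

if __name__ == "__main__":
    w, history = train(XS, YS)
    print("first loss:", history[0], "last loss:", history[-1])
```

Because Q(v | w) upper-bounds the loss everywhere and touches it at v = w, minimizing Q can never increase the loss, which is the kind of monotonic-decrease guarantee the abstract refers to. In this simplified setting the update reduces to a gradient step whose size is fixed by the curvature bound rather than chosen by hand.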

Keywords

DNN, back-propagation, gradient descent, adaptive learning rate, auxiliary function technique

Domains

Sound [cs.SD]

Main file

Dung2015ICASSP_v5_final.pdf (142.55 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-01107809

Submitted on: Thursday, January 29, 2015, 09:13:05

Last modified on: Monday, September 11, 2023, 17:41:19

Long-term archiving on: Saturday, September 12, 2015, 06:40:11

Dates and versions

hal-01107809 , version 1 (21-01-2015)

hal-01107809 , version 2 (29-01-2015)

hal-01107809 , version 3 (30-01-2015)

hal-01107809 , version 4 (11-02-2015)

Identifiers

HAL Id : hal-01107809 , version 2

Cite

Dung Tran, Nobutaka Ono, Emmanuel M. Vincent. Fast DNN training based on auxiliary function technique. 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2015, Apr 2015, Brisbane, Queensland, Australia. ⟨hal-01107809v2⟩

321 views

677 downloads
