A new non-convex framework to improve asymptotical knowledge on generic stochastic gradient descent
Conference Paper, Year: 2023

A new non-convex framework to improve asymptotical knowledge on generic stochastic gradient descent

Abstract

Stochastic gradient optimization methods are widely used to minimize non-convex smooth objective functions, for instance when training deep neural networks. However, theoretical guarantees on the asymptotic behaviour of these methods remain scarce. In particular, ensuring almost-sure convergence of the iterates to a stationary point is quite challenging. In this work, we introduce a new Kurdyka-Łojasiewicz theoretical framework to analyze the asymptotic behaviour of stochastic gradient descent (SGD) schemes when minimizing non-convex smooth objectives. Our framework provides new almost-sure convergence results for the iterates generated by any SGD method satisfying mild conditional descent conditions. We illustrate the proposed framework through several toy simulation examples, which highlight the role of the theoretical assumptions and show how the SGD iterates are impacted when these assumptions are fully or only partially satisfied.
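
For intuition, below is a minimal sketch of the kind of generic SGD scheme the abstract refers to, run on a non-convex smooth toy objective. The objective, the decaying step-size rule, and the additive zero-mean noise model are illustrative assumptions made for this sketch only; they are not the paper's specific conditional descent conditions or its experiments.

    import numpy as np

    # Toy non-convex smooth objective: f(x) = sum_i x_i^2 / (1 + x_i^2).
    # Its gradient is 2 x / (1 + x^2)^2 componentwise (illustrative choice,
    # not taken from the paper).
    def grad_f(x):
        return 2.0 * x / (1.0 + x**2) ** 2

    rng = np.random.default_rng(0)
    x = rng.standard_normal(10)  # random initial iterate

    n_iter = 10_000
    for k in range(1, n_iter + 1):
        # Assumed step-size rule gamma_k = k^{-0.6}, so that
        # sum gamma_k = inf and sum gamma_k^2 < inf (classical SGD regime).
        gamma_k = 1.0 / k**0.6
        # Zero-mean stochastic error on the gradient (assumed noise model).
        noise = 0.1 * rng.standard_normal(x.shape)
        # Generic SGD update: x_{k+1} = x_k - gamma_k * (grad f(x_k) + noise_k).
        x = x - gamma_k * (grad_f(x) + noise)

    print("final gradient norm:", np.linalg.norm(grad_f(x)))

Under such conditions one typically observes the gradient norm decaying along the iterates; the paper's contribution is to turn this kind of empirical behaviour into almost-sure convergence guarantees via a Kurdyka-Łojasiewicz argument.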
Main file: MLSP_2023.pdf (327.71 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-04165342, version 1 (18-07-2023)

Licence

Attribution

Identifiers

  • HAL Id: hal-04165342, version 1

Cite

Jean-Baptiste Fest, Audrey Repetti, Emilie Chouzenoux. A new non-convex framework to improve asymptotical knowledge on generic stochastic gradient descent. MLSP 2023 - IEEE International Workshop on Machine Learning for Signal Processing, Sep 2023, Rome, Italy. ⟨hal-04165342⟩
15 Views
29 Downloads
