
SEARNN: Training RNNs with global-local losses

Abstract : We propose SEARNN, a novel training algorithm for recurrent neural networks (RNNs) inspired by the "learning to search" (L2S) approach to structured prediction. RNNs have been widely successful in structured prediction applications such as machine translation or parsing, and are commonly trained using maximum likelihood estimation (MLE). Unfortunately, this training loss is not always an appropriate surrogate for the test error: by only maximizing the ground truth probability, it fails to exploit the wealth of information offered by structured losses. Further, it introduces discrepancies between training and predicting (such as exposure bias) that may hurt test performance. Instead, SEARNN leverages test-alike search space exploration to introduce global-local losses that are closer to the test error. We demonstrate improved performance over MLE on three different tasks: OCR, spelling correction and text chunking. Finally, we propose a subsampling strategy to enable SEARNN to scale to large vocabulary sizes.
Document type :
Preprints, Working Papers, ...

https://hal.inria.fr/hal-01665263
Contributor : Rémi Leblond
Submitted on : Friday, December 22, 2017 - 1:39:55 PM
Last modification on : Thursday, March 17, 2022 - 10:08:53 AM

Identifiers

  • HAL Id : hal-01665263, version 1
  • ARXIV : 1706.04499

Citation

Rémi Leblond, Jean-Baptiste Alayrac, Anton Osokin, Simon Lacoste-Julien. SEARNN: Training RNNs with global-local losses. 2017. ⟨hal-01665263⟩
