Structured Penalties for Log-linear Language Models

Anil Nelakanti; Cédric Archambeau; Julien Mairal; Francis Bach; Guillaume Bouchard

Conference Papers Year : 2013

Structured Penalties for Log-linear Language Models

(1) , (1) , (2) , (3, 4) , (1)

1
2
3
4

Anil Nelakanti

Function : Author

Xerox Research Centre Europe [Meylan]

Cédric Archambeau

Function : Author

Xerox Research Centre Europe [Meylan]

Julien Mairal

Function : Author
PersonId : 1034832
ORCID : 0000-0001-6991-2110
IdRef : 152125256

Learning and recognition in vision

Francis Bach

Function : Author
PersonId : 863086

Laboratoire d'informatique de l'école normale supérieure

Statistical Machine Learning and Parsimony

Guillaume Bouchard

Function : Correspondent author
PersonId : 948434

Connectez-vous pour contacter l'auteur

Xerox Research Centre Europe [Meylan]

Abstract

Language models can be formalized as loglinear regression models where the input features represent previously observed contexts up to a certain length m. The complexity of existing algorithms to learn the parameters by maximum likelihood scale linearly in nd, where n is the length of the training corpus and d is the number of observed features. We present a model that grows logarithmically in d, making it possible to efficiently leverage longer contexts. We account for the sequential structure of natural language using treestructured penalized objectives to avoid overfitting and achieve better generalization.

Domains

Computer Vision and Pattern Recognition [cs.CV]

Fichier principal

anil_emnlp.pdf (195.54 Ko)

Origin : Files produced by the author(s)

Julien Mairal : Connect in order to contact the contributor

https://inria.hal.science/hal-00904820

Submitted on : Friday, November 15, 2013-12:00:56 PM

Last modification on : Thursday, April 4, 2024-6:22:05 PM

Long-term archiving on: Sunday, February 16, 2014-4:31:00 AM

Dates and versions

hal-00904820 , version 1 (15-11-2013)

Identifiers

HAL Id : hal-00904820 , version 1

Cite

Anil Nelakanti, Cédric Archambeau, Julien Mairal, Francis Bach, Guillaume Bouchard. Structured Penalties for Log-linear Language Models. EMNLP - Empirical Methods in Natural Language Processing, Oct 2013, Seattle, United States. pp.233-243. ⟨hal-00904820⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-PARIS UNIV-RENNES1 UGA CNRS INRIA IRISA LJK LJK_GI LJK_GI_LEAR INRIA2 PSL UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

488 View

185 Download

Structured Penalties for Log-linear Language Models

Abstract

Domains

Dates and versions

Identifiers

Cite

Export

Collections

Share