Conquering Language: Using NLP on a Massive Scale to Build High Dimensional Language Models from the Web - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2007

Conquering Language: Using NLP on a Massive Scale to Build High Dimensional Language Models from the Web

Résumé

Dictionaries only contain some of the information we need to know about a language. The growth of the Web, the maturation of linguistic process-ing tools, and the decline in price of memory storage allow us to envision de-scriptions of languages that are much larger than before. We can conceive of building a complete language model for a language using all the text that is found on the Web for this language. This article describes our current project to do just that.
Fichier principal
Vignette du fichier
GrefenstettefinalCICLING.pdf (79.56 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01081036 , version 1 (06-11-2014)

Identifiants

Citer

Gregory Grefenstette. Conquering Language: Using NLP on a Massive Scale to Build High Dimensional Language Models from the Web. CICLing, Feb 2007, Mexico, Mexico. pp.35 - 49, ⟨10.1007/978-3-540-70939-8_4⟩. ⟨hal-01081036⟩

Collections

INRIA INRIA2
53 Consultations
150 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More