Skip to Main content Skip to Navigation
Conference papers

Conquering Language: Using NLP on a Massive Scale to Build High Dimensional Language Models from the Web

Gregory Grefenstette 1, *
* Corresponding author
Abstract : Dictionaries only contain some of the information we need to know about a language. The growth of the Web, the maturation of linguistic process-ing tools, and the decline in price of memory storage allow us to envision de-scriptions of languages that are much larger than before. We can conceive of building a complete language model for a language using all the text that is found on the Web for this language. This article describes our current project to do just that.
Document type :
Conference papers
Complete list of metadata

Cited literature [24 references]  Display  Hide  Download

https://hal.inria.fr/hal-01081036
Contributor : Gregory Grefenstette <>
Submitted on : Thursday, November 6, 2014 - 5:18:13 PM
Last modification on : Thursday, February 9, 2017 - 3:47:19 PM
Long-term archiving on: : Saturday, February 7, 2015 - 11:20:11 AM

File

GrefenstettefinalCICLING.pdf
Files produced by the author(s)

Identifiers

Collections

Citation

Gregory Grefenstette. Conquering Language: Using NLP on a Massive Scale to Build High Dimensional Language Models from the Web. CICLing, Feb 2007, Mexico, Mexico. pp.35 - 49, ⟨10.1007/978-3-540-70939-8_4⟩. ⟨hal-01081036⟩

Share

Metrics

Record views

133

Files downloads

301