Building a treebank of noisy user-generated content: The French Social Media Bank

Abstract : We introduce the French Social Media Bank, the first user-generated content treebank for French. Its first release contains 1,700 sentences from various Web 2.0 and social media sources (FACEBOOK, TWITTER, web forums), including data specifically chosen for their high noisiness.
Document type :
Conference papers
Complete list of metadatas

https://hal.inria.fr/hal-00780898
Contributor : Djamé Seddah <>
Submitted on : Friday, January 25, 2013 - 1:48:14 AM
Last modification on : Thursday, August 29, 2019 - 2:24:03 PM

Identifiers

  • HAL Id : hal-00780898, version 1

Citation

Djamé Seddah, Benoît Sagot, Marie Candito, Virginie Mouilleron, Vanessa Combet. Building a treebank of noisy user-generated content: The French Social Media Bank. TLT 11 - The 11th International Workshop on Treebanks and Linguistic Theories, Nov 2012, Lisbonne, Portugal. ⟨hal-00780898⟩

Share

Metrics

Record views

319