Building a treebank of noisy user-generated content: The French Social Media Bank - Inria - Institut national de recherche en sciences et technologies du numérique Access content directly
Conference Papers Year : 2012

Building a treebank of noisy user-generated content: The French Social Media Bank

Abstract

We introduce the French Social Media Bank, the first user-generated content treebank for French. Its first release contains 1,700 sentences from various Web 2.0 and social media sources (FACEBOOK, TWITTER, web forums), including data specifically chosen for their high noisiness.
No file

Dates and versions

hal-00780898 , version 1 (25-01-2013)

Identifiers

  • HAL Id : hal-00780898 , version 1

Cite

Djamé Seddah, Benoît Sagot, Marie Candito, Virginie Mouilleron, Vanessa Combet. Building a treebank of noisy user-generated content: The French Social Media Bank. TLT 11 - The 11th International Workshop on Treebanks and Linguistic Theories, Nov 2012, Lisbonne, Portugal. ⟨hal-00780898⟩
226 View
0 Download

Share

Gmail Facebook X LinkedIn More