Abstract : This paper aims at presenting some preliminary results for data driven lemmatization for Italian. Besides intrinsic evaluation for this task, we want to measure its usefulness and adequacy by using our system as input for the task of parsing, following a methodology developed on French. This approach achieves state-of-the-art parsing accuracy without requiring any prior knowledge of the language.
https://hal.inria.fr/hal-00702618
Contributor : Djamé Seddah <>
Submitted on : Wednesday, May 30, 2012 - 6:28:57 PM Last modification on : Saturday, March 28, 2020 - 2:22:29 AM
Djamé Seddah, Joseph Le Roux, Benoît Sagot. Data Driven Lemmatization for Statistical Constituent Parsing of Italian. Proceedings of EVALITA 2011, Jan 2012, Roma, Italy, Italy. ⟨hal-00702618⟩