Skip to Main content Skip to Navigation


Abstract : We present CROC (Coreference Resolution for Oral Corpus), the first machine learning system for coreference resolution in French. One specific aspect of the system is that it has been trained on data that are exclusively oral, namely ANCOR (ANaphora and Coreference in ORal corpus), the first corpus in oral French with anaphorical relations annotations. In its current state, the CROC system requires pre-annotated mentions. We detail the features that we chose to be used by the learning algorithms, and we present a set of experiments with these features. The scores we obtain are close to those of state-of-the-art systems for written English. Then we give future works on the design of an end-to-end system for oral and written French.
Complete list of metadata


Present sur SoftwareHeritage
Contributor : Clément Plancq <>
Submitted on : Thursday, July 12, 2018 - 10:20:59 AM
Last modification on : Thursday, July 1, 2021 - 5:46:02 PM




Record views


Files downloads