The Logoscope: a Semi-Automatic Tool for Detecting and Documenting French New Words From the Linguistic Project to the Web Interface

Abstract : In this article we present the design and implementation of the Logoscope, the first tool especially developed to detect new words of the French language, to document them and allow a public access through a web interface. This semi-automatic tool collects new words daily by browsing the online versions of French well known newspapers such as Le Monde, Le Figaro, L'Équipe, Libération, La Croix, Les Échos. In contrast to other existing tools essentially dedicated to dictionary development, the Logoscope attempts to give a more complete account of the context in which the new words occur. In addition to the commonly given morpho-syntactic information it also provides information about the textual and discursive contexts of the word creation; in particular, it automatically determines the (journalistic) topics of the text containing the new word. In this article we first give a general overview of the developed tool. We then describe the approach taken, we discuss the linguistic background which guided our design decisions and present the computational methods we used to implement it.
Complete list of metadatas

Cited literature [52 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01896796
Contributor : Ingrid Falk <>
Submitted on : Tuesday, October 16, 2018 - 2:49:26 PM
Last modification on : Thursday, February 7, 2019 - 4:53:31 PM
Long-term archiving on : Thursday, January 17, 2019 - 2:27:26 PM

File

logo_manuscript.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01896796, version 1

Collections

Citation

Ingrid Falk, Delphine Bernhard, Christophe Gérard. The Logoscope: a Semi-Automatic Tool for Detecting and Documenting French New Words From the Linguistic Project to the Web Interface. [Research Report] Université Strasbourg. 2018. ⟨hal-01896796⟩

Share

Metrics

Record views

57

Files downloads

67