Inferring inflection classes with description length

Abstract : We discuss the notion of an inflection class system, a traditional ingredient of the description of inflection systems of nontrivial complexity. We distinguish systems of microclasses, which partition a set of lexemes in classes with identical behavior, and systems of macroclasses, which group lexemes that are similar enough in a few larger classes. On the basis of the intuition that macroclasses should contribute to a concise description of the system, we propose one algorithmic method for inferring macroclasses from raw inflectional paradigms, based on minimisation of the description length of the system under a given strategy of identifying morphological alternations in paradigms. We then exhibit classifications produced by our implementation on French and European Portuguese conjugation data and argue that they constitute an appropriate systematisation of traditional classifications. To arrive at such a convincing systematisation, it was crucial for us to use a local approach to inflection class similarity (based on pairwise comparisons of paradigm cells) rather than a global approach (based on the simultaneous comparison of all cells). We conclude that it is indeed possible to infer inflectional macroclasses objectively.
Complete list of metadatas

Cited literature [50 references]  Display  Hide  Download

https://hal.inria.fr/hal-01718879
Contributor : Benoît Sagot <>
Submitted on : Tuesday, February 27, 2018 - 5:36:03 PM
Last modification on : Thursday, February 21, 2019 - 12:52:02 PM
Long-term archiving on : Monday, May 28, 2018 - 1:28:18 PM

File

184-1460-1-PB.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01718879, version 1

Collections

Citation

Sacha Beniamine, Olivier Bonami, Benoît Sagot. Inferring inflection classes with description length. Journal of Language Modelling, Institute of Computer Science, Polish Academy of Sciences, Poland, 2018, 5 (3), pp.465-525. ⟨hal-01718879⟩

Share

Metrics

Record views

343

Files downloads

232