Skip to Main content Skip to Navigation
New interface
Journal articles

Inferring inflection classes with description length

Abstract : We discuss the notion of an inflection class system, a traditional ingredient of the description of inflection systems of nontrivial complexity. We distinguish systems of microclasses, which partition a set of lexemes in classes with identical behavior, and systems of macroclasses, which group lexemes that are similar enough in a few larger classes. On the basis of the intuition that macroclasses should contribute to a concise description of the system, we propose one algorithmic method for inferring macroclasses from raw inflectional paradigms, based on minimisation of the description length of the system under a given strategy of identifying morphological alternations in paradigms. We then exhibit classifications produced by our implementation on French and European Portuguese conjugation data and argue that they constitute an appropriate systematisation of traditional classifications. To arrive at such a convincing systematisation, it was crucial for us to use a local approach to inflection class similarity (based on pairwise comparisons of paradigm cells) rather than a global approach (based on the simultaneous comparison of all cells). We conclude that it is indeed possible to infer inflectional macroclasses objectively.
Complete list of metadata

Cited literature [50 references]  Display  Hide  Download
Contributor : Benoît Sagot Connect in order to contact the contributor
Submitted on : Tuesday, February 27, 2018 - 5:36:03 PM
Last modification on : Wednesday, June 8, 2022 - 12:50:06 PM
Long-term archiving on: : Monday, May 28, 2018 - 1:28:18 PM


Files produced by the author(s)


  • HAL Id : hal-01718879, version 1


Sacha Beniamine, Olivier Bonami, Benoît Sagot. Inferring inflection classes with description length. Journal of Language Modelling, 2018, 5 (3), pp.465-525. ⟨hal-01718879⟩



Record views


Files downloads