Dependency Distances and Their Frequencies in Indo-European Language - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue Journal of Quantitative Linguistics Année : 2020

Dependency Distances and Their Frequencies in Indo-European Language

Résumé

The present study investigates the relationship between two features of dependencies, namely, dependency distances and dependency frequencies. The study is based on the analysis of a parallel dependency treebank that includes 10 Indo-European languages. Two corresponding random dependency treebanks are generated as baselines for comparison. After computing the values of dependency distances and their frequencies in these treebanks, for each lan-guage, we fit four functions, namely quadratic, exponent, logarithm, and power-law func-tions, to its original and random datasets. The preliminary result shows that there is a rela-tion between the two dependency features for all 10 Indo-European languages. The relation can be further formalized as a power-law function which can distinguish the observed data from randomly generated datasets.

Domaines

Linguistique
Fichier non déposé

Dates et versions

hal-03168332 , version 1 (12-03-2021)

Identifiants

Citer

Xinying Chen, Kim Gerdes. Dependency Distances and Their Frequencies in Indo-European Language. Journal of Quantitative Linguistics, 2020, pp.1-20. ⟨10.1080/09296174.2020.1771135⟩. ⟨hal-03168332⟩
50 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More