Measuring the evolution of ontology complexity: the gene ontology case study

Abstract : Ontologies support automatic sharing, combination and analysis of life sciences data. They undergo regular curation and enrichment. We studied the impact of an ontology evolution on its structural complexity. As a case study we used the sixty monthly releases between January 2008 and December 2012 of the Gene Ontology and its three independent branches, i.e. biological processes (BP), cellular components (CC) and molecular functions (MF). For each case, we measured complexity by computing metrics related to the size, the nodes connectivity and the hierarchical structure. The number of classes and relations increased monotonously for each branch, with different growth rates. BP and CC had similar connectivity, superior to that of MF. Connectivity increased monotonously for BP, decreased for CC and remained stable for MF, with a marked increase for the three branches in November and December 2012. Hierarchy-related measures showed that CC and MF had similar proportions of leaves, average depths and average heights. BP had a lower proportion of leaves, and a higher average depth and average height. For BP and MF, the late 2012 increase of connectivity resulted in an increase of the average depth and average height and a decrease of the proportion of leaves, indicating that a major enrichment effort of the intermediate-level hierarchy occurred. The variation of the number of classes and relations in an ontology does not provide enough information about the evolution of its complexity. However, connectivity and hierarchy-related metrics revealed different patterns of values as well as of evolution for the three branches of the Gene Ontology. CC was similar to BP in terms of connectivity, and similar to MF in terms of hierarchy. Overall, BP complexity increased, CC was refined with the addition of leaves providing a finer level of annotations but decreasing slightly its complexity, and MF complexity remained stable.
Type de document :
Article dans une revue
PLoS ONE, Public Library of Science, 2013, 8 (10), pp.e75993. 〈10.1371/journal.pone.0075993〉
Liste complète des métadonnées
Contributeur : Olivier Dameron <>
Soumis le : lundi 28 octobre 2013 - 13:33:39
Dernière modification le : mercredi 11 avril 2018 - 02:00:46

Lien texte intégral



Olivier Dameron, Charles Bettembourg, Nolwenn Le Meur. Measuring the evolution of ontology complexity: the gene ontology case study. PLoS ONE, Public Library of Science, 2013, 8 (10), pp.e75993. 〈10.1371/journal.pone.0075993〉. 〈hal-00877378〉



Consultations de la notice