Automated Enzyme classification by Formal Concept Analysis

Abstract: Enzymes are molecules whose catalytic activity makes them essential for any biochemical reaction. High-throughput genomic techniques give access to the protein sequences of new enzymes found in living organisms. Predicting an enzyme's functional activity from its sequence is a crucial task that can be approached by comparing new sequences with those of known enzymes labeled with a family class. This task is difficult because the activity depends on a combination of small sequence patterns and because sequences have evolved greatly over time. This paper presents a classifier based on identifying common subsequence blocks between known and new enzymes and searching for formal concepts built on the cross product of blocks and sequences for each class. Since new enzyme families may emerge, it is important to propose a first classification of enzymes that cannot be assigned to a known family. Formal Concept Analysis (FCA) offers a convenient framework for setting the task as an optimization problem on the set of concepts. The classifier has been tested successfully on a particular set of enzymes present in a large variety of species, the haloacid dehalogenase superfamily.
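To make the notion of formal concepts on a cross product of blocks and sequences more concrete, here is a minimal sketch in Python. It is not the authors' classifier: the sequence names, block names, and the brute-force enumeration are assumptions chosen for illustration only. It simply lists every formal concept, that is, every maximal pair (set of sequences, set of blocks) in which each sequence contains each block.

```python
from itertools import combinations

# Hypothetical toy context: which subsequence blocks occur in which sequences.
# All names below are invented for illustration; they do not come from the paper.
CONTEXT = {
    "seq1": {"b1", "b2"},
    "seq2": {"b1", "b2", "b3"},
    "seq3": {"b2", "b3"},
}


def common_blocks(sequences, context):
    """Return the blocks shared by every sequence in `sequences`.

    For an empty collection of sequences this is the set of all blocks.
    """
    result = set().union(*context.values())
    for name in sequences:
        result &= context[name]
    return result


def matching_sequences(blocks, context):
    """Return the sequences that contain every block in `blocks`."""
    return {name for name, present in context.items() if blocks <= present}


def formal_concepts(context):
    """Enumerate all formal concepts (extent, intent) by brute force.

    Every subset of sequences is closed into a concept. This is exponential
    in the number of sequences, so it is only suitable for toy data.
    """
    concepts = set()
    names = sorted(context)
    for size in range(len(names) + 1):
        for subset in combinations(names, size):
            intent = common_blocks(subset, context)
            extent = matching_sequences(intent, context)
            concepts.add((frozenset(extent), frozenset(intent)))
    return concepts
```

Calling `formal_concepts(CONTEXT)` returns a set of `(extent, intent)` pairs of frozensets. A real classifier would need a scalable concept enumeration and a selection criterion over the concepts, neither of which is shown here.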
Document type:
Conference paper
ICFCA - 12th International Conference on Formal Concept Analysis, Jun 2014, Cluj-Napoca, Romania. Springer, 2014

Cited literature [21 references]

https://hal.inria.fr/hal-01063727
Contributor: Gaëlle Garet
Submitted on: Monday, September 15, 2014 - 11:13:19
Last modified on: Wednesday, May 16, 2018 - 11:23:35
Document(s) archived on: Tuesday, December 16, 2014 - 10:26:02

Files

main.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01063727, version 1

Citation

François Coste, Gaëlle Garet, Agnès Groisillier, Jacques Nicolas, Thierry Tonon. Automated Enzyme classification by Formal Concept Analysis. ICFCA - 12th International Conference on Formal Concept Analysis, Jun 2014, Cluj-Napoca, Romania. Springer, 2014. 〈hal-01063727〉
