inria-00070340, version 1
A Similar Fragments Merging Approach to Learn Automata on Proteins
François Coste
a, 1Goulven Kerbellec
1
N° RR-5672 (2005)
Résumé : We propose here to learn automata for the characterization of proteins families to overcome the limitations of the position-specific characterizations classically used in Pattern Discovery. We introduce a new heuristic approach learning non-deterministic automata based on selection and ordering of significantly similar fragments to be merged and on physico-chemical properties identification. Quality of the characterization of the major intrinsic protein (MIP) family is assessed by leave-one-out cross-validation for a large range of models specificity.
- a – INRIA
- 1 : SYMBIOSE (INRIA - IRISA)
- CNRS : UMR6074 – INRIA – INSA Rennes – Université de Rennes 1
- Domaine : Informatique/Autre
- Mots-clés : GRAMMATICAL INFERENCE / AUTOMATA / PROTEINS
- Référence interne : RR-5672
- inria-00070340, version 1
- http://hal.inria.fr/inria-00070340
- oai:hal.inria.fr:inria-00070340
- Contributeur : Rapport De Recherche Inria
- Soumis le : Vendredi 19 Mai 2006, 20:10:33
- Dernière modification le : Jeudi 18 Janvier 2007, 11:36:27






Documents associés

Exporter