inria-00070340, version 1
A Similar Fragments Merging Approach to Learn Automata on Proteins
François Coste
a, 1Goulven Kerbellec
1
N° RR-5672 (2005)
Abstract: We propose here to learn automata for the characterization of proteins families to overcome the limitations of the position-specific characterizations classically used in Pattern Discovery. We introduce a new heuristic approach learning non-deterministic automata based on selection and ordering of significantly similar fragments to be merged and on physico-chemical properties identification. Quality of the characterization of the major intrinsic protein (MIP) family is assessed by leave-one-out cross-validation for a large range of models specificity.
- a – INRIA
- 1: SYMBIOSE (INRIA - IRISA)
- CNRS : UMR6074 – INRIA – INSA Rennes – Université de Rennes 1
- Domain : Computer Science/Other
- Keywords : GRAMMATICAL INFERENCE / AUTOMATA / PROTEINS
- Internal note : RR-5672
- inria-00070340, version 1
- http://hal.inria.fr/inria-00070340
- oai:hal.inria.fr:inria-00070340
- From: Rapport De Recherche Inria
- Submitted on: Friday, 19 May 2006 20:10:33
- Updated on: Thursday, 18 January 2007 11:36:27






Associated documents

Export