sign in
english version rss feed

inria-00070340, version 1

A Similar Fragments Merging Approach to Learn Automata on Proteins

François Coste () a1, Goulven Kerbellec () 1

N° RR-5672 (2005)

Abstract: We propose here to learn automata for the characterization of proteins families to overcome the limitations of the position-specific characterizations classically used in Pattern Discovery. We introduce a new heuristic approach learning non-deterministic automata based on selection and ordering of significantly similar fragments to be merged and on physico-chemical properties identification. Quality of the characterization of the major intrinsic protein (MIP) family is assessed by leave-one-out cross-validation for a large range of models specificity.

  • a –  INRIA
  • 1:  SYMBIOSE (INRIA - IRISA)
  • CNRS : UMR6074 – INRIA – INSA Rennes – Université de Rennes 1
  • Domain : Computer Science/Other
  • Keywords : GRAMMATICAL INFERENCE / AUTOMATA / PROTEINS
  • Internal note : RR-5672
 
  • inria-00070340, version 1
  • oai:hal.inria.fr:inria-00070340
  • From: 
  • Submitted on: Friday, 19 May 2006 20:10:33
  • Updated on: Thursday, 18 January 2007 11:36:27
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...