Skip to Main content Skip to Navigation
Conference papers

Combining Protein Secondary Structure Prediction Models with Ensemble Methods of Optimal Complexity

Yann Guermeur 1 Dominique Zelus
1 MODBIO - Computational models in molecular biology
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : The idea of combining models instead of simply selecting the ``best'' one, in order to improve performance, has a long theoretical background in statistics. However, theoretical results are ordinarily based on strong hypotheses, seldom satisfied in practice. When dealing with real-world problems, overfitting is often the main limitation, which cannot be overcome but with a strict complexity control of the combiner selected. SVMs should thus be well suited for these difficult situations. Investigating this idea, we introduce a new family of multi-class SVMs, and assess them as ensemble methods for protein secondary structure prediction. Experimental evidence highlights the gain in prediction accuracy resulting from combining some of the current best prediction methods with our SVMs rather than with the combiners traditionally used in the field.
Document type :
Conference papers
Complete list of metadata

https://hal.inria.fr/inria-00101092
Contributor : Publications Loria <>
Submitted on : Tuesday, September 26, 2006 - 2:56:27 PM
Last modification on : Friday, February 26, 2021 - 3:28:04 PM

Identifiers

  • HAL Id : inria-00101092, version 1

Collections

Citation

Yann Guermeur, Dominique Zelus. Combining Protein Secondary Structure Prediction Models with Ensemble Methods of Optimal Complexity. Journées Ouvertes Biologie Informatique Mathématiques - JOBIM'2001, INRA Toulouse, 2001, Toulouse, France, pp.97-104. ⟨inria-00101092⟩

Share

Metrics

Record views

158