Conference papers

Combining Protein Secondary Structure Prediction Models with Ensemble Methods of Optimal Complexity

Yann Guermeur (1), Dominique Zelus
(1) MODBIO - Computational models in molecular biology, INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : The idea of combining models instead of simply selecting the "best" one, in order to improve performance, has a long theoretical background in statistics. However, theoretical results are ordinarily based on strong hypotheses that are seldom satisfied in practice. When dealing with real-world problems, overfitting is often the main limitation, and it can only be overcome through strict control of the complexity of the selected combiner. SVMs should thus be well suited to these difficult situations. Investigating this idea, we introduce a new family of multi-class SVMs and assess them as ensemble methods for protein secondary structure prediction. Experimental evidence highlights the gain in prediction accuracy obtained by combining some of the current best prediction methods with our SVMs rather than with the combiners traditionally used in the field.
Document type : Conference papers

Contributor : Publications Loria
Submitted on : Tuesday, September 26, 2006 - 2:56:27 PM
Last modification on : Friday, February 4, 2022 - 3:31:25 AM


HAL Id : inria-00101092, version 1



Yann Guermeur, Dominique Zelus. Combining Protein Secondary Structure Prediction Models with Ensemble Methods of Optimal Complexity. Journées Ouvertes Biologie Informatique Mathématiques - JOBIM'2001, INRA Toulouse, 2001, Toulouse, France, pp.97-104. ⟨inria-00101092⟩


