Grid-based Evolutionary Strategies Applied to the Conformational Sampling Problem

Benjamin Parent 1 Alexandru-Adrian Tantar 2 Nouredine Melab 3 El-Ghazali Talbi 3, 4 Dragos Horvath 1
2 ALEA - Advanced Learning Evolutionary Algorithms
Inria Bordeaux - Sud-Ouest, UB - Université de Bordeaux, CNRS - Centre National de la Recherche Scientifique : UMR5251
3 DOLPHIN - Parallel Cooperative Multi-criteria Optimization
LIFL - Laboratoire d'Informatique Fondamentale de Lille, Inria Lille - Nord Europe
Abstract: Computational simulations of conformational sampling in general, and of macromolecular folding in particular, represent one of the most important and yet one of the most challenging applications of computer science in biology and medicinal chemistry. The advent of GRID computing may trigger major progress in this field. This paper presents our first attempts to design GRID-based conformational sampling strategies that explore the extremely rugged energy response surface as a function of molecular geometry, searching for low-energy zones in phase spaces of hundreds of degrees of freedom. We have generalized the classical island model deployment of genetic algorithms (GA) to a "planetary" model in which each node of the grid is treated as a "planet" harboring quasi-independent multi-island simulations based on a hybrid GA-driven sampling approach. Although different "planets" do not communicate with each other, thus minimizing inter-CPU exchanges on the GRID, each new simulation benefits from the preliminary knowledge extracted from the centralized pool of already visited geometries, which is located on the dispatcher machine and disseminated to any new "planet". This "panspermic" strategy allows new simulations to be either attracted towards an apparently promising phase space zone (biasing strategies, intensification procedures) or steered away from areas that have already been sampled in depth (tabu areas). Successful folding of mini-proteins typically used in benchmarks for all-atom protein simulations has been observed, although the reproducibility of these highly stochastic simulations in huge problem spaces still needs improvement. Work on two structured peptides (the "tryptophan cage" 1L2Y and the "tryptophan zipper" 1LE1) used as benchmarks for all-atom protein folding simulations has shown that the planetary model is able to reproducibly sample conformers from the neighborhood of the native geometries.
However, within these neighborhoods (that is, within ensembles of conformers similar to models published on the basis of experimental geometry determinations), the energy landscapes are still extremely rugged. Therefore, simulations in general produce "correct" geometries (similar enough to the experimental model for any practical purpose) which unfortunately sometimes correspond to relatively high energy levels and are therefore less stable than the most stable misfolded conformers. The method thus reproducibly visits the native phase space zone, but fails to reproducibly hit the bottom of its rugged energy well. Intensification of local sampling may in principle solve this problem, but it is limited by computational resources. Finding the optimal time point at which a phase space zone should stop being intensively searched and be declared tabu is a very difficult problem that still awaits a practically useful solution.
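The "planetary" scheme described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the Rastrigin function stands in for a molecular force field, planets run sequentially rather than on a grid, and every name and parameter (`dispatcher`, `run_planet`, `run_island`, the tabu radius and penalty, the alternation between biased and tabu planets) is a hypothetical choice made for illustration only.

```python
import math
import random


def energy(x):
    # Toy rugged landscape (Rastrigin function); an assumed stand-in for a force field.
    return 10 * len(x) + sum(xi * xi - 10 * math.cos(2 * math.pi * xi) for xi in x)


def distance(a, b):
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


def mutate(x, rng, step=0.3):
    return [xi + rng.gauss(0.0, step) for xi in x]


def crossover(a, b, rng):
    return [ai if rng.random() < 0.5 else bi for ai, bi in zip(a, b)]


def penalized(x, tabu, radius=0.5, penalty=50.0):
    # Tabu areas: add a fixed penalty near geometries declared already sampled.
    return energy(x) + sum(penalty for t in tabu if distance(x, t) < radius)


def run_island(population, tabu, rng, generations=30):
    # Simple elitist GA: keep the better half, refill with mutated crossovers.
    def fit(x):
        return penalized(x, tabu)

    for _ in range(generations):
        population.sort(key=fit)
        parents = population[: len(population) // 2]
        children = [
            mutate(crossover(rng.choice(parents), rng.choice(parents), rng), rng)
            for _ in range(len(population) - len(parents))
        ]
        population = parents + children
    return sorted(population, key=fit)


def run_planet(seeds, tabu, rng, dim, n_islands=3, pop_size=12, epochs=3):
    # One grid node: several islands with ring migration between epochs.
    islands = []
    for _ in range(n_islands):
        pop = [[rng.uniform(-5, 5) for _ in range(dim)] for _ in range(pop_size)]
        # "Panspermia": inject slightly mutated copies of geometries from the central pool.
        for i, s in enumerate(seeds[: pop_size // 3]):
            pop[i] = mutate(s, rng, 0.1)
        islands.append(pop)
    for _ in range(epochs):
        islands = [run_island(pop, tabu, rng) for pop in islands]
        bests = [pop[0] for pop in islands]
        for i, pop in enumerate(islands):
            pop[-1] = list(bests[i - 1])  # neighbor's best replaces this island's worst
    return [pop[0] for pop in islands]


def dispatcher(n_planets=6, dim=4, seed=0):
    # Central pool of visited geometries; planets never talk to each other.
    rng = random.Random(seed)
    pool = []
    for p in range(n_planets):
        pool.sort(key=lambda entry: entry[0])
        known = [geom for _, geom in pool[:3]]
        if p % 2 == 0:
            seeds, tabu = known, []  # biasing: start near promising zones
        else:
            seeds, tabu = [], known  # tabu: avoid zones already explored
        planet_rng = random.Random(rng.randrange(2**32))
        results = run_planet(seeds, tabu, planet_rng, dim)
        pool.extend((energy(geom), geom) for geom in results)
    return min(pool, key=lambda entry: entry[0])
```

Calling `dispatcher()` returns the lowest-energy `(energy, geometry)` pair found across all planets. In this sketch the only information shared between planets passes through the dispatcher's pool, mirroring the abstract's point that knowledge is disseminated at launch rather than exchanged between running nodes.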
Document type:
Conference paper
Evolutionary Computation, 2007. CEC 2007. IEEE Congress on, Sep 2007, Singapore, Singapore. 2007, 〈10.1109/CEC.2007.4424484〉
Contributor: Ist Rennes
Submitted on: Friday, 13 April 2012 - 13:50:37
Last modified on: Wednesday, 25 April 2018 - 13:04:02



Benjamin Parent, Alexandru-Adrian Tantar, Nouredine Melab, El-Ghazali Talbi, Dragos Horvath. Grid-based Evolutionary Strategies Applied to the Conformational Sampling Problem. Evolutionary Computation, 2007. CEC 2007. IEEE Congress on, Sep 2007, Singapore, Singapore. 2007, 〈10.1109/CEC.2007.4424484〉. 〈hal-00687522〉


