https://hal.inria.fr/hal-01702810Popov, PetrPetrPopovMIPT - Moscow Institute of Physics and Technology [Moscow]Grudinin, SergeiSergeiGrudininNANO-D - Algorithms for Modeling and Simulation of Nanosystems - Inria Grenoble - Rhône-Alpes - Inria - Institut National de Recherche en Informatique et en Automatique - Grenoble INP - Institut polytechnique de Grenoble - Grenoble Institute of Technology - LJK - Laboratoire Jean Kuntzmann - Grenoble INP - Institut polytechnique de Grenoble - Grenoble Institute of Technology - Inria - Institut National de Recherche en Informatique et en Automatique - CNRS - Centre National de la Recherche Scientifique - UGA [2016-2019] - Université Grenoble Alpes [2016-2019]Eurecon: Equidistant Uniform Rigid-body Ensemble ConstructorHAL CCSD2018[INFO.INFO-BI] Computer Science [cs]/Bioinformatics [q-bio.QM][INFO.INFO-CG] Computer Science [cs]/Computational Geometry [cs.CG]Grudinin, Sergei2018-02-26 17:28:232023-03-24 14:53:062018-02-26 18:37:21enJournal articleshttps://hal.inria.fr/hal-01702810/document10.1016/j.jmgm.2018.01.015application/pdf1Conformational ensembles comprise one of the fundamental concepts in statistical bioinformatics and appear in a variety of applications, e.g. molecular docking, virtual screening, searching for pharmacophores, etc. High-throughput applications require billions of conformations to be considered, thus, one often uses the rigid-body representation of molecules or its fragments to cope with the computational cost. Of particular interest is generation of the near-native conformational ensembles, which consist of conformations structurally close to the biologically relevant ones. One possible way to compose such ensembles is to control the root mean square deviation (RMSD) between the original and the generated conformations. To the best of our knowledge there is no computational approach that guarantees that all the generated conformations have the desired RMSD with respect to the reference structure. In this study we presented a fast algorithm for the construction of rigid-body conformational ensembles, which possess two main properties: (i) each generated conformation has a fixed RMSD with respect to the original conformation, (ii) generated conformations are distributed uniformly over the sphere of axes corresponding to the rigid-body motions. The algorithm is very efficient, it does not require any standard RMSD computation between the conformations and has the O(N + M) complexity to generate the required rigid-body transforms, where N is the number of atoms in the system, and M is the size of the conformational ensemble. Eurecon is applicable to an arbitrary atomic system, thus, it could be used for molecular systems of various size and type. We demonstrated Eurecon application by generating near-native conformational ensembles for a ligand placed inside a binding site, a protein dimer embedded into a membrane, and a ribosomal complex. We implemented the developed algorithm in C++ and called it Eurecon, which stands for Equidistant Uniform Rigid-body Ensemble CONstructor. A user-friendly interface allows to define the desired RMSD value, the relative amplitudes for rotation and translation motions by means of the partition parameter, and the set of axes corresponding to the rigid-body motions. Eurecon is available as the SAMSON Element (https://samson-connect.net).