Smiles2Monomers: a link between chemical and biological structures for polymers - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue Journal of Cheminformatics Année : 2015

Smiles2Monomers: a link between chemical and biological structures for polymers

Résumé

Background: The monomeric composition of polymers is powerful for structure comparison and synthetic biology , among others. Many databases give access to the atomic structure of compounds but the monomeric structure of polymers is often lacking. We have designed a smart algorithm, implemented in the tool Smiles2Monomers (s2m), to infer efficiently and accurately the monomeric structure of a polymer from its chemical structure. Results: Our strategy is divided into two steps: first, monomers are mapped on the atomic structure by an efficient subgraph-isomorphism algorithm ; second, the best tiling is computed so that non-overlapping monomers cover all the structure of the target polymer. The mapping is based on a Markovian index built by a dynamic programming algorithm. The index enables s2m to search quickly all the given monomers on a target polymer. After, a greedy algorithm combines the mapped monomers into a consistent monomeric structure. Finally, a local branch and cut algorithm refines the structure. We tested this method on two manually annotated databases of polymers and reconstructed the structures de novo with a sensitivity over 90 %. The average computation time per polymer is 2 s. Conclusion: s2m automatically creates de novo monomeric annotations for polymers, efficiently in terms of time computation and sensitivity. s2m allowed us to detect annotation errors in the tested databases and to easily find the accurate structures. So, s2m could be integrated into the curation process of databases of small compounds to verify the current entries and accelerate the annotation of new polymers. The full method can be downloaded or accessed via a website for peptide-like polymers at http://bioinfo.lifl.fr/norine/smiles2monomers.jsp.
Fichier principal
Vignette du fichier
s13321-015-0111-5.pdf (1.4 Mo) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-01250619 , version 1 (05-01-2016)

Identifiants

Citer

Yoann Dufresne, Laurent Noé, Valérie Leclère, Maude Pupin. Smiles2Monomers: a link between chemical and biological structures for polymers. Journal of Cheminformatics, 2015, 7, ⟨10.1186/s13321-015-0111-5⟩. ⟨hal-01250619⟩
729 Consultations
142 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More