
Smiles2Monomers: a link between chemical and biological structures for polymers

Yoann Dufresne (1, 2), Laurent Noé (1, 2), Valérie Leclère (3, 1, 2), Maude Pupin (1, 2)
2 BONSAI - Bioinformatics and Sequence Analysis
Université de Lille, Sciences et Technologies, Inria Lille - Nord Europe, CRIStAL - Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189, CNRS - Centre National de la Recherche Scientifique
Abstract:

Background: The monomeric composition of polymers is a powerful basis for structure comparison and synthetic biology, among other applications. Many databases give access to the atomic structure of compounds, but the monomeric structure of polymers is often lacking. We have designed a smart algorithm, implemented in the tool Smiles2Monomers (s2m), to infer efficiently and accurately the monomeric structure of a polymer from its chemical structure.

Results: Our strategy is divided into two steps: first, monomers are mapped onto the atomic structure by an efficient subgraph-isomorphism algorithm; second, the best tiling is computed so that non-overlapping monomers cover the whole structure of the target polymer. The mapping is based on a Markovian index built by a dynamic programming algorithm. The index enables s2m to search quickly for all the given monomers in a target polymer. Next, a greedy algorithm combines the mapped monomers into a consistent monomeric structure. Finally, a local branch-and-cut algorithm refines the structure. We tested this method on two manually annotated databases of polymers and reconstructed the structures de novo with a sensitivity over 90%. The average computation time per polymer is 2 s.

Conclusion: s2m automatically creates de novo monomeric annotations for polymers, efficiently in terms of computation time and sensitivity. s2m allowed us to detect annotation errors in the tested databases and to easily find the accurate structures. s2m could therefore be integrated into the curation process of databases of small compounds to verify the current entries and accelerate the annotation of new polymers. The full method can be downloaded or accessed via a website for peptide-like polymers at http://bioinfo.lifl.fr/norine/smiles2monomers.jsp.
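The greedy tiling step described in the abstract can be illustrated with a small sketch. This is not the s2m implementation: it assumes the mapping step has already produced candidate matches, each represented here as a hypothetical (monomer name, set of atom indices) pair, and it uses a simple "largest match first" rule that the abstract does not specify. The Markovian index and the branch-and-cut refinement are not shown.

```python
from typing import FrozenSet, List, Tuple

# Hypothetical representation: (monomer name, indices of the polymer atoms it covers)
Match = Tuple[str, FrozenSet[int]]


def greedy_tiling(matches: List[Match], n_atoms: int) -> Tuple[List[Match], float]:
    """Select non-overlapping matches, largest first.

    Returns the selected matches and the fraction of polymer atoms covered.
    The ordering rule is an assumption made for this illustration.
    """
    covered: set = set()
    tiling: List[Match] = []
    # Sort by decreasing size; break ties by name, then atom indices, for determinism.
    ordered = sorted(matches, key=lambda m: (-len(m[1]), m[0], sorted(m[1])))
    for name, atoms in ordered:
        # Keep a match only if it shares no atom with the matches already kept.
        if covered.isdisjoint(atoms):
            tiling.append((name, atoms))
            covered |= atoms
    return tiling, len(covered) / n_atoms


# Toy polymer with 10 atoms (indices 0..9) and four overlapping candidate matches.
matches = [
    ("Ala", frozenset({0, 1, 2, 3})),
    ("Gly", frozenset({3, 4, 5})),
    ("Val", frozenset({4, 5, 6, 7, 8, 9})),
    ("Ser", frozenset({0, 1, 2})),
]
tiling, coverage = greedy_tiling(matches, n_atoms=10)
print([name for name, _ in tiling], coverage)
```

A greedy pass like this is fast but can miss the best tiling when a large match blocks several smaller ones that together would cover more atoms, which is presumably why the abstract follows it with a local refinement step.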
Document type: Journal articles

Cited literature [35 references]

https://hal.inria.fr/hal-01250619
Contributor: Maude Pupin
Submitted on: Tuesday, January 5, 2016 - 9:45:26 AM
Last modification on: Friday, January 8, 2021 - 3:14:06 PM
Long-term archiving on: Thursday, April 7, 2016 - 2:59:47 PM

File

s13321-015-0111-5.pdf
Publisher files allowed on an open archive

Identifiers

HAL Id: hal-01250619
DOI: 10.1186/s13321-015-0111-5

Citation

Yoann Dufresne, Laurent Noé, Valérie Leclère, Maude Pupin. Smiles2Monomers: a link between chemical and biological structures for polymers. Journal of Cheminformatics, Chemistry Central Ltd. and BioMed Central, 2015, 7, ⟨10.1186/s13321-015-0111-5⟩. ⟨hal-01250619⟩
