Knodle: A Support Vector Machines-Based Automatic Perception of Organic Molecules from 3D Coordinates

Maria Kadukova 1, 2 Sergei Grudinin 1
1 NANO-D - Algorithms for Modeling and Simulation of Nanosystems
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : Here we address the problem of the assignment of atom types and bond orders in low molecular weight compounds. For this purpose, we have developed a prediction model based on nonlinear Support Vector Machines (SVM), implemented in a KNOwledge-Driven Ligand Extractor called Knodle, a software library for the recognition of atomic types, hybridization states, and bond orders in the structures of small molecules. We trained the model using an excessive amount of structural data collected from the PDBbindCN database. Accuracy of the results and the running time of our method is comparable with other popular methods, such as NAOMI, fconv, and I-interpret. On the popular Labute’s benchmark set consisting of 179 protein–ligand complexes, Knodle makes five to six perception errors, NAOMI makes seven errors, I-interpret makes nine errors, and fconv makes 13 errors. On a larger set of 3,000 protein–ligand structures collected from the PDBBindCN general data set (v2014), Knodle and NAOMI have a comparable accuracy of approximately 3.9% and 4.7% of errors, I-interpret made 6.0% of errors, while fconv produced approximately 12.8% of errors. On a more general set of 332,974 entries collected from the Ligand Expo database, Knodle made 4.5% of errors. Overall, our study demonstrates the efficiency and robustness of nonlinear SVM in structure perception tasks. Knodle is available at https://team.inria.fr/nano-d/software/Knodle.
Type de document :
Article dans une revue
Journal of Chemical Information and Modeling, American Chemical Society, 2016, 56 (8), pp.1410-1419. 〈http://pubs.acs.org/doi/abs/10.1021/acs.jcim.5b00512〉. 〈10.1021/acs.jcim.5b00512〉
Liste complète des métadonnées

Littérature citée [26 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01381010
Contributeur : Nano-D Equipe <>
Soumis le : mardi 8 novembre 2016 - 17:42:21
Dernière modification le : mercredi 11 avril 2018 - 01:58:25
Document(s) archivé(s) le : mardi 14 mars 2017 - 23:32:25

Licence


Distributed under a Creative Commons Paternité - Pas d'utilisation commerciale - Partage selon les Conditions Initiales 4.0 International License

Identifiants

Collections

Citation

Maria Kadukova, Sergei Grudinin. Knodle: A Support Vector Machines-Based Automatic Perception of Organic Molecules from 3D Coordinates. Journal of Chemical Information and Modeling, American Chemical Society, 2016, 56 (8), pp.1410-1419. 〈http://pubs.acs.org/doi/abs/10.1021/acs.jcim.5b00512〉. 〈10.1021/acs.jcim.5b00512〉. 〈hal-01381010〉

Partager

Métriques

Consultations de la notice

824

Téléchargements de fichiers

292