Knodle: A Support Vector Machines-Based Automatic Perception of Organic Molecules from 3D Coordinates - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue Journal of Chemical Information and Modeling Année : 2016

Knodle: A Support Vector Machines-Based Automatic Perception of Organic Molecules from 3D Coordinates

Résumé

Here we address the problem of the assignment of atom types and bond orders in low molecular weight compounds. For this purpose, we have developed a prediction model based on nonlinear Support Vector Machines (SVM), implemented in a KNOwledge-Driven Ligand Extractor called Knodle, a software library for the recognition of atomic types, hybridization states, and bond orders in the structures of small molecules. We trained the model using an excessive amount of structural data collected from the PDBbindCN database. Accuracy of the results and the running time of our method is comparable with other popular methods, such as NAOMI, fconv, and I-interpret. On the popular Labute’s benchmark set consisting of 179 protein–ligand complexes, Knodle makes five to six perception errors, NAOMI makes seven errors, I-interpret makes nine errors, and fconv makes 13 errors. On a larger set of 3,000 protein–ligand structures collected from the PDBBindCN general data set (v2014), Knodle and NAOMI have a comparable accuracy of approximately 3.9% and 4.7% of errors, I-interpret made 6.0% of errors, while fconv produced approximately 12.8% of errors. On a more general set of 332,974 entries collected from the Ligand Expo database, Knodle made 4.5% of errors. Overall, our study demonstrates the efficiency and robustness of nonlinear SVM in structure perception tasks. Knodle is available at https://team.inria.fr/nano-d/software/Knodle.
Fichier principal
Vignette du fichier
Knodle-AuthorVersion.pdf (1.19 Mo) Télécharger le fichier
Supplementary2.pdf (1.35 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01381010 , version 1 (08-11-2016)

Licence

Paternité - Pas d'utilisation commerciale - Partage selon les Conditions Initiales

Identifiants

Citer

Maria Kadukova, Sergei Grudinin. Knodle: A Support Vector Machines-Based Automatic Perception of Organic Molecules from 3D Coordinates. Journal of Chemical Information and Modeling, 2016, 56 (8), pp.1410-1419. ⟨10.1021/acs.jcim.5b00512⟩. ⟨hal-01381010⟩
632 Consultations
481 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More