Identification of Key Features of CNS Drugs Based on SVM and Greedy Algorithm

Author(s): Ruilin Zhang, Yanrui Ding*

Journal Name: Current Computer-Aided Drug Design

Volume 16 , Issue 6 , 2020

Become EABM
Become Reviewer

Graphical Abstract:


Abstract:

Introduction: The research and development of drugs, related to the central nervous system (CNS) diseases is a long and arduous process with high cost, long cycle and low success rate. Identification of key features based on available CNS drugs is of great significance for the discovery of new drugs.

Methods: In this paper, based on the PaDEL descriptors of CNS drugs and non-CNS drugs, a support vector machine (SVM) model was constructed to identify the key features of CNS drugs. Firstly, the random forest algorithm was used to rank descriptors according to the feature significance that contributes to the identification of CNS drugs. Then, a reliable SVM model was constructed, and the optimal combination of descriptors was determined based on greedy algorithm and recursive feature elimination method.

Results: It was found, based on the optimal combination of 40 descriptors, the prediction accuracy of CNS drugs and non-CNS drugs reached 94.2% and 94.4% respectively.

Conclusion: nF11HeteroRing, AATSC3v, SpMin6_Bhi, maxdssC, AATS4v, E1v, E3e, GATS5s, minsOH and minHBint4 are the key features to distinguish between CNS drugs and non-CNS drugs.

Keywords: CNS drugs, support vector machine, greedy algorithm, key features, drug identification, blood-brain barrier.

Rights & PermissionsPrintExport Cite as

Article Details

VOLUME: 16
ISSUE: 6
Year: 2020
Published on: 18 January, 2021
Page: [725 - 733]
Pages: 9
DOI: 10.2174/1573409915666191212095340
Price: $65

Article Metrics

PDF: 12