Generic placeholder image

Current Pharmaceutical Design


ISSN (Print): 1381-6128
ISSN (Online): 1873-4286

Review Article

Application of Artificial Intelligence in Drug Discovery

Author(s): Hitesh Chopra, Atif A. Baig, Rupesh K. Gautam* and Mohammad A. Kamal*

Volume 28, Issue 33, 2022

Published on: 22 June, 2022

Page: [2690 - 2703] Pages: 14

DOI: 10.2174/1381612828666220608141049

Price: $65


Due to the heap of data sets available for drug discovery, modern drug discovery has taken the shape of big data. Usage of Artificial intelligence (AI) can help to modify drug discovery based on big data to precised, knowledgeable data. The pharmaceutical companies have already geared their departments for this and started a race to search for new novel drugs. The AI helps to predict the molecular structure of the compound and its in-vivo vs. in-vitro characteristics without hampering life, thus saving time and economic loss. Clinical studies, electronic records, and images act as a helping hand for the development. The data mining and curation techniques help explore the data with a single click. AI in big data analysis has paved the red carpet for future rational drug development and optimization. This review's objective is to familiarise readers with various advances in the AI field concerning software, firms, and other tools working in easing out the labor of the drug discovery journey.

Keywords: Artificial intelligence, drug discovery, high-throughput screening, electronic records, molecular docking, machine learning, deep learning.

Atanassova I, Bertin M, Mayr P. Editorial: Mining scientific papers: NLP-enhanced bibliometrics. Front Res Metr Anal 2019; 4: 2.
[] [PMID: 33870034]
Hughes JP, Rees S, Kalindjian SB, Philpott KL. Principles of early drug discovery. Br J Pharmacol 2011; 162(6): 1239-49.
[] [PMID: 21091654]
Lander ES, Linton LM, Birren B, et al. Initial sequencing and analysis of the human genome. Nature 2001; 409(6822): 860-921.
[] [PMID: 11237011]
Szymański P, Markowicz M, Mikiciuk-Olasik E. Adaptation of high-throughput screening in drug discovery-toxicological screening tests. Int J Mol Sci 2012; 13(1): 427-52.
[] [PMID: 22312262]
Lionta E, Spyrou G, Vassilatis DK, Cournia Z. Structure-based virtual screening for drug discovery: Principles, applications and recent advances. Curr Top Med Chem 2014; 14(16): 1923-38.
[] [PMID: 25262799]
Pinzi L, Rastelli G. Molecular docking: Shifting paradigms in drug discovery. Int J Mol Sci 2019; 20(18): 1-23.
[] [PMID: 31487867]
3 ways big data and artificial intelligence revolutionize drug discovery | BioPharmaTrend. Available from: (Accessed on September 30, 2021).
Why drug designers will be at a disadvantage without AI. Available from: (Accessed on September 30, 2021).
Cloud pharmaceuticals CEO and CSO to speak at AI pharma innovation, July 26-27, 2017 in Boston - Cloud Pharmaceuticals. Available from: (Accessed on October 2, 2021).
Atomwise finds first evidence towards new ebola treatments – Atomwise. Available from: (Accessed on September 30, 2021).
Artificial intelligence helps find new drugs: Better, faster, cheaper BioPharmaTrend. Available from: (Accessed on September 30, 2021).
AlphaFold: A solution to a 50-year-old grand challenge in biology DeepMind Available from: (Accessed on September 30, 2021).
DeepChem. Available from: (Accessed on September 30, 2021).
Wójcikowski M, Zielenkiewicz P, Siedlecki P. Open Drug Discovery Toolkit (ODDT): A new open-source player in the drug discovery field. J Cheminform 2015; 7: 26.
Harris CR, Millman KJ, van der Walt SJ, et al. Array programming with NumPy. Nature 2020; 585(7825): 357-62.
[] [PMID: 32939066]
Jones G, Willett P, Glen RC. Molecular recognition of receptor sites using a genetic algorithm with a description of desolvation. J Mol Biol 1995; 245(1): 43-53.
[] [PMID: 7823319]
Trott O, Olson AJ. AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J Comput Chem 2010; 31(2): 455-61.
[PMID: 19499576]
Morris GM, Huey R, Lindstrom W, et al. AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. J Comput Chem 2009; 30(16): 2785-91.
[] [PMID: 19399780]
Drug discovery with an AI-augmented platform-Cyclica. Available from: (Accessed on October 2, 2021).
Toronto’s AI-vendor Cyclica Inks Strategic Collaboration with Elite Chinese Academic Research Center targeting COVID-19. Available from: (Accessed on October 2, 2021).
China’s Institute of Materia Medica Partners With Cyclica on Innovative Drug Repurposing for COVID-19 | Business Wire. Available from:’s-Institute-of-Materia-Medica-Partners-With-Cyclica-on-Innovative-Drug-Repurposing-for-COVID-19 (Accessed on October 2, 2021).
Using Cyclica’s technology to identify repurposed drug candidates for COVID-19 — Cyclica. Available from: (Accessed on October 2, 2021).
Brereton AE, MacKinnon S, Safikhani Z, et al. Predicting drug properties with parameter-free machine learning: Pareto-optimal embedded modeling (POEM). Mach Learn Sci Technol 2020; 1: 025008.
Exscientia | AI Drug Discovery | Pharmatech. Available from: (Accessed on October 2, 2021).
3-part Study to Assess Safety, Tolerability, Pharmacokinetics and Pharmacodynamics of EXS21546. Available from: (Accessed on October 2, 2021).
Minnich AJ, McLoughlin K, Tse M, et al. AMPL: A data-driven modeling pipeline for drug discovery. J Chem Inf Model 2020; 60(4): 1955-68.
[] [PMID: 32243153]
Berthold MR, Cebron N, Dill F, et al. KNIME-the konstanz information miner: Version 2.0 and beyond. SIGKDD Explor 2009; 11(1): 26-31.
Life sciences and material sciences | BIOVIA – Dassault systèmes. Available from: (Accessed on October 3, 2021).
Schenone M, Dančík V, Wagner BK, Clemons PA. Target identification and mechanism of action in chemical biology and drug discovery. Nat Chem Biol 2013; 9(4): 232-40.
[] [PMID: 23508189]
Lee J, Bogyo M. Target deconvolution techniques in modern phenotypic profiling. Curr Opin Chem Biol 2013; 17(1): 118-26.
[] [PMID: 23337810]
Yang X, Wang Y, Byrne R, Schneider G, Yang S. Concepts of artificial intelligence for computer-assisted drug discovery. Chem Rev 2019; 119(18): 10520-94.
[] [PMID: 31294972]
Goh GB, Hodas NO, Siegel C, Vishnu A. SMILES2Vec: An interpretable general-purpose deep neural network for predicting chemical properties. arXiv preprin 2017.
Parveen A, Mustafa SH, Yadav P, Kumar A. Applications of machine learning in miRNA discovery and target prediction. Curr Genomics 2019; 20(8): 537-44.
[] [PMID: 32581642]
Maia EHB, Assis LC, de Oliveira TA, da Silva AM, Taranto AG. Structure-based virtual screening: From classical to artificial intelligence. Front Chem 2020; 8: 343.
[] [PMID: 32411671]
Vamathevan J, Clark D, Czodrowski P, et al. Applications of machine learning in drug discovery and development. Nat Rev Drug Discov 2019; 18(6): 463-77.
[] [PMID: 30976107]
Carpenter KA, Huang X. Machine learning-based virtual screening and its applications to Alzheimer’s drug Discovery: A review. Curr Pharm Des 2018; 24(28): 3347-58.
[] [PMID: 29879881]
RoboRXN: Automating chemical synthesis | IBM Research Blog. Available from: (Accessed on October 3, 2021).
Schwaller P, Petraglia R, Zullo V, et al. Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy. Chem Sci (Camb) 2020; 11(12): 3316-25.
[] [PMID: 34122839]
Vaucher AC, Zipoli F, Geluykens J, Nair VH, Schwaller P, Laino T. Automated extraction of chemical synthesis actions from experimental procedures. Nat Commun 2020; 11(1): 3601.
[] [PMID: 32681088]
Segler MHS, Preuss M, Waller MP. Planning chemical syntheses with deep neural networks and symbolic AI 2018. Nature 2018; 555: 604-10.
Cotarelo A, García-Díaz V, Núñez-Valdez ER, González García C, Gómez A, Chun-Wei Lin J. Improving Monte Carlo tree search with artificial neural networks without heuristics. Appl Sci (Basel) 2021; 11: 2056.
Subramaniam S, Mehrotra M, Gupta D. Virtual high throughput screening (vHTS)-a perspective. Bioinformation 2008; 3(1): 14-7.
[] [PMID: 19052660]
Shaikh F, Zhao Y, Alvarez L, Iliopoulou M, Lohans C, Schofield CJ, et al. Structure-Based in Silico Screening Identifies a Potent Ebolavirus Inhibitor from a Traditional Chinese Medicine Library. J Med Chem 2019; 62: 21.
Opo FADM, Rahman MM, Ahammad F, Ahmed I, Bhuiyan MA, Asiri AM. Structure based pharmacophore modeling, virtual screening, molecular docking and ADMET approaches for identification of natural anti-cancer agents targeting XIAP protein. Sci Rep 2021; 11: 4049.
Yang CC, Domeniconi G, Zhang L. Design of AI-enhanced drug lead optimization workflow for HPC and Cloud.IEEE International Conference on Big Data. 2020; pp. 5861-3.
Zhang L, Domeniconi G, Yang CC, Kang S, Zhou R, Cong G. CASTELO: Clustered atom subtypes aided lead optimization—a combined machine learning and molecular modeling method. BMC Bioinformatics 2021; 22: 338.
[] [PMID: 34157976]
Melvin RL, Xiao J, Godwin RC, Berenhaut KS, Salsbury FR Jr. Visualizing correlated motion with HDBSCAN clustering. Protein Sci 2018; 27(1): 62-75.
[] [PMID: 28799290]
Awad M, Khanna R. Support vector machines for classification. Berkeley, CA: Apress 2015; pp. 39-66.
Hinton GE, Osindero S, Teh YW. A fast learning algorithm for deep belief nets. Neural Comput 2006; 18(7): 1527-54.
[] [PMID: 16764513]
Jaderberg M, Simonyan K, Vedaldi A, Zisserman A. Reading text in the wild with convolutional neural networks. Int J Comput Vis 2016; 116: 1-20.
Sliwoski G, Kothiwale S, Meiler J, Lowe EW Jr. Computational methods in drug discovery. Pharmacol Rev 2013; 66(1): 334-95.
[] [PMID: 24381236]
Krogh A. What are artificial neural networks? Nat Biotechnol 2008; 26(2): 195-7.
[] [PMID: 18259176]
Larochelle H, Bengio Y, Louradour J, Ca LU. Exploring strategies for training deep neural networks pascal lamblin. J Mach Learn Res 2009; 1: 1-40.
Albawi S, Mohammed TA, Al-Zawi S. Understanding of a convolutional neural network.International Conference on Engineering and Technology (ICET). 2017; pp. 1-6.
Lawrence S, Giles CL, Tsoi AC, Back AD. Face recognition: A convolutional neural-network approach. IEEE Trans Neural Netw 1997; 8(1): 98-113.
[] [PMID: 18255614]
Raj JS, Ananthi JV. Recurrent neural networks and nonlinear prediction in support vector machines. J Soft Comput Paradigm 2019; 1: 33-40.
Yin C, Zhu Y, Fei J, He X. A deep learning approach for intrusion detection using recurrent neural networks. IEEE Access 2017; 5: 21954-61.
Joo S, Kim MS, Yang J, Park J. Generative model for proposing drug candidates satisfying anticancer properties using a conditional variational autoencoder. ACS Omega 2020; 5(30): 18642-50.
[] [PMID: 32775866]
Kadurin A, Nikolenko S, Khrabrov K, Aliper A, Zhavoronkov A. druGAN: An advanced generative adversarial autoencoder model for de novo generation of new molecules with desired molecular properties in silico. Mol Pharm 2017; 14(9): 3098-104.
[] [PMID: 28703000]
Polykovskiy D, Zhebrak A, Vetrov D, et al. Entangled conditional adversarial autoencoder for de novo drug discovery. Mol Pharm 2018; 15(10): 4398-405.
[] [PMID: 30180591]
Using AI to accelerate drug discovery. Available from: (Accessed on October 5, 2021).
Standigm - Standigm. Available from: (Accessed on October 5, 2021).
Home - Cytoreason. Available from: (Accessed on October 5, 2021).
Cytoreason and Pfizer to use machine learning model of the immune system for drug discovery. Available from: (Accessed on October 5, 2021).
Normand R, Du W, Briller M, et al. Found in translation: A machine learning model for mouse-to-human inference. Nat Methods 2018; 15(12): 1067-73.
[] [PMID: 30478323]
Maxeiner J, Sharma R, Amrhein C, et al. Genomics integrated systems transgenesis (GENISYST) for gain-of-function disease modelling in Göttingen Minipigs. J Pharmacol Toxicol Methods 2021; 108: 106956.
[] [PMID: 33609731]
Genimaps®-Genisyst® drug discovery platform Genome Biologics UG - [LSE] - The European Life Sciences Web Portal. Available from: (Accessed on October 5, 2021).
Our Solution - BullFrog AI Holdings Available from: (Accessed on October 5, 2021).
Lavecchia A. Machine-learning approaches in drug discovery: Methods and applications. Drug Discov Today 2015; 20(3): 318-31.
[] [PMID: 25448759]
Ma J, Sheridan RP, Liaw A, Dahl GE, Svetnik V. Deep neural nets as a method for quantitative structure-activity relationships. J Chem Inf Model 2015; 55(2): 263-74.
[] [PMID: 25635324]
Krizhevsky A, Sutskever I, Hinton GE. ImageNet classification with deep convolutional neural networks. Commun ACM 2017; 60: 84-90.
Öztürk H, Özgür A, Schwaller P, Laino T, Ozkirimli E. Exploring chemical space using natural language processing methodologies for drug discovery. Drug Discov Today 2020; 25(4): 689-705.
[] [PMID: 32027969]
Jiménez-Luna J, Grisoni F, Weskamp N, Schneider G. Artificial intelligence in drug discovery: Recent advances and future perspectives. Expert Opin Drug Discov 2021; 16(9): 949-59.
[] [PMID: 33779453]
Sydow D, Burggraaff L, Szengel A, et al. Advances and challenges in computational target prediction. J Chem Inf Model 2019; 59(5): 1728-42.
[] [PMID: 30817146]
Mayr A, Klambauer G, Unterthiner T, Hochreiter S. DeepTox: Toxicity prediction using deep learning. Front Environ Sci 2016; 3: 80.
Zang Q, Mansouri K, Williams AJ, et al. In silico prediction of physicochemical properties of environmental chemicals using molecular fingerprints and machine learning. J Chem Inf Model 2017; 57(1): 36-49.
[] [PMID: 28006899]
Zhong F, Xing J, Li X, et al. Artificial intelligence in drug design. Sci China Life Sci 2018; 61(10): 1191-204.
[] [PMID: 30054833]
Lusci A, Pollastri G, Baldi P. Deep architectures and deep learning in chemoinformatics: The prediction of aqueous solubility for drug-like molecules. J Chem Inf Model 2013; 53(7): 1563-75.
[] [PMID: 23795551]
Kumar R, Sharma A, Siddiqui MH, Tiwari RK. Prediction of human intestinal absorption of compounds using artificial intelligence techniques. Curr Drug Discov Technol 2017; 14(4): 244-54.
[] [PMID: 28382857]
Rupp M, Körner R, Tetko IV. Estimation of acid dissociation constants using graph kernels. Mol Inform 2010; 29(10): 731-40.
[] [PMID: 27464016]
Öztürk H, Özgür A, Ozkirimli E. DeepDTA: Deep drug-target binding affinity prediction. Bioinformatics 2018; 34(17): i821-9.
[] [PMID: 30423097]
Lounkine E, Keiser MJ, Whitebread S, et al. Large-scale prediction and testing of drug activity on side-effect targets. Nature 2012; 486(7403): 361-7.
[] [PMID: 22722194]
Karimi M, Wu D, Wang Z, Shen Y. DeepAffinity: Interpretable deep learning of compound-protein affinity through unified recurrent and convolutional neural networks. Bioinformatics 2019; 35(18): 3329-38.
[] [PMID: 30768156]
Feng Q, Dueva E, Cherkasov A, Ester M. A deep learning-based framework for drug-target interaction prediction. arXiv 2018; 1-29.
Fonger GC. Hazardous substances data bank (HSDB) as a source of environmental fate information on chemicals. Toxicology 1995; 103(2): 137-45.
[] [PMID: 8545846]
Fonger GC, Hakkinen P, Jordan S, Publicker S. The National Library of Medicine’s (NLM) Hazardous Substances Data Bank (HSDB): Background, recent enhancements and future plans. Toxicology 2014; 325: 209-16.
[] [PMID: 25223694]
Hansch C. A quantitative approach to biochemical structure-activity relationship. Acc Chem Res 1969; 2(8): 232-9.
Bradbury SP. Predicting modes of toxic action from chemical structure: An overview. SAR QSAR Environ Res 1994; 2(1-2): 89-104.
[] [PMID: 8790641]
Cronin MTD, Dearden JC. QSAR in toxicology. 1. prediction of aquatic toxicity. Mol Inform 1995; 14: 1-7.
Dunn WJ III. QSAR approaches to predicting toxicity. Toxicol Lett 1988; 43(1-3): 277-83.
[] [PMID: 3176069]
Wang S, Liu W, Wu J, Cao L, Meng Q, Kennedy PJ. Training deep neural networks on imbalanced data sets Int Jt Conf Neural Netw. (IJCNN) 2016; pp. 4368-74.
Myint KZ, Wang L, Tong Q, Xie XQ. Molecular fingerprint-based artificial neural networks QSAR for ligand biological activity predictions. Mol Pharm 2012; 9(10): 2912-23.
[] [PMID: 22937990]
Myint KZ, Xie XQ. Ligand biological activity predictions using fingerprint-based artificial neural networks (FANN-QSAR). Methods Mol Biol 2015; 1260: 149-64.
[] [PMID: 25502380]
Dahl GE, Jaitly N, Salakhutdinov R. Multi-task neural networks for QSAR predictions. arXiv 2014; 1-21.
Gute BD, Basak SC. Predicting acute toxicity (LC50) of benzene derivatives using theoretical molecular descriptors: A hierarchical QSAR approach. SAR QSAR Environ Res 1997; 7(1-4): 117-31.
[] [PMID: 9501507]
Basak SC, Grunwald GD, Gute BD, Balasubramanian K, Opitz D. Use of statistical and neural net approaches in predicting toxicity of chemicals. J Chem Inf Comput Sci 2000; 40(4): 885-90.
[] [PMID: 10955514]
Lu J, Peng J, Wang J, et al. Estimation of acute oral toxicity in rat using local lazy learning. J Cheminform 2014; 6: 26.
[] [PMID: 24959207]
Martin TM, Lilavois CR, Barron MG. Prediction of pesticide acute toxicity using two-dimensional chemical descriptors and target species classification. SAR QSAR Environ Res 2017; 28(6): 525-39.
[] [PMID: 28703021]
Xu Y, Pei J, Lai L. Deep learning based regression and multiclass models for acute oral toxicity prediction with automatic chemical feature extraction. J Chem Inf Model 2017; 57(11): 2672-85.
[] [PMID: 29019671]
CovDock | Schrödinger.. Available from: (Accessed on February 18, 2021).
QM-Polarized Ligand Docking | Schrödinger. Available from: (Accessed on February 18, 2021).
Gohlke H, Klebe G. DrugScore meets CoMFA: Adaptation of fields for molecular comparison (AFMoC) or how to tailor knowledge-based pair-potentials to a particular protein. J Med Chem 2002; 45(19): 4153-70.
[] [PMID: 12213058]
Gohlke H, Hendlich M, Klebe G. Knowledge-based scoring function to predict protein-ligand interactions. J Mol Biol 2000; 295(2): 337-56.
[] [PMID: 10623530]
Roche O, Kiyama R, Brooks CL III. Ligand-protein database: Linking protein-ligand complex structures to binding data. J Med Chem 2001; 44(22): 3592-8.
[] [PMID: 11606123]
Gohlke H, Hendlich M, Klebe G. Predicting binding modes, binding affinities and “hot spots” for protein-ligand complexes using a knowledge-based scoring function. Perspect Drug Discov Des 2000; 20: 115-44.
Jones G, Willett P, Glen RC, Leach AR, Taylor R. Development and validation of a genetic algorithm for flexible docking. J Mol Biol 1997; 267(3): 727-48.
[] [PMID: 9126849]
Weisel M, Proschak E, Schneider G. PocketPicker: Analysis of ligand binding-sites with shape descriptors. Chem Cent J 2007; 1(7): 7.
[] [PMID: 17880740]

Rights & Permissions Print Export Cite as
© 2023 Bentham Science Publishers | Privacy Policy