Generic placeholder image

Current Topics in Medicinal Chemistry


ISSN (Print): 1568-0266
ISSN (Online): 1873-4294

Review Article

Recent Progress in Machine Learning-based Prediction of Peptide Activity for Drug Discovery

Author(s): Qihui Wu, Hanzhong Ke, Dongli Li, Qi Wang, Jiansong Fang* and Jingwei Zhou*

Volume 19, Issue 1, 2019

Page: [4 - 16] Pages: 13

DOI: 10.2174/1568026619666190122151634

Price: $65


Over the past decades, peptide as a therapeutic candidate has received increasing attention in drug discovery, especially for antimicrobial peptides (AMPs), anticancer peptides (ACPs) and antiinflammatory peptides (AIPs). It is considered that the peptides can regulate various complex diseases which are previously untouchable. In recent years, the critical problem of antimicrobial resistance drives the pharmaceutical industry to look for new therapeutic agents. Compared to organic small drugs, peptide- based therapy exhibits high specificity and minimal toxicity. Thus, peptides are widely recruited in the design and discovery of new potent drugs. Currently, large-scale screening of peptide activity with traditional approaches is costly, time-consuming and labor-intensive. Hence, in silico methods, mainly machine learning approaches, for their accuracy and effectiveness, have been introduced to predict the peptide activity. In this review, we document the recent progress in machine learning-based prediction of peptides which will be of great benefit to the discovery of potential active AMPs, ACPs and AIPs.

Keywords: Antimicrobial peptides (AMPs), Anticancer peptides (ACPs), Anti-inflammatory peptides (AIPs), Machine learning, Activity prediction, R&D.

Graphical Abstract
Mócsai, A.; Kovács, L.; Gergely, P. What is the future of targeted therapy in rheumatology: Biologics or small molecules? BMC Med., 2014, 12(1), 43-51.
[] [PMID: 24620738]
Fosgerau, K.; Hoffmann, T. Peptide therapeutics: Current status and future directions. Drug Discov. Today, 2015, 20(1), 122-128.
[] [PMID: 25450771]
Castel, G.; Chtéoui, M.; Heyd, B.; Tordo, N. Phage display of combinatorial peptide libraries: application to antiviral research. Molecules, 2011, 16(5), 3499-3518.
[] [PMID: 21522083]
de la Torre, B.G.; Albericio, F. The pharmaceutical industry in 2017. An analysis of FDA drug approvals from the perspective of molecules. Molecules, 2018, 23(3), 533-540.
[] [PMID: 29495494]
Du, Q.S.; Xie, N.Z.; Huang, R.B. Recent development of peptide drugs and advance on theory and methodology of peptide inhibitor design. Med. Chem., 2015, 11(3), 235-247.
[] [PMID: 25548931]
Fang, J.; Yang, R.; Gao, L.; Yang, S.; Pang, X.; Li, C.; He, Y.; Liu, A.L.; Du, G.H. Consensus models for CDK5 inhibitors in silico and their application to inhibitor discovery. Mol. Divers., 2015, 19(1), 149-162.
[] [PMID: 25511641]
Fang, J.; Li, Y.; Liu, R.; Pang, X.; Li, C.; Yang, R.; He, Y.; Lian, W.; Liu, A.L.; Du, G.H. Discovery of multitarget-directed ligands against Alzheimer’s disease through systematic prediction of chemical-protein interactions. J. Chem. Inf. Model., 2015, 55(1), 149-164.
[] [PMID: 25531792]
Fang, J.; Yang, R.; Gao, L.; Zhou, D.; Yang, S.; Liu, A.L.; Du, G.H. Predictions of BuChE inhibitors using support vector machine and naive Bayesian classification techniques in drug discovery. J. Chem. Inf. Model., 2013, 53(11), 3009-3020.
[] [PMID: 24144102]
Shah, Y.; Sehgal, D.; Valadi, J.K. Recent trends in antimicrobial peptide prediction using machine learning techniques. Bioinformation, 2017, 13(12), 415-416.
[] [PMID: 29379261]
Porto, W.F.; Pires, A.S.; Franco, O.L. Computational tools for exploring sequence databases as a resource for antimicrobial peptides. Biotechnol. Adv., 2017, 35(3), 337-349.
[] [PMID: 28216008]
Liu, S.; Fan, L.; Sun, J.; Lao, X.; Zheng, H. Computational resources and tools for antimicrobial peptides. J. Pept. Sci., 2017, 23(1), 4-12.
[] [PMID: 27966-278]
Torrent, M.; Nogués, M.V.; Boix, E. Discovering new in silico tools for antimicrobial peptide prediction. Curr. Drug Targets, 2012, 13(9), 1148-1157.
[] [PMID: 22664076]
Wang, Z.; Wang, G. APD: The antimicrobial peptide database. Nucleic Acids Res., 2004, 32(Database issue), D590-D592.
[] [PMID: 14681488]
Wang, G.; Li, X.; Wang, Z. APD2: The updated antimicrobial peptide database and its application in peptide design. Nucleic Acids Res., 2009, 37(Database issue), D933-D937.
[] [PMID: 18957441]
Whitmore, L.; Wallace, B.A. The peptaibol database: A database for sequences and structures of naturally occurring peptaibols. Nucleic Acids Res., 2004, 32, D593-D594.
[] [PMID: 14681489]
Fjell, C.D.; Hancock, R.E.; Cherkasov, A. AMPer: A database and an automated discovery tool for antimicrobial peptides. Bioinformatics, 2007, 23(9), 1148-1155.
[] [PMID: 17341497]
Seebah, S.; Suresh, A.; Zhuo, S.; Choong, Y.H.; Chua, H.; Chuon, D.; Beuerman, R.; Verma, C. Defensins knowledgebase: a manually curated database and information source focused on the defensins family of antimicrobial peptides. Nucleic Acids Res., 2007, 35(Database issue), D265-D268.
[] [PMID: 17090586]
Wang, C.K.L.; Kaas, Q.; Chiche, L.; Craik, D.J. CyBase: a database of cyclic protein sequences and structures, with applications in protein discovery and engineering. Nucleic Acids Res., 2008, 36(Database issue), D206-D210.
[PMID: 17986451]
Hammami, R.; Ben Hamida, J.; Vergoten, G.; Fliss, I. PhytAMP: A database dedicated to antimicrobial plant peptides. Nucleic Acids Res., 2009, 37, D963-D968.
[] [PMID: 18836196]
Thomas, S.; Karnik, S.; Barai, R.S.; Jayaraman, V.K.; Idicula-Thomas, S. CAMP: A useful resource for research on antimicrobial peptides. Nucleic Acids Res., 2010, 38(Database issue), D774-D780.
[] [PMID: 19923233]
Waghu, F.H.; Barai, R.S.; Gurung, P.; Idicula-Thomas, S. CAMPR3: A database on sequences, structures and signatures of antimicrobial peptides. Nucleic Acids Res., 2016, 44(D1), D1094-D1097.
[] [PMID: 26467-475]
Seshadri Sundararajan, V.; Gabere, M.N.; Pretorius, A.; Adam, S.; Christoffels, A.; Lehväslaiho, M.; Archer, J.A.; Bajic, V.B. DAMPD: A manually curated antimicrobial peptide database. Nucleic Acids Res., 2012, 40, D1108-D1112.
[] [PMID: 22110032]
Piotto, S.P.; Sessa, L.; Concilio, S.; Iannelli, P. YADAMP: Yet another database of antimicrobial peptides. Int. J. Antimicrob. Agents, 2012, 39(4), 346-351.
[] [PMID: 22325123]
Novković, M.; Simunić, J.; Bojović, V.; Tossi, A.; Juretić, D. DADP: The database of anuran defense peptides. Bioinformatics, 2012, 28(10), 1406-1407.
[] [PMID: 22467909]
Gautam, A.; Chaudhary, K.; Singh, S.; Joshi, A.; Anand, P.; Tuknait, A.; Mathur, D.; Varshney, G.C.; Raghava, G.P.S. Hemolytik: A database of experimentally determined hemolytic and non-hemolytic peptides. Nucleic Acids Res., 2014, 42(Database issue), D444-D449.
[] [PMID: 24174543]
Qureshi, A.; Thakur, N.; Tandon, H.; Kumar, M. AVPdb: a database of experimentally validated antiviral peptides targeting medically important viruses. Nucleic Acids Res., 2014, 42(Database issue), D1147-D1153.
[] [PMID: 24285301]
Pirtskhalava, M.; Gabrielian, A.; Cruz, P.; Griggs, H.L.; Squires, R.B.; Hurt, D.E.; Grigolava, M.; Chubinidze, M.; Gogoladze, G.; Vishnepolsky, B.; Alekseyev, V.; Rosenthal, A.; Tartakovsky, M. DBAASP v.2: An enhanced database of structure and antimicrobial/cytotoxic activity of natural and synthetic peptides. Nucleic Acids Res., 2016, 44(D1), D1104-D1112.
[] [PMID: 26578581]
Singh, S.; Chaudhary, K.; Dhanda, S.K.; Bhalla, S.; Usmani, S.S.; Gautam, A.; Tuknait, A.; Agrawal, P.; Mathur, D.; Raghava, G.P.S. SATPdb: A database of structurally annotated therapeutic peptides. Nucleic Acids Res., 2016, 44(D1), D1119-D1126.
[ org/10.1093/nar/gkv1114] [PMID: 26527728]
Fan, L.; Sun, J.; Zhou, M.; Zhou, J.; Lao, X.; Zheng, H.; Xu, H. DRAMP: A comprehensive data repository of antimicrobial peptides. Sci. Rep., 2016, 6, 24482-24488.
[] [PMID: 27075512]
Tyagi, A.; Tuknait, A.; Anand, P.; Gupta, S.; Sharma, M.; Mathur, D.; Joshi, A.; Singh, S.; Gautam, A.; Raghava, G.P.S. CancerPPD: a database of anticancer peptides and proteins. Nucleic Acids Res., 2015, 43, D837-D843.
[] [PMID: 25270878]
Deng, L.; Hinton, G.; Kingsbury, B. In New types of deep neural network learning for speech recognition and related applications: An overview. IEEE International Conference on Acoustics, Speech and Signal Processing, 2013, pp. 8599-8603.
LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature, 2015, 521(7553), 436-444.
[] [PMID: 26017442]
Yap, C.W. PaDEL-descriptor: an open source software to calculate molecular descriptors and fingerprints. J. Comput. Chem., 2011, 32(7), 1466-1474.
[] [PMID: 21425294]
Tetko, I.V.; Gasteiger, J.; Todeschini, R.; Mauri, A.; Livingstone, D.; Ertl, P.; Palyulin, V.A.; Radchenko, E.V.; Zefirov, N.S.; Makarenko, A.S.; Tanchuk, V.Y.; Prokopenko, V.V. Virtual computational chemistry laboratory--design and description. J. Comput. Aided Mol. Des., 2005, 19(6), 453-463.
[] [PMID: 16231203]
Fang, J.; Pang, X.C.; Yan, R.; Lian, W.; Li, C.; Wang, Q.; Liu, A.L.; Du, G. Discovery of neuroprotective compounds by machine learning approaches. RSC Advances, 2016, 6(12), 9857-9871.
Frank, E.; Hall, M.; Trigg, L.; Holmes, G.; Witten, I.H. Data mining in bioinformatics using Weka. Bioinformatics, 2004, 20(15), 2479-2481.
[] [PMID: 15073010]
Demšar, J.; Curk, T.; Erjavec, A.; Goru, Č.; Hočevar, T.; Milutinovič, M.; Možina, M.; Polajnar, M.; Toplak, M.; Starič, A. Orange: Data Mining Toolbox in Python. J. Mach. Learn. Res., 2013, 14(1), 2349-2353.
Rao, H. B.; Zhu, F.; Yang, G. B.; Li, Z. R.; Chen, Y. Z. Update of PROFEAT: A web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence. Nucleic Acids Res., 2011, 39, (Web Server issue), W385- W390.
Sharma, B.K. Analysis and prediction of antibacterial peptides. BMC Bioinformatics, 2007, 8(1), 1-10.
[] [PMID: 17199892]
Lata, S.; Mishra, N.K.; Raghava, G.P. AntiBP2: improved version of antibacterial peptide prediction. BMC Bioinformatics, 2010, 11(Suppl. 1), S19.
[] [PMID: 20122190]
Waghu, F.H.; Gopi, L.; Barai, R.S.; Ramteke, P.; Nizami, B.; Idicula-Thomas, S. CAMP: Collection of sequences and structures of antimicrobial peptides. Nucleic Acids Res., 2014, 42(Database issue), D1154-D1158.
[] [PMID: 24265220]
Dziuba, B.; Dziuba, M. New milk protein-derived peptides with potential antimicrobial activity: An approach based on bioinformatic studies. Int. J. Mol. Sci., 2014, 15(8), 14531-14545.
[] [PMID: 25141106]
Wang, P.; Hu, L.; Liu, G.; Jiang, N.; Chen, X.; Xu, J.; Zheng, W.; Li, L.; Tan, M.; Chen, Z.; Song, H.; Cai, Y.D.; Chou, K.C. Prediction of antimicrobial peptides based on sequence alignment and feature selection methods. PLoS One, 2011, 6(4), e18476.
[] [PMID: 21533231]
Friedman, J.H.; Baskett, F.; Shustek, L.J. An algorithm for finding nearest neighbors. IEEE Trans. Comput., 1975, C-24(10), 1000-1006.
Thakur, N.; Qureshi, A.; Kumar, M. AVPPred: Collection and prediction of highly effective antiviral peptides. Nucleic Acids Res, 2012, 40, (Web Server issue), W199-W204.
Porto, W.F.; Pires, Á.S.; Franco, O.L. CS-AMPPred: An updated SVM model for antimicrobial activity prediction in cysteine-stabilized peptides. PLoS One, 2012, 7(12), e51444.
[] [PMID: 23240023]
Joseph, S.; Karnik, S.; Nilawe, P.; Jayaraman, V.K.; Idicula-Thomas, S. ClassAMP: a prediction tool for classification of antimicrobial peptides. IEEE/ACM Trans. Comput. Biol. Bioinformatics, 2012, 9(5), 1535-1538.
[] [PMID: 22732690]
Niarchou, A.; Alexandridou, A.; Athanasiadis, E.; Spyrou, G. C-PAmP: large scale analysis and database construction containing high scoring computationally predicted antimicrobial peptides for all the available plant species. PLoS One, 2013, 8(11), e79728.
[] [PMID: 24244550]
Xiao, X.; Wang, P.; Lin, W.Z.; Jia, J.H.; Chou, K.C. iAMP-2L: A two-level multi-label classifier for identifying antimicrobial peptides and their functional types. Anal. Biochem., 2013, 436(2), 168-177.
[] [PMID: 23395824]
Tanford, C. Contribution of hydrophobic interactions to the stability of the globular conformation of proteins. J. Am. Chem. Soc., 1962, 84(22), 4240-4247.
Gaydon, A.G. Handbook of chemistry and physics. Soil Science Society of America Journal., (47th Ed.), 1967, 18(4), pp. 115.
Higgins, M.J.P. Data for biochemical research. Biochemical Society Transactions., (3rd. ) 1987, 15(4), pp. 777.2-777.
Chou, K.C. Prediction of protein cellular attributes using pseudo-amino acid composition. Proteins, 2001, 43(3), 246-255.
[] [PMID: 11288174]
Lee, H.T.; Lee, C.C.; Yang, J.R.; Lai, J.Z.; Chang, K.Y. A large-scale structural classification of antimicrobial peptides. BioMed Res. Int., 2015, 2015, 475062-475067.
[PMID: 26000295]
Meher, P.K.; Sahu, T.K.; Saini, V.; Rao, A.R. Predicting antimicrobial peptides with improved accuracy by incorporating the compositional, physico-chemical and structural features into Chou’s general PseAAC. Sci. Rep., 2017, 7, 42362-42373.
[] [PMID: 28205576]
Veltri, D.; Kamath, U.; Shehu, A. Improving recognition of antimicrobial peptides and target selectivity through machine learning and genetic programming. IEEE/ACM Trans. Comput. Biol. Bioinformatics, 2017, 14(2), 300-313.
[] [PMID: 28368808]
Vishnepolsky, B.; Gabrielian, A.; Rosenthal, A.; Hurt, D.E.; Tartakovsky, M.; Managadze, G.; Grigolava, M.; Makhatadze, G.I.; Pirtskhalava, M. Predictive model of linear AMPs active against gram-negative bacteria. J. Chem. Inf. Model., 2018, 58(5), 1141-1151.
[] [PMID: 29716188]
Mader, J.S.; Hoskin, D.W. Cationic antimicrobial peptides as novel cytotoxic agents for cancer treatment. Expert Opin. Investig. Drugs, 2006, 15(8), 933-946.
[ 15.8.933] [PMID: 16859395]
Gupta, S.; Sharma, A.K.; Shastri, V.; Madhu, M.K.; Sharma, V.K. Prediction of anti-inflammatory proteins/peptides: An in silico approach. J. Transl. Med., 2017, 15(1), 7-17.
[] [PMID: 28057002]
Nagpal, G.; Usmani, S.S.; Dhanda, S.K.; Kaur, H.; Singh, S.; Sharma, M.; Raghava, G.P. Computer-aided designing of immunosuppressive peptides based on IL-10 inducing potential. Sci. Rep., 2017, 7, 42851-42860.
[] [PMID: 28211521]
Hawrylowicz, C.M.; O’Garra, A. Potential role of interleukin-10-secreting regulatory T cells in allergy and asthma. Nat. Rev. Immunol., 2005, 5(4), 271-283.
[] [PMID: 15775993]
Bromberg, J.S. IL-10 immunosuppression in transplantation. Curr. Opin. Immunol., 1995, 7(5), 639-643.
[] [PMID: 8573306]
Shinozaki, K.; Yahata, H.; Tanji, H.; Sakaguchi, T.; Ito, H.; Dohi, K. Allograft transduction of IL-10 prolongs survival following orthotopic liver transplantation. Gene Ther., 1999, 6(5), 816-822.
[] [PMID: 10505106]
Manavalan, B.; Shin, T.H.; Kim, M.O.; Lee, G. AIPpred: Sequence-based prediction of anti-inflammatory peptides using random forest. Front. Pharmacol., 2018, 9, 276.
[] [PMID: 29636690]
Tyagi, A.; Kapoor, P.; Kumar, R.; Chaudhary, K.; Gautam, A.; Raghava, G.P. In silico models for designing and discovering novel anticancer peptides. Sci. Rep., 2013, 3(10), 2984-2991.
[] [PMID: 24136089]
Vijayakumar, S.; Ptv, L. ACPP: A web server for prediction and design of anti-cancer peptides. Int. J. Pept. Res. Ther., 2015, 21(1), 99-106.
Chen, W.; Ding, H.; Feng, P.; Lin, H.; Chou, K.C. iACP: A sequence-based tool for identifying anticancer peptides. Oncotarget, 2016, 7(13), 16895-16909.
[] [PMID: 26942877]
Manavalan, B.; Basith, S.; Shin, T.H.; Choi, S.; Kim, M.O.; Lee, G. MLACP: Machine-learning-based prediction of anticancer peptides. Oncotarget, 2017, 8(44), 77121-77136.
[] [PMID: 29100375]
Hajisharifi, Z.; Piryaiee, M.; Mohammad Beigi, M.; Behbahani, M.; Mohabatkar, H. Predicting anticancer peptides with Chou’s pseudo amino acid composition and investigating their mutagenicity via Ames test. J. Theor. Biol., 2014, 341, 34-40.
[] [PMID: 24035842]
Akbar, S.; Hayat, M.; Iqbal, M.; Jan, M.A. iACP-GAEnsC: Evolutionary genetic algorithm based ensemble classification of anticancer peptides by utilizing hybrid feature space. Artif. Intell. Med., 2017, 79, 62-70.
[] [PMID: 28655440]
Kabir, M.; Hayat, M. iRSpot-GAEnsC: Identifing recombination spots via ensemble classifier and extending the concept of Chou’s PseAAC to formulate DNA samples. Mol. Genet. Genomics, 2016, 291(1), 285-296.
[] [PMID: 26319782]
Iqbal, M.; Hayat, M. “iSS-Hyb-mRMR”: Identification of splicing sites using hybrid space of pseudo trinucleotide and pseudo tetranucleotide composition. Comput. Methods Programs Biomed., 2016, 128, 1-11.
[] [PMID: 27040827]
Wang, P.; Ge, R.; Liu, L.; Xiao, X.; Li, Y.; Cai, Y. Multi-label learning for predicting the activities of antimicrobial peptides. Sci. Rep., 2017, 7(1), 2202-2212.
[] [PMID: 28526820]
Bhadra, P.; Yan, J.; Li, J.; Fong, S.; Siu, S.W.I. AmPEP: Sequence-based prediction of antimicrobial peptides using distribution patterns of amino acid properties and random forest. Sci. Rep., 2018, 8(1), 1697-1706.
[] [PMID: 29374199]
Veltri, D.; Kamath, U.; Shehu, A. Deep learning improves antimicrobial peptide recognition. Bioinformatics, 2018, 34(16), 2740-2747.
[] [PMID: 29590297]
Xu, L.; Liang, G.; Wang, L.; Liao, C. A novel hybrid sequence-based model for identifying anticancer peptides. Genes (Basel), 2018, 9(3), e158.
[] [PMID: 29534013]
Fernandes, F.C.; Rigden, D.J.; Franco, O.L. Prediction of antimicrobial peptides based on the adaptive neuro-fuzzy inference system application. Biopolymers, 2012, 98(4), 280-287.
[] [PMID: 23193592]
Jang, R. Adaptive network-based fuzzy inference system. IEEE Trans. Syst. Man Cybern., 1993, 23(3), 665-683.
Ng, X.Y.; Rosdi, B.A.; Shahrudin, S. Prediction of antimicrobial peptides based on sequence alignment and support vector machine-pairwise algorithm utilizing LZ-complexity. BioMed Res. Int., 2015, 2015(3), 212715-212727.
[PMID: 25802839]
Abboud, G.; Kaplowitz, N. Drug-induced liver injury. Drug Saf., 2007, 30(4), 277-294.
[] [PMID: 17408305]
Li, W.; Godzik, A. Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics, 2006, 22(13), 1658-1659.
[] [PMID: 16731699]
Fang, J.; Liu, C.; Wang, Q.; Lin, P.; Cheng, F. In silico polypharmacology of natural products. Brief. Bioinform., 2017.
[] [PMID: 28460068]
Yamanishi, Y.; Araki, M.; Gutteridge, A.; Honda, W.; Kanehisa, M. Prediction of drug-target interaction networks from the integration of chemical and genomic spaces. Bioinformatics, 2008, 24(13), i232-i240.
[] [PMID: 18586719]
Cheng, F.; Zhou, Y.; Li, J.; Li, W.; Liu, G.; Tang, Y. Prediction of chemical-protein interactions: Multitarget-QSAR versus computational chemogenomic methods. Mol. Biosyst., 2012, 8(9), 2373-2384.
[] [PMID: 22751809]
Wen, M.; Zhang, Z.; Niu, S.; Sha, H.; Yang, R.; Yun, Y.; Lu, H. Deep-learning-based drug-target interaction prediction. J. Proteome Res., 2017, 16(4), 1401-1409.
[] [PMID: 28264154]
Wang, L.; You, Z.H.; Chen, X.; Xia, S.X.; Liu, F.; Yan, X.; Zhou, Y.; Song, K.J. A Computational-based method for predicting drug-target interactions by using stacked autoencoder deep neural network. J. Comput. Biol., 2017, 25(3), 361-373.
Cai, C.; Fang, J.; Guo, P.; Wang, Q.; Hong, H.; Moslehi, J.; Cheng, F. In Silico pharmacoepidemiologic evaluation of drug-induced cardiovascular complications using combined classifiers. J. Chem. Inf. Model., 2018, 58(5), 943-956.
[] [PMID: 29712429]
Xu, Y.; Dai, Z.; Chen, F.; Gao, S.; Pei, J.; Lai, L. Deep learning for drug-induced liver injury. J. Chem. Inf. Model., 2015, 55(10), 2085-2093.
[] [PMID: 26437739]

Rights & Permissions Print Export Cite as
© 2023 Bentham Science Publishers | Privacy Policy