Evolution of Sequence-based Bioinformatics Tools for Protein-protein Interaction Prediction

Author(s): Mst. Shamima Khatun, Watshara Shoombuatong, Md. Mehedi Hasan*, Hiroyuki Kurata*

Journal Name: Current Genomics

Volume 21 , Issue 6 , 2020

Become EABM
Become Reviewer
Call for Editor

Graphical Abstract:


Protein-protein interactions (PPIs) are the physical connections between two or more proteins via electrostatic forces or hydrophobic effects. Identification of the PPIs is pivotal, which contributes to many biological processes including protein function, disease incidence, and therapy design. The experimental identification of PPIs via high-throughput technology is time-consuming and expensive. Bioinformatics approaches are expected to solve such restrictions. In this review, our main goal is to provide an inclusive view of the existing sequence-based computational prediction of PPIs. Initially, we briefly introduce the currently available PPI databases and then review the state-of-the-art bioinformatics approaches, working principles, and their performances. Finally, we discuss the caveats and future perspective of the next generation algorithms for the prediction of PPIs.

Keywords: Protein-protein interactions, PPIs database, sequence features, feature selection, machine learning, bioinformatics.

De Las Rivas, J.; Fontanillo, C. Protein-protein interactions essentials: key concepts to building and analyzing interactome networks. PLOS Comput. Biol., 2010, 6(6): e1000807.
[http://dx.doi.org/10.1371/journal.pcbi.1000807] [PMID: 20589078]
Liu, X.; Yang, Z.; Sang, S.; Lin, H.; Wang, J.; Xu, B. Detection of protein complexes from multiple protein interaction networks using graph embedding. Artif. Intell. Med., 2019, 96, 107-115.
[http://dx.doi.org/10.1016/j.artmed.2019.04.001] [PMID: 31164203]
Dos Santos Vasconcelos, C.R.; de Lima Campos, T.; Rezende, A.M. Building protein-protein interaction networks for Leishmania species through protein structural information. BMC Bioinformatics, 2018, 19(1), 85.
[http://dx.doi.org/10.1186/s12859-018-2105-6] [PMID: 29510668]
Caterino, M.; Ruoppolo, M.; Mandola, A.; Costanzo, M.; Orrù, S.; Imperlini, E. Protein-protein interaction networks as a new perspective to evaluate distinct functional roles of voltage-dependent anion channel isoforms. Mol. Biosyst., 2017, 13(12), 2466-2476.
[http://dx.doi.org/10.1039/C7MB00434F] [PMID: 29028058]
Xiao, H.; Yang, L.; Liu, J.; Jiao, Y.; Lu, L.; Zhao, H. Protein-protein interaction analysis to identify biomarker networks for endometriosis. Exp. Ther. Med., 2017, 14(5), 4647-4654.
[http://dx.doi.org/10.3892/etm.2017.5185] [PMID: 29201163]
Planas-Iglesias, J.; Marin-Lopez, M.A.; Bonet, J.; Garcia-Garcia, J.; Oliva, B. iLoops: a protein-protein interaction prediction server based on structural features. Bioinformatics, 2013, 29(18), 2360-2362.
[http://dx.doi.org/10.1093/bioinformatics/btt401] [PMID: 23842807]
Ammari, MG; Gresham, CR; McCarthy, FM; Nanduri, B HPIDB 2.0: a curated database for host-pathogen interactions. Database: J. Biol. Databases Curation, 2016.baw103.
Ohue, M.; Matsuzaki, Y.; Uchikoga, N.; Ishida, T.; Akiyama, Y. MEGADOCK: an all-to-all protein-protein interaction prediction system using tertiary structure data. Protein Pept. Lett., 2014, 21(8), 766-778.
[http://dx.doi.org/10.2174/09298665113209990050] [PMID: 23855673]
Goel, R.; Harsha, H.C.; Pandey, A.; Prasad, T.S. Human Protein Reference Database and Human Proteinpedia as resources for phosphoproteome analysis. Mol. Biosyst., 2012, 8(2), 453-463.
[http://dx.doi.org/10.1039/C1MB05340J] [PMID: 22159132]
Lian, X.; Yang, S.; Li, H.; Fu, C.; Zhang, Z. Machine-learning-based predictor of human-bacteria protein-protein interactions by incorporating comprehensive host-network properties. J. Proteome Res., 2019, 18(5), 2195-2205.
[http://dx.doi.org/10.1021/acs.jproteome.9b00074] [PMID: 30983371]
Liu, C.; Liu, L.; Zhou, C.; Zhuang, J.; Wang, L.; Sun, Y.; Sun, C. Protein-protein interaction networks and different clustering analysis in Burkitt’s lymphoma. Hematology, 2018, 23(7), 391-398.
[http://dx.doi.org/10.1080/10245332.2017.1409947] [PMID: 29189103]
Hanna, E.M.; Zaki, N.; Amin, A. Detecting protein complexes in protein interaction networks modeled as gene expression biclusters. PLoS One, 2015, 10(12): e0144163.
[http://dx.doi.org/10.1371/journal.pone.0144163] [PMID: 26641660]
Giurgiu, M.; Reinhard, J.; Brauner, B.; Dunger-Kaltenbach, I.; Fobo, G.; Frishman, G.; Montrone, C.; Ruepp, A. CORUM: the comprehensive resource of mammalian protein complexes-2019. Nucleic Acids Res., 2019, 47(D1), D559-D563.
[http://dx.doi.org/10.1093/nar/gky973] [PMID: 30357367]
Ruepp, A.; Waegele, B.; Lechner, M.; Brauner, B.; Dunger-Kaltenbach, I.; Fobo, G.; Frishman, G.; Montrone, C.; Mewes, H.W. CORUM: the comprehensive resource of mammalian protein complexes--2009. Nucleic Acids Res., 2010, 38(Database issue), D497-D501.
[http://dx.doi.org/10.1093/nar/gkp914] [PMID: 19884131]
Kaake, R.M.; Wang, X.; Huang, L. Profiling of protein interaction networks of protein complexes using affinity purification and quantitative mass spectrometry. Mol. Cell. Proteomics, 2010, 9(8), 1650-1665.
[http://dx.doi.org/10.1074/mcp.R110.000265] [PMID: 20445003]
Ochoa, D.; García-Gutiérrez, P.; Juan, D.; Valencia, A.; Pazos, F. Incorporating information on predicted solvent accessibility to the co-evolution-based study of protein interactions. Mol. Biosyst., 2013, 9(1), 70-76.
[http://dx.doi.org/10.1039/C2MB25325A] [PMID: 23104128]
Marsh, J.A.; Teichmann, S.A. Structure, dynamics, assembly, and evolution of protein complexes. Annu. Rev. Biochem., 2015, 84, 551-575.
[http://dx.doi.org/10.1146/annurev-biochem-060614-034142] [PMID: 25494300]
Yeh, F.L.; Tung, L.; Chang, T.H. Detection of protein-protein interaction within an RNA-protein complex via unnatural-amino-acid-mediated photochemical crosslinking. Methods Mol. Biol., 2016, 1421, 175-189.
[http://dx.doi.org/10.1007/978-1-4939-3591-8_15] [PMID: 26965266]
Pham, C.D. Detection of protein-protein interaction using bimolecular fluorescence complementation assay. Methods Mol. Biol., 2015, 1278, 483-495.
[http://dx.doi.org/10.1007/978-1-4939-2425-7_32] [PMID: 25859971]
Lavallee-Adam, M.; Coulombe, B.; Blanchette, M. Detection of locally over-represented GO terms in protein-protein interaction networks. J. Computational. Biol., 2010, 17(3), 443-457.
Yang, S.; Fu, C.; Lian, X.; Dong, X.; Zhang, Z. Understanding human-virus protein-protein interactions using a human protein complex-based analysis framework. mSystems, 2019, 4(2), e00303-18.
[http://dx.doi.org/10.1128/mSystems.00303-18] [PMID: 30984872]
Saha, S.; Prasad, A.; Chatterjee, P.; Basu, S.; Nasipuri, M. Protein function prediction from protein-protein interaction network using gene ontology based neighborhood analysis and physico-chemical features. J. Bioinform. Comput. Biol., 2018, 16(6): 1850025.
[http://dx.doi.org/10.1142/S0219720018500257] [PMID: 30400756]
Zhai, J.X.; Cao, T.J.; An, J.Y.; Bian, Y.T. Highly accurate predic-tion of protein self-interactions by incorporating the average block and PSSM information into the general PseAAC. J. Theor. Biol.,. 2017, 432, 80-86.
[http://dx.doi.org/10.1016/j.jtbi.2017.08.009] [PMID: 28802824]
Teng, WJ; Zhou, C; Liu, LJ; Cao, XJ; Zhuang, J; Liu, GX; Sun, C.G. Construction of a protein-protein interaction network of Wilms' tumor and pathway prediction of molecular complexes. Ge-net. Molecul. Res., 2016, 15(2), 1-9.
Aumentado-Armstrong, T.T.; Istrate, B.; Murgita, R.A. Algorith-mic approaches to protein-protein interaction site prediction. Algo-rithms Mol. Biol., 2015, 10, 7.
[http://dx.doi.org/10.1186/s13015-015-0033-9] [PMID: 25713596]
Taghipour, S.; Zarrineh, P.; Ganjtabesh, M.; Nowzari-Dalini, A. Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources. BMC Bioinforma-tics 2017, 18(1), 10.
[http://dx.doi.org/10.1186/s12859-016-1422-x] [PMID: 28049415]
Keane, H.; Ryan, B.J.; Jackson, B.; Whitmore, A.; Wade-Martins, R. Protein-protein interaction networks identify targets which res-cue the MPP+ cellular model of Parkinson’s disease. Sci. Rep., 2015, 5, 17004.
[http://dx.doi.org/10.1038/srep17004] [PMID: 26608097]
Ji, C.; Cao, X.; Yao, C.; Xue, S.; Xiu, Z. Protein-protein interaction network of the marine microalga Tetraselmis subcordiformis: pre-diction and application for starch metabolism analysis. J. Ind. Microbiol. Biotechnol., 2014, 41(8), 1287-1296.
[http://dx.doi.org/10.1007/s10295-014-1462-z] [PMID: 24879479]
Wang, L.; Tam, J.P.; Liu, D.X. Biochemical and functional charac-terization of Epstein-Barr virus-encoded BARF1 protein: interac-tion with human hTid1 protein facilitates its maturation and secre-tion. Oncogene, 2006, 25(31), 4320-4331.
[http://dx.doi.org/10.1038/sj.onc.1209458] [PMID: 16518412]
Amoutzias, G.D.; Robertson, D.L.; Bornberg-Bauer, E. The evolu-tion of protein interaction networks in regulatory proteins. Comp. Funct. Genomics, 2004, 5(1), 79-84.
[http://dx.doi.org/10.1002/cfg.365] [PMID: 18629034]
Ivanic, J.; Yu, X.; Wallqvist, A.; Reifman, J. Influence of protein abundance on high-throughput protein-protein interaction detec-tion. PLoS One 2009, 4(6): e5815.
[http://dx.doi.org/10.1371/journal.pone.0005815] [PMID: 19503833]
Hurst, R.; Hook, B.; Slater, M.R.; Hartnett, J.; Storts, D.R.; Nath, N. Protein-protein interaction studies on protein arrays: effect of detection strategies on signal-to-background ratios. Anal. Biochem., 2009, 392(1), 45-53.
[http://dx.doi.org/10.1016/j.ab.2009.05.028] [PMID: 19464993]
Park, H.; Kang, H.; Ko, W.; Lee, W.; Jo, K.; Lee, H.S. FRET-based analysis of protein-nucleic acid interactions by genetically incorpo-rating a fluorescent amino acid. Amino Acids, 2015, 47(4), 729-734.
[http://dx.doi.org/10.1007/s00726-014-1900-2] [PMID: 25540052]
Xu, B.; Guan, J.; Wang, Y.; Wang, Z. Essential protein detection by random walk on weighted protein-protein interaction networks, IEEE/ACM Trans. Comput. Biol. Bioinformatics, 2019, 16(2), 377-387.
[http://dx.doi.org/10.1109/TCBB.2017.2701824] [PMID: 28504946]
Zaki, N.; Alashwal, H. Improving the Detection of Protein Com-plexes by Predicting Novel Missing Interactome Links in the Pro-tein-Protein Interaction Network Conference proceedings : Annual International Conference of the IEEE Engineering in Medicine and Biology Society IEEE Engineering in Medicine and Biology Society Annual Conference 2018 , 2018; pp. 5041-5044.
Liu, W.; Ma, L.; Jeon, B.; Chen, L.; Chen, B. A Network Hie-rarchy-Based method for functional module detection in protein-protein interaction networks, J. Theor. Biol., 2018, 455, 26-38.
[http://dx.doi.org/10.1016/j.jtbi.2018.06.026] [PMID: 29981337]
Liu, TY; Chou, WC; Chen, WY; Chu, CY; Dai, CY; Wu, PY De-tection of membrane protein-protein interaction in planta based on dual-intein-coupled tripartite split-GFP association, Plant J., 2018, 94(3), 426-438.
Song, B.; Wang, F.; Guo, Y.; Sang, Q.; Liu, M.; Li, D.; Fang, W.; Zhang, D. Protein-protein interaction network-based detection of functionally similar proteins within, species Proteins, 2012, 80(7), 1736-1743.
[http://dx.doi.org/10.1002/prot.24066] [PMID: 22411607]
Subramanian, C.; Xu, Y.; Johnson, C.H.; von Arnim, A.G. In vivodetection of protein-protein interaction in plant cells using BRET, Methods Mol. Biol., 2004, 284, 271-286.
[http://dx.doi.org/10.1385/1-59259-816-1:271] [PMID: 15173623]
Pang, E.; Lin, K. Yeast protein-protein interaction binding sites: prediction from the motif-motif, motif-domain and domain-domain levels, Mol. Biosyst., 2010, 6(11), 2164-2173.
[http://dx.doi.org/10.1039/c0mb00038h] [PMID: 20714642]
Singhal, M.; Resat, H. A domain-based approach to predict protein-protein interactions, BMC Bioinformatics, 2007, 8, 199.
[http://dx.doi.org/10.1186/1471-2105-8-199] [PMID: 17567909]
Dyer, M.D.; Murali, T.M.; Sobral, B.W. Computational prediction of host-pathogen protein-protein interactions, Bioinformatics, 2007, 23(13), i159-i166.
[http://dx.doi.org/10.1093/bioinformatics/btm208] [PMID: 17646292]
Burgoyne, N.J.; Jackson, R.M. Predicting protein interaction sites: binding hot-spots in protein-protein and protein-ligand interfaces, Bioinformatics, 2006, 22(11), 1335-1342.
[http://dx.doi.org/10.1093/bioinformatics/btl079] [PMID: 16522669]
Tachiki, H.; Kato, R.; Kuramitsu, S. DNA binding and protein-protein interaction sites in MutS, a mismatched DNA recognition protein from Thermus thermophilus HB8, J. Biol. Chem., 2000, 275(52), 40703-40709.
[http://dx.doi.org/10.1074/jbc.M007124200] [PMID: 11024056]
Yu, H.; Luscombe, N.M.; Lu, H.X.; Zhu, X.; Xia, Y.; Han, J.D.; Bertin, N.; Chung, S.; Vidal, M.; Gerstein, M. Annotation transfer between genomes: protein-protein interologs and protein-DNA re-gulogs, Genome Res., 2004, 14(6), 1107-1118.
[http://dx.doi.org/10.1101/gr.1774904] [PMID: 15173116]
Khatun, M.S.; Hasan, M.M.; Mollah, M.N.H.; Kurata, H. SIPMA: A Systematic Identification of Protein-Protein Interactions in Zea mays Using Autocorrelation Features in a Machine-Learning Framework 2018. IEEE 18th International Conference on Bioin-formatics and Bioengineering (BIBE), Taichung, Taiwan, , 2018; pp. 122-125.
Romero-Molina, S.; Ruiz-Blanco, Y.B.; Harms, M.; Münch, J.; Sanchez-Garcia, E. PPI-Detect: A support vector machine model for sequence-based prediction of protein-protein interactions, J. Comput. Chem., 2019, 40(11), 1233-1242.
[http://dx.doi.org/10.1002/jcc.25780] [PMID: 30768790]
An, J.Y.; You, Z.H.; Zhou, Y.; Wang, D.F. Sequence-based predic-tion of protein-protein interactions using gray wolf optimizer-based relevance vector machine, Evol. Bioinform. Online, 2019, 151176934319844522
[http://dx.doi.org/10.1177/1176934319844522] [PMID: 31080346]
Sun, T.; Zhou, B.; Lai, L.; Pei, J. Sequence-based prediction of protein protein interaction using a deep-learning algorithm, BMC Bioinformatics, 2017, 18(1), 277.
[http://dx.doi.org/10.1186/s12859-017-1700-2] [PMID: 28545462]
Xia, J.F.; Han, K.; Huang, D.S. Sequence-based prediction of pro-tein-protein interactions by means of rotation forest and autocorre-lation descriptor, Protein Pept. Lett., 2010, 17(1), 137-145.
[http://dx.doi.org/10.2174/092986610789909403] [PMID: 20214637]
Huang, Y.A.; You, Z.H.; Chen, X.; Chan, K.; Luo, X. Sequence-based prediction of protein-protein interactions using weighted sparse representation model combined with global encoding, BMC Bioinformatics, 2016, 17(1), 184.
[http://dx.doi.org/10.1186/s12859-016-1035-4] [PMID: 27112932]
Eid, F.E.; ElHefnawi, M.; Heath, L.S. DeNovo: virus-host se-quence-based protein-protein interaction prediction, Bioinformatics, 2016, 32(8), 1144-1150.
[http://dx.doi.org/10.1093/bioinformatics/btv737] [PMID: 26677965]
Hamp, T.; Rost, B. Evolutionary profiles improve protein-protein interaction prediction from sequence, Bioinformatics, 2015, 31(12), 1945-1950.
[http://dx.doi.org/10.1093/bioinformatics/btv077] [PMID: 25657331]
Zahiri, J.; Yaghoubi, O.; Mohammad-Noori, M.; Ebrahimpour, R.; Masoudi-Nejad, A. PPIevo: protein-protein interaction prediction from PSSM based evolutionary information, Genomics, 2013, 102(4), 237-242.
[http://dx.doi.org/10.1016/j.ygeno.2013.05.006] [PMID: 23747746]
Zahiri, J.; Mohammad-Noori, M.; Ebrahimpour, R.; Saadat, S.; Bozorgmehr, J.H.; Goldberg, T.; Masoudi-Nejad, A. LocFuse: hu-man protein-protein interaction prediction via classifier fusion u-sing protein localization information, Genomics, 2014, 104(6 Pt B), 496-503.
[http://dx.doi.org/10.1016/j.ygeno.2014.10.006] [PMID: 25458812]
Neuvirth, H.; Raz, R.; Schreiber, G. ProMate: a structure based prediction program to identify the location of protein-protein bin-ding sites, J. Mol. Biol., 2004, 338(1), 181-199.
[http://dx.doi.org/10.1016/j.jmb.2004.02.040] [PMID: 15050833]
Yang, S.; Li, H.; He, H.; Zhou, Y.; Zhang, Z. Critical assessment and performance improvement of plant-pathogen protein-protein interaction prediction methods, Brief. Bioinform., 2019, 20(1), 274-287.
[http://dx.doi.org/10.1093/bib/bbx123] [PMID: 29028906]
Alonso-Lopez, D; Campos-Laborie, FJ; Gutierrez, MA; Lambourne, L; Calderwood, MA; Vidal, M.; De Las Rivas, J. APID database: redefining protein-protein interaction experimental evidences and binary interactomes, Database , 2019.
Poole, R.L. The TAIR database, Methods Mol. Biol., 2007, 406, 179-212.
[PMID: 18287693]
Alanis-Lobato, G.; Andrade-Navarro, M.A.; Schaefer, M.H. HIP-PIE v2.0: enhancing meaningfulness and reliability of protein-protein interaction networks, Nucleic Acids Res., 2017, 45(D1), D408-D414.
[http://dx.doi.org/10.1093/nar/gkw985] [PMID: 27794551]
Zhu, G.; Wu, A.; Xu, X.J.; Xiao, P.P.; Lu, L.; Liu, J.; Cao, Y.; Chen, L.; Wu, J.; Zhao, X.M. PPIM: a protein-protein interaction database for maize, Plant Physiol., 2016, 170(2), 618-626.
[http://dx.doi.org/10.1104/pp.15.01821] [PMID: 26620522]
Oughtred, R.; Stark, C.; Breitkreutz, B.J.; Rust, J.; Boucher, L.; Chang, C.; Kolas, N.; O’Donnell, L.; Leung, G.; McAdam, R.; Zhang, F.; Dolma, S.; Willems, A.; Coulombe-Huntington, J.; Chatr-Aryamontri, A.; Dolinski, K.; Tyers, M. The BioGRID inter-action database: 2019 update, Nucleic Acids Res., 2019, 47(D1), D529-D541.
[http://dx.doi.org/10.1093/nar/gky1079] [PMID: 30476227]
Chatr-Aryamontri, A.; Oughtred, R.; Boucher, L.; Rust, J.; Chang, C.; Kolas, N.K.; O’Donnell, L.; Oster, S.; Theesfeld, C.; Sellam, A.; Stark, C.; Breitkreutz, B.J.; Dolinski, K.; Tyers, M. The Bio-GRID interaction database: 2017 update, Nucleic Acids Res., 2017, 45(D1), D369-D379.
[http://dx.doi.org/10.1093/nar/gkw1102] [PMID: 27980099]
Xenarios, I.; Salwínski, L.; Duan, X.J.; Higney, P.; Kim, S.M.; Eisenberg, D. DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions, Nucleic Acids Res., 2002, 30(1), 303-305.
[http://dx.doi.org/10.1093/nar/30.1.303] [PMID: 11752321]
Hashemifar, S.; Neyshabur, B.; Khan, A.A.; Xu, J. Predicting pro-tein-protein interactions through sequence-based deep learning, Bioinformatics, 2018, 34(17), i802-i810.
[http://dx.doi.org/10.1093/bioinformatics/bty573] [PMID: 30423091]
Fu, L.; Niu, B.; Zhu, Z.; Wu, S.; Li, W. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics, 2012, 28(23), 3150-3152.
[http://dx.doi.org/10.1093/bioinformatics/bts565] [PMID: 23060610]
Chen, K.H.; Wang, T.F.; Hu, Y.J. Protein-protein interaction pre-diction using a hybrid feature representation and a stacked genera-lization scheme, BMC Bioinformatics, 2019, 20(1), 308.
[http://dx.doi.org/10.1186/s12859-019-2907-1] [PMID: 31182027]
Ray, S.; Alberuni, S.; Maulik, U. Computational prediction of HCV-human protein-protein interaction via topological analysis of HCV infected PPI modules, IEEE Trans. Nanobioscience, 2018, 17(1), 55-61.
[http://dx.doi.org/10.1109/TNB.2018.2797696] [PMID: 29570075]
Sze-To, A.; Fung, S.; Lee, E.A.; Wong, A.K.C. Prediction of pro-tein-protein interaction via co-occurring aligned pattern clusters, Methods, 2016, 110, 26-34.
[http://dx.doi.org/10.1016/j.ymeth.2016.07.018] [PMID: 27476008]
Zhang, L.; Yu, G.; Guo, M.; Wang, J. Predicting protein-protein interactions using high-quality non-interacting pairs, BMC Bioinformatics, 2018, 19(Suppl. 19), 525.
[http://dx.doi.org/10.1186/s12859-018-2525-3] [PMID: 30598096]
Li, Y.; Ilie, L. SPRINT: ultrafast protein-protein interaction predic-tion of the entire human interactome, BMC Bioinformatics, 2017, 18(1), 485.
[http://dx.doi.org/10.1186/s12859-017-1871-x] [PMID: 29141584]
Guo, Y.; Li, M.; Pu, X.; Li, G.; Guang, X.; Xiong, W.; Li, J. PRED_PPI: a server for predicting protein-protein interactions ba-sed on sequence data with probability assignment, BMC Res. Notes, 2010, 3, 145.
[http://dx.doi.org/10.1186/1756-0500-3-145] [PMID: 20500905]
Quan, L.; Wu, H.; Lyu, Q.; Zhang, Y. DAMpred: recognizing disease-associated nsSNPs through bayes-guided neural-network model built on low-resolution structure prediction of proteins and protein-protein interactions, J. Mol. Biol., 2019, 431(13), 2449-2459.
[http://dx.doi.org/10.1016/j.jmb.2019.02.017] [PMID: 30796987]
Kong, M.; Zhang, Y.; Xu, D.; Chen, W.; Dehmer, M. FCTP-WSRC: protein-protein interactions prediction via weighted sparse representation based classification Front. Genet., 2020, 11, 18.
[http://dx.doi.org/10.3389/fgene.2020.00018] [PMID: 32117437]
Murakami, Y.; Mizuguchi, K. Homology-based prediction of inter-actions between proteins using Averaged One-Dependence Estima-tors BMC Bioinformatics, 2014, 15, 213.
[http://dx.doi.org/10.1186/1471-2105-15-213] [PMID: 24953126]
Islam, M.M.; Alam, M.J.; Ahmed, F.F.; Hasan, M.M.; Mollah, M.N.H. Improved prediction of protein-protein interaction mapping on Homo sapiens by using amino acid sequence features in a su-pervised learning framework, Protein Pept. Lett., 2020.
[http://dx.doi.org/10.2174/0929866527666200610141258] [PMID: 32520672]
Mosharaf, M.P.; Hassan, M.M.; Ahmed, F.F.; Khatun, M.S.; Moni, M.A.; Mollah, M.N.H. Computational prediction of protein ubiquitination sites mapping on Arabidopsis thaliana, Comput. Biol. Chem., 2020, 85: 107238.
[http://dx.doi.org/10.1016/j.compbiolchem.2020.107238] [PMID: 32114285]
Altschul, S.F.; Madden, T.L.; Schäffer, A.A.; Zhang, J.; Zhang, Z.; Miller, W.; Lipman, D.J. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res., 1997, 25(17), 3389-3402.
[http://dx.doi.org/10.1093/nar/25.17.3389] [PMID: 9254694]
Hasan, M.M.; Khatun, M.S. Recent progress and challenges for protein pupylation sites prediction, EC Proteomics and Bioinformatics, 2017, 2(1), 36-45.
Murakami, Y.; Mizuguchi, K. PSOPIA: Toward more reliable protein-protein interaction prediction from sequence information. 2017 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS), Okinawa, 2017, pp. 255-261.
Li, Z.W.; You, Z.H.; Chen, X.; Gui, J.; Nie, R. Highly accurate prediction of protein-protein interactions via incorporating evoluti-onary information and physicochemical characteristics, Int. J. Mol. Sci., 2016, 17(9), E1396.
[http://dx.doi.org/10.3390/ijms17091396] [PMID: 27571061]
Zhu-Hong You, ; MengChu Zhou, ; Xin Luo, ; Shuai, L. Highly efficient framework for predicting interactions between proteins, IEEE Trans. Cybern., 2017, 47(3), 731-743.
[http://dx.doi.org/10.1109/TCYB.2016.2524994] [PMID: 28113829]
Kawashima, S.; Pokarowski, P.; Pokarowska, M.; Kolinski, A.; Katayama, T.; Kanehisa, M. AAindex: amino acid index database, progress report 2008, Nucleic Acids Res., . 2008, 36(Database issue), D202-D205.
[PMID: 17998252]
Kawashima, S.; Kanehisa, M. AAindex: amino acid index data-base, Nucleic Acids Res., 2000, 28(1), 374.
[http://dx.doi.org/10.1093/nar/28.1.374] [PMID: 10592278]
Narykov, O.; Bogatov, D.; Korkin, D. DISPOT: a simple know-ledge-based protein domain interaction statistical potential Bioinformatics, 2019, 35(24), 5374-5378.
[http://dx.doi.org/10.1093/bioinformatics/btz587] [PMID: 31350874]
Li, X.; Yang, L.; Zhang, X.; Jiao, X. Prediction of protein-protein interactions based on domain., Comput. Math. Methods Med., 2019, 2019, 5238406.
[http://dx.doi.org/10.1155/2019/5238406] [PMID: 31531123]
Wojcik, J.; Schächter, V. Protein-protein interaction map inference using interacting domain profile pairs, Bioinformatics, 2001, 17(Suppl. 1), S296-S305.
[http://dx.doi.org/10.1093/bioinformatics/17.suppl_1.S296] [PMID: 11473021]
Kim, W.K.; Park, J.; Suh, J.K. Large scale statistical prediction of protein-protein interaction by potentially interacting domain (PID) pair; ; Genome Informatics, 2002, pp. 22-50.
Hayashida, M.; Kamada, M.; Song, J.; Akutsu, T. Conditional random field approach to prediction of protein-protein interactions using domain information BMC Syst. Biol., 2011, 5(Suppl. 1), S8.
[http://dx.doi.org/10.1186/1752-0509-5-S1-S8] [PMID: 21689483]
Ghadie, M.A.; Lambourne, L.; Vidal, M.; Xia, Y. Domain-based prediction of the human isoform interactome provides insights into the functional impact of alternative splicing, PLOS Comput. Biol., 2017, 13(8), e1005717.
[http://dx.doi.org/10.1371/journal.pcbi.1005717] [PMID: 28846689]
Hasan, M.M.; Zhou, Y.; Lu, X.; Li, J.; Song, J.; Zhang, Z. Compu-tational identification of protein pupylation sites by using profile-based composition of k-spaced amino acid pairs, PLoS One, 2015, 10(6), e0129635.
[http://dx.doi.org/10.1371/journal.pone.0129635] [PMID: 26080082]
Hasan, M.M.; Kurata, H. iLMS, Computational Identification of Lysine-Malonylation Sites by Combining Multiple Sequence Fea-tures. 2018 IEEE 18th International Conference on Bioinformatics and Bioengineering (BIBE), Taichung, Taiwan, , 2018; pp. 356-359.
Liaw, A. Wiener: Classification and regression by random forest, R News, 2002, 2, 18-22.
Su, R.; Hu, J.; Zou, Q.; Manavalan, B.; Wei, L. Empirical compari-son and analysis of web-based cell-penetrating peptide prediction tools, Brief. Bioinform., 2019, 21(2), 408-420.
[http://dx.doi.org/10.1093/bib/bby124] [PMID: 30649170]
Shoombuatong, W.; Schaduangrat, N.; Pratiwi, R.; Nantasenamat, C. THPep: a machine learning-based approach for predicting tumor homing peptides, Comput. Biol. Chem., 2019, 80, 441-451.
[http://dx.doi.org/10.1016/j.compbiolchem.2019.05.008] [PMID: 31151025]
Schaduangrat, N.; Nantasenamat, C.; Prachayasittikul, V.; Shoombuatong, W. Meta-iAVP: a sequence-based meta-predictor for im-proving the prediction of antiviral peptides using effective feature representation. Int. J. Mol. Sci., 2019, 20(22), E5743.
[http://dx.doi.org/10.3390/ijms20225743] [PMID: 31731751]
Win, T.S.; Malik, A.A.; Prachayasittikul, V.S; Wikberg, J.E.; Nantasenamat, C.; Shoombuatong, W. HemoPred: a web server for predicting the hemolytic activity of peptides Future Med. Chem., 2017, 9(3), 275-291.
[http://dx.doi.org/10.4155/fmc-2016-0188] [PMID: 28211294]
Manavalan, B.; Subramaniyam, S.; Shin, T.H.; Kim, M.O.; Lee, G. Machine-learning-based prediction of cell-penetrating peptides and their uptake efficiency with improved accuracy. J. Proteome Res., 2018, 17(8), 2715-2726.
[http://dx.doi.org/10.1021/acs.jproteome.8b00148] [PMID: 29893128]
Manavalan, B.; Shin, T.H.; Kim, M.O.; Lee, G. PIP-EL: a new ensemble learning method for improved proinflammatory peptide predictions, Front. Immunol., 2018, 9, 1783.
[http://dx.doi.org/10.3389/fimmu.2018.01783] [PMID: 30108593]
Boopathi, V.; Subramaniyam, S.; Malik, A.; Lee, G.; Manavalan, B.; Yang, D.C. mACPpred: a support vector machine-based meta-predictor for identification of anticancer peptides. Int. J. Mol. Sci., 2019, 20(8), E1964.
[http://dx.doi.org/10.3390/ijms20081964] [PMID: 31013619]
Manavalan, B.; Basith, S.; Shin, T.H.; Wei, L.; Lee, G. mAHT-Pred: a sequence-based meta-predictor for improving the prediction of anti-hypertensive peptides using effective feature representation. Bioinformatics, 2018, 35(16), 2757-2765.
[PMID: 30590410]
Hasan, M.M.; Khatun, M.S.; Mollah, M.N.H.; Yong, C.; Guo, D. A systematic identification of species-specific protein succinylation sites using joint element features information. Int. J. Nanomedicine, 2017, 12, 6303-6315.
[http://dx.doi.org/10.2147/IJN.S140875] [PMID: 28894368]
Hasan, M.M.; Yang, S.; Zhou, Y.; Mollah, M.N. SuccinSite: a computational tool for the prediction of protein succinylation sites by exploiting the amino acid patterns and properties, Mol. Biosyst., 2016, 12(3), 786-795.
[http://dx.doi.org/10.1039/C5MB00853K] [PMID: 26739209]
Hasan, M.M.; Schaduangrat, N.; Basith, S.; Lee, G.; Shoombuatong, W.; Manavalan, B. HLPpred-Fuse: improved and robust pre-diction of hemolytic peptide and its activity by fusing multiple fea-ture representation, Bioinformatics, 2020, 36(11), 3350-3356.
[http://dx.doi.org/10.1093/bioinformatics/btaa160] [PMID: 32145017]
Hasan, M.M.; Manavalan, B.; Shoombuatong, W.; Khatun, M.S.; Kurata, H. i6mA-Fuse: improved and robust prediction of DNA 6 mA sites in the Rosaceae genome by fusing multiple feature re-presentation. Plant Mol. Biol., 2020, 103(1-2), 225-234.
[http://dx.doi.org/10.1007/s11103-020-00988-y] [PMID: 32140819]
Hasan, M.M.; Manavalan, B.; Khatun, M.S.; Kurata, H. i4mC-ROSE, a bioinformatics tool for the identification of DNA N4-methylcytosine sites in the Rosaceae genome, Int. J. Biol. Macromol., 2019, 157, 752-758.
[PMID: 31805335]
Khatun, S.; Hasan, M.; Kurata, H. Efficient computational model for identification of antitubercular peptides by integrating amino acid patterns and properties. FEBS Lett., 2019, 593(21), 3029-3039.
[http://dx.doi.org/10.1002/1873-3468.13536] [PMID: 31297788]
Khatun, M.S.; Hasan, M.M.; Kurata, H. PreAIP: computational prediction of anti-inflammatory peptides by integrating multiple complementary features, Front. Genet., 2019, 10, 129.
[http://dx.doi.org/10.3389/fgene.2019.00129] [PMID: 30891059]
Hasan, M.M.; Rashid, M.M.; Khatun, M.S.; Kurata, H. Computati-onal identification of microbial phosphorylation sites by the enhan-ced characteristics of sequence information, Sci. Rep., 2019, 9(1), 8258.
[http://dx.doi.org/10.1038/s41598-019-44548-x] [PMID: 31164681]
Hasan, M.M.; Khatun, M.S.; Kurata, H. Large-scale assessment of bioinformatics tools for lysine succinylation sites, Cells, 2019, 8(2)E95
[http://dx.doi.org/10.3390/cells8020095] [PMID: 30696115]
Hasan, M.M.; Khatun, M.S.; Mollah, M.N.H.; Yong, C.; Dianjing, G. NTyroSite: computational identification of protein nitrotyrosine sites using sequence evolutionary features, Molecules, 2018, 23(7)E1667
[http://dx.doi.org/10.3390/molecules23071667] [PMID: 29987232]
Hasan, M.M.; Guo, D.; Kurata, H. Computational identification of protein S-sulfenylation sites by incorporating the multiple sequence features information, Mol. Biosyst., 2017, 13(12), 2545-2550.
[http://dx.doi.org/10.1039/C7MB00491E] [PMID: 28990628]
Hasan, M.M.; Khatun, M.S.; Kurata, H. A comprehensive review of in silico analysis for protein S-sulfenylation sites, Protein Pept. Lett., 2018, 25(9), 815-821.
[http://dx.doi.org/10.2174/0929866525666180905110619] [PMID: 30182830]
Manavalan, B.; Basith, S.; Shin, T.H.; Wei, L.; Lee, G. mAHT-Pred: a sequence-based meta-predictor for improving the prediction of anti-hypertensive peptides using effective feature representation, Bioinformatics, 2019, 35(16), 2757-2765.
[http://dx.doi.org/10.1093/bioinformatics/bty1047] [PMID: 30590410]
Hasan, MM; Manavalan, B.; Khatun, MS; Kurata, H. Prediction of S-nitrosylation sites by integrating support vector machines and random forest, Molecular Omics, 2019, 15(6), 451-458.
Hasan, M.M.; Kurata, H. GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple se-quence features PLoS One, 2018, 13(10), e0200283.
[http://dx.doi.org/10.1371/journal.pone.0200283] [PMID: 30312302]
Win, T.S.; Schaduangrat, N.; Prachayasittikul, V.; Nantasenamat, C.; Shoombuatong, W. PAAP: a web server for predicting antihy-Evolution of Sequence-based Bioinformatics Tools Current Genomics, 2020, Vol. 21, No. 6 463pertensive activity of peptides, Future Med. Chem., 2018, 10(15), 1749-1767.
[http://dx.doi.org/10.4155/fmc-2017-0300] [PMID: 30039980]
Simeon, S.; Shoombuatong, W.; Anuwongcharoen, N.; Preeyanon, L.; Prachayasittikul, V.; Wikberg, J.E.; Nantasenamat, C. osFP: a web server for predicting the oligomeric states of fluorescent pro-teins, J. Cheminform., 2016, 8, 72.
[http://dx.doi.org/10.1186/s13321-016-0185-8] [PMID: 28053671]
Shoombuatong, W.; Prachayasittikul, V.; Anuwongcharoen, N.; Songtawee, N.; Monnor, T.; Prachayasittikul, S.; Prachayasittikul, V.; Nantasenamat, C. Navigating the chemical space of dipeptidyl peptidase-4 inhibitors, Drug Des. Devel. Ther., 2015, 9, 4515-4549.
[PMID: 26309399]
Zhang, B.; Li, J.; Quan, L.; Chen, Y.; Lü, Q. Sequence-based pre-diction of protein-protein interaction sites by simplified long short-term memory network. Neurocomputing, 2019, 86, 100.
Tabei, Y. Scalable prediction of compound-protein interaction on compressed molecular fingerprints, Mol. Inform., 2020, 39(1-2), e1900130.
[http://dx.doi.org/10.1002/minf.201900130] [PMID: 31908150]
Ruas, F.A.D.; Guerra-Sá, R. In silico prediction of protein-protein interaction network induced by Manganese II in Meyerozyma guil-liermondii, Front. Microbiol., 2020, 11, 236.
[http://dx.doi.org/10.3389/fmicb.2020.00236] [PMID: 32140149]
Basith Mail, S.; Manavalan, B.; Shin, T.H.; Lee, D.; Lee, G. Evolu-tion of machine learning algorithms in the prediction and design of anticancer peptides, Curr. Protein Pept. Sci., 2020.
[http://dx.doi.org/10.2174/1389203721666200117171403] [PMID: 31957610]
Manavalan, B.; Basith, S.; Shin, T.H.; Wei, L.; Lee, G. Meta-4mCpred: a sequence-based meta-predictor for accurate DNA 4mC site prediction using effective feature representation., Mol. Ther. Nucleic Acids, 2019, 16, 733-744.
[http://dx.doi.org/10.1016/j.omtn.2019.04.019] [PMID: 31146255]
Alkan, F.; Erten, C. SiPAN: simultaneous prediction and alignment of protein-protein interaction networks. Bioinformatics, 2015, 31(14), 2356-2363.
[http://dx.doi.org/10.1093/bioinformatics/btv160] [PMID: 25788620]
Aloy, P.; Russell, R.B. InterPreTS: protein interaction prediction through tertiary structure Bioinformatics, 2003, 19(1), 161-162.
[http://dx.doi.org/10.1093/bioinformatics/19.1.161] [PMID: 12499311]
Li, Z.; Nie, R.; You, Z.; Cao, C.; Li, J. Using discriminative vector machine model with 2DPCA to predict interactions among pro-teins BMC Bioinformatics, 2019, 20(S25)(Suppl. 25), 694.
[http://dx.doi.org/10.1186/s12859-019-3268-5] [PMID: 31874626]
Göbel, U.; Sander, C.; Schneider, R.; Valencia, A. Correlated mu-tations and residue contacts in proteins. Proteins, 1994, 18(4), 309-317.
[http://dx.doi.org/10.1002/prot.340180402] [PMID: 8208723]
Chen, W.; Feng, P.; Song, X.; Lv, H.; Lin, H. iRNA-m7G: identi-fying N7-methylguanosine sites by fusing multiple features. Mol. Ther. Nucleic Acids, 2019, 18, 269-274.
[http://dx.doi.org/10.1016/j.omtn.2019.08.022] [PMID: 31581051]
Shatabda, S.; Saha, S.; Sharma, A.; Dehzangi, A. iPHLoc-ES: identification of bacteriophage protein locations using evolutionary and structural features. J. Theor. Biol., 2017, 435, 229-237.
[http://dx.doi.org/10.1016/j.jtbi.2017.09.022] [PMID: 28943403]
Charoenkwan, P.; Shoombuatong, W.; Lee, H.C.; Chaijaruwanich, J.; Huang, H.L.; Ho, S.Y. SCMCRYS: predicting protein crystal-lization using an ensemble scoring card method with estimating propensity scores of P-collocated amino acid pairs. PLoS One, 2013, 8(9), e72368.
[http://dx.doi.org/10.1371/journal.pone.0072368] [PMID: 24019868]
Chowdhury, S.Y.; Shatabda, S.; Dehzangi, A. iDNAProt-ES: iden-tification of DNA-binding proteins using evolutionary and structu-ral features. Sci. Rep., 2017, 7(1), 14938.
[http://dx.doi.org/10.1038/s41598-017-14945-1] [PMID: 29097781]
Hasan, M.M.; Khatun, M.S. Prediction of protein Post-Translational Modifi cation sites: an overview. bann. proteom. bioinform.,, 2018, 2, 049-057.
Chen, X.; Huang, L.; Xie, D.; Zhao, Q. EGBMMDA: extreme gradient boosting machine for MiRNA-disease association predic-tion, Cell Death Dis., 2018, 9(1), 3.
[http://dx.doi.org/10.1038/s41419-017-0003-x] [PMID: 29305594]
Li, F.; Chen, J.; Leier, A.; Marquez-Lago, T.; Liu, Q.; Wang, Y.; Revote, J.; Smith, A.I.; Akutsu, T.; Webb, G.I. DeepCleave: a deep learning predictor for caspase and matrix metalloprotease substra-tes and cleavage sites. Bioinformatics, 2019.
[http://dx.doi.org/10.1093/bioinformatics/btz721] [PMID: 31566664]
Manavalan, B.; Basith, S.; Shin, T.H.; Wei, L.; Lee, G. AtbPpred: A robust sequence-based prediction of anti-tubercular peptides u-sing extremely randomized trees, Comput. Struct. Biotechnol. J., 2019, 17, 972-981.
[http://dx.doi.org/10.1016/j.csbj.2019.06.024] [PMID: 31372196]
Basith, S.; Manavalan, B.; Hwan Shin, T.; Lee, G. Machine intelli-gence in peptide therapeutics: A next-generation tool for rapid disease screening, Med. Res. Rev., 2020.
[http://dx.doi.org/10.1002/med.21658] [PMID: 31922268]
Manavalan, B.; Shin, T.H.; Kim, M.O.; Lee, G. AIPpred: sequence-based prediction of anti-inflammatory peptides using random fo-rest, Front. Pharmacol., 2018, 9, 276.
[http://dx.doi.org/10.3389/fphar.2018.00276] [PMID: 29636690]

Rights & PermissionsPrintExport Cite as

Article Details

Year: 2020
Published on: 15 September, 2020
Page: [454 - 463]
Pages: 10
DOI: 10.2174/1389202921999200625103936
Price: $65

Article Metrics

PDF: 20