Generic placeholder image

Current Bioinformatics


ISSN (Print): 1574-8936
ISSN (Online): 2212-392X

Short Communication

DILI-Stk: An Ensemble Model for the Prediction of Drug-induced Liver Injury of Drug Candidates

Author(s): Jingyu Lee, Myeong-Sang Yu and Dokyun Na*

Volume 17, Issue 3, 2022

Published on: 17 March, 2022

Page: [296 - 303] Pages: 8

DOI: 10.2174/1574893617666211228113939

Price: $65


Background: Drug-induced Liver Injury (DILI) is a leading cause of drug failure, accounting for nearly 20% of drug withdrawal. Thus, there has been a great demand for in silico DILI prediction models for successful drug discovery. To date, various models have been developed for DILI prediction; however, building an accurate model for practical use in drug discovery remains challenging.

Methods: We constructed an ensemble model composed of three high-performance DILI prediction models to utilize the unique advantage of each machine learning algorithm.

Results: The ensemble model exhibited high predictive performance, with an area under the curve of 0.88, sensitivity of 0.83, specificity of 0.77, F1-score of 0.82, and accuracy of 0.80. When a test dataset collected from the literature was used to compare the performance of our model with publicly available DILI prediction models, our model achieved an accuracy of 0.77, sensitivity of 0.82, specificity of 0.72, and F1-score of 0.79, which were higher than those of the other DILI prediction models. As many published DILI prediction models are not available for public access, which hinders in silico drug discovery, we made our DILI prediction model publicly accessible (

Conclusion: We expect that our ensemble model may facilitate advancements in drug discovery by providing a highly predictive model and reducing the drug withdrawal rate.

Keywords: Drug discovery, hepatotoxicity, machine learning, quantitative structure-activity relationship model, drug-induced liver injury, xenobiotics metabolism, xenobiotic metabolism.

« Previous
Graphical Abstract
Bajželj B, Drgan V. Hepatotoxicity modeling using counter-propagation artificial neural networks: Handling an imbalanced classification problem. Molecules 2020; 25(3): 481.
[ ] [PMID: 31979300]
Sturgill MG, Lambert G. Xenobiotic-induced hepatotoxicity: Mechanisms of liver injury and methods of monitoring hepatic function. Clin Chem 1997; 43(8): 1512-26.
Walker PA, Ryder S, Lavado A, Dilworth C, Riley RJ. The evolution of strategies to minimise the risk of human drug-induced liver injury (DILI) in drug discovery and development. Arch Toxicol 2020; 94(8): 2559-85.
[ ] [PMID: 32372214]
Ionescu C, Caira MR, Eds. Drug metabolism: Current concepts. Netherlands: Springer 2006.
Benedetti MS, Whomsley R, Poggesi I, et al. Drug metabolism and pharmacokinetics. Drug Metab Pharmacokinet 2009; 41(3): 344-90.
[ ] [PMID: 19601718]
Andrade RJ, Robles M, Fernández-Castañer A, López-Ortega S, López-Vega MC, Lucena MI. Assessment of drug-induced hepatotoxicity in clinical practice: A challenge for gastroenterologists. World J Gastroenterol 2007; 13(3): 329-40.
[ ] [PMID: 17230599]
Chen M, Borlak J, Tong W. Predicting idiosyncratic drug-induced liver injury: Some recent advances. Expert Rev Gastroenterol Hepatol 2014; 8(7): 721-3.
[ ] [PMID: 24857265]
Remmer H. The role of theliver in drug metabolism. Am J Med 1970; 49(5): 617-29.
[ ] [PMID: 4924589]
Gregus Z, Ed. Mechanisms of toxicity. New York: McGraw-Hill Professional 2008.
Przybylak KR, Cronin MT. In silico models for drug-induced liver injury--current status. Expert Opin Drug Metab Toxicol 2012; 8(2): 201-17.
[ ] [PMID: 22248266]
Schroeter TS, Schwaighofer A, Mika S, et al. Estimating the domain of applicability for machine learning QSAR models: A study on aque-ous solubility of drug discovery molecules. J Comput Aided Mol Des 2007; 21(12): 651-64.
[ ] [PMID: 18060505]
Ponzoni I, Sebastián-Pérez V, Requena-Triguero C, et al. Hybridizing feature selection and feature learning approaches in QSAR modeling for drug discovery. Sci Rep 2017; 7(1): 2403.
[ ] [PMID: 28546583]
Liu Y, Ed. Drug design by machine learning: Ensemble learning for QSAR modeling. Proceedings of the fourth International Conference on Machine Learning and Applications. 2005 Dec 15-17; LA, USA. Los Angeles: IEEE 2006.
Chen M, Hong H, Fang H, et al. Quantitative structure-activity relationship models for predicting drug-induced liver injury based on FDA-approved drug labeling annotation and using a large collection of drugs. Toxicol Sci 2013; 136(1): 242-9.
[ ] [PMID: 23997115]
He S, Ye T, Wang R, et al. An in silico model for predicting drug-induced hepatotoxicity. Int J Mol Sci 2019; 20(8): 1897.
[ ] [PMID: 30999595]
Ai H, Chen W, Zhang L, et al. Predicting drug-induced liver injury using ensemble learning methods and molecular fingerprints. Toxicol Sci 2018; 165(1): 100-7.
[ ] [PMID: 29788510]
Liu Z, Shi Q, Ding D, Kelly R, Fang H, Tong W. Translating clinical findings into knowledge in drug safety evaluation--drug induced liver injury prediction system (DILIps). PLOS Comput Biol 2011; 7(12): e1002310.
[ ] [PMID: 22194678]
Ancuceanu R, Hovanet MV, Anghel AI, et al. Computational models using multiple machine learning algorithms for predicting drug hepa-totoxicity with the DILIrank dataset. Int J Mol Sci 2020; 21(6): 2114.
[ ] [PMID: 32204453]
Xiong G, Wu Z, Yi J, et al. ADMETlab 2.0: An integrated online platform for accurate and comprehensive predictions of ADMET proper-ties. Nucleic Acids Res 2021; 49(W1): W5-W14.
[ ] [PMID: 33893803]
Banerjee P, Eckert AO, Schrey AK, Preissner R. ProTox-II: A webserver for the prediction of toxicity of chemicals. Nucleic Acids Res 2018; 46(W1): W257-63.
[ ] [PMID: 29718510]
Liew CY, Lim YC, Yap CW. Mixed learning algorithms and features ensemble in hepatotoxicity prediction. J Comput Aided Mol Des 2011; 25(9): 855-71.
[ ] [PMID: 21898162]
Zhu X, Kruhlak NL. Construction and analysis of a human hepatotoxicity database suitable for QSAR modeling using post-market safety data. Toxicology 2014; 321: 62-72.
[ ] [PMID: 24721472]
Center for Drug Evaluation and Research (U.S.). Orange book: Approved drug products with therapeutic equivalence evaluations. US Food Drug Adm 2013. Available from: 2013.
Xia J, Wright J, Adams CE. Five large Chinese biomedical bibliographic databases: Accessibility and coverage. Health Info Libr J 2008; 25(1): 55-61.
Mauri A, Consonni V, Pavan M, Todeschini R. Dragon software: An easy approach to molecular descriptor calculations. Match (Mulh) 2007; 56(2): 237-48.
Eesa AS, Kh Arabo W. A normalization methods for backpropagation: A comparative study. Sci J Univ Zakho 2017; 5(4): 319-23.
Ranjan GSK, Kumar Verma A, Radhika S, Eds. K-nearest neighbors and grid search CV based real time fault monitoring system for indus-tries. Proceedings of the 2019 IEEE 5th International Conference for Convergence in Technology. 2019 March 29-31; Bombay, India. 2020.
Cao C, Wang Z. IMCStacking: Cost-sensitive stacking learning with feature inverse mapping for imbalanced problems. Knowl Base Syst 2018; 150: 27-37.
El-Rashidy N, El-Sappagh S, Abuhmed T, Abdelrazek S, El-Bakry HM. Intensive care unit mortality prediction: An improved patientspecific stacking ensemble model. IEEE Access 2020; 8: 133541-64.
Cockroft NT, Cheng X, Fuchs JR. STarFish: A stacked ensemble target fishing approach and its application to natural products. J Chem Inf Model 2019; 59(11): 4906-20.
[ ] [PMID: 31589422]
He H, Zhang W, Zhang S. A novel ensemble method for credit scoring: Adaption of different imbalance ratios. Expert Syst Appl 2018; 98: 105-17.
Williams ML, James WP, Rose MT. Variable segmentation and ensemble classifiers for predicting dairy cow behaviour. Biosyst Eng 2019; 178: 156-67.
Layeghian Javan S, Sepehri MM, Layeghian Javan M, Khatibi T. An intelligent warning model for early prediction of cardiac arrest in sep-sis patients. Comput Methods Programs Biomed 2019; 178: 47-58.
[ ] [PMID: 31416562]
Kaplowitz N, Deleve LD. Drug-Induced Liver Disease. New York 2003.
Williams M. An encyclopedia of chemicals, drugs, and biologicals. NJ, USA: Merck & Co, Inc. 1989.
Kotsampasakou E, Montanari F, Ecker GF. Predicting drug-induced liver injury: The importance of data curation. Toxicology 2017; 389: 139-45.
[ ] [PMID: 28652195]
Wang Y, Xiao Q, Chen P, Wang B. In silico prediction of drug-induced liver injury based on ensemble classifier method. Int J Mol Sci 2019; 20(17): 4106.
[ ] [PMID: 31443562]
Karlos S, Kostopoulos G, Kotsiantis S. A soft-voting ensemble based co-training scheme using static selection for binary classification problems. Algorithms 2020; 13(1): 26.
Le NQK, Do DT, Hung TNK, Lam LHT, Huynh TT, Nguyen NTK. A computational framework based on ensemble deep neural networks for essential genes identification. Int J Mol Sci 2020; 21(23): 9070.
[ ] [PMID: 33260643]
Teschke R, Uetrecht J. Mechanism of idiosyncratic drug induced liver injury (DILI): Unresolved basic issues. Ann Transl Med 2021; 9(8): 730.
[ ] [PMID: 33987428]

Rights & Permissions Print Cite
© 2024 Bentham Science Publishers | Privacy Policy