Identification of Carcinogenic Chemicals with Network Embedding and Deep Learning Methods

Author(s): Xuefei Peng, Lei Chen*, Jian-Peng Zhou

Journal Name: Current Bioinformatics

Volume 15 , Issue 9 , 2020


Become EABM
Become Reviewer
Call for Editor

Graphical Abstract:


Abstract:

Background: Cancer is the second leading cause of human death in the world. To date, many factors have been confirmed to be the cause of cancer. Among them, carcinogenic chemicals have been widely accepted as the important ones. Traditional methods for detecting carcinogenic chemicals are of low efficiency and high cost.

Objective: The aim of this study was to design an efficient computational method for the identification of carcinogenic chemicals.

Methods: A new computational model was proposed for detecting carcinogenic chemicals. As a data-driven model, carcinogenic and non-carcinogenic chemicals were obtained from Carcinogenic Potency Database (CPDB). These chemicals were represented by features extracted from five chemical networks, representing five types of chemical associations, via a network embedding method, Mashup. Obtained features were fed into a powerful deep learning method, recurrent neural network, to build the model.

Results: The jackknife test on such model provided the F-measure of 0.971 and AUROC of 0.971.

Conclusion: The proposed model was quite effective and was superior to the models with traditional machine learning algorithms, classic chemical encoding schemes or direct usage of chemical associations.

Keywords: Carcinogenicity, carcinogenic chemical, network embedding method, deep learning, recurrent neural network, cancer.

Rights & PermissionsPrintExport Cite as

Article Details

VOLUME: 15
ISSUE: 9
Year: 2020
Page: [1017 - 1026]
Pages: 10
DOI: 10.2174/1574893615999200414084317
Price: $65

Article Metrics

PDF: 18