Abstract
Aim and Objective: The Krüppel-like factors (KLFs) are a family of containing Zn finger(ZF) motif transcription factors with 18 members in human genome, among them, KLF18 is predicted by bioinformatics. KLFs possess various physiological function involving in a number of cancers and other diseases. Here we perform a binary-class classification of KLFs and non-KLFs by machine learning methods.
Material and Method: The protein sequences of KLFs and non-KLFs were searched from UniProt and randomly separate them into training dataset(containing positive and negative sequences) and test dataset(containing only negative sequences), after extracting the 188-dimensional(188D) feature vectors we carry out category with four classifiers(GBDT, libSVM, RF, and k-NN). On the human KLFs, we further dig into the evolutionary relationship and motif distribution, and finally we analyze the conserved amino acid residue of three zinc fingers.
Results: The classifier model from training dataset were well constructed, and the highest specificity(Sp) was 99.83% from a library for support vector machine(libSVM) and all the correctly classified rates were over 70% for 10-fold cross-validation on test dataset. The 18 human KLFs can be further divided into 7 groups and the zinc finger domains were located at the carboxyl terminus, and many conserved amino acid residues including Cysteine and Histidine, and the span and interval between them were consistent in the three ZF domains.
Conclusion: Two classification models for KLFs prediction have been built by novel machine learning methods.
Keywords: Krüppel-like factor, binary-class classification, phylogenetic analysis, motif, a library for support vector machine, machine learning method.
Combinatorial Chemistry & High Throughput Screening
Title:Prediction and Identification of Krüppel-Like Transcription Factors by Machine Learning Method
Volume: 20 Issue: 7
Author(s): Zhijun Liao, Xinrui Wang, Xingyong Chen and Quan Zou*
Affiliation:
- School of Computer and Technology, Tianjin University, Tianjin,China
Keywords: Krüppel-like factor, binary-class classification, phylogenetic analysis, motif, a library for support vector machine, machine learning method.
Abstract: Aim and Objective: The Krüppel-like factors (KLFs) are a family of containing Zn finger(ZF) motif transcription factors with 18 members in human genome, among them, KLF18 is predicted by bioinformatics. KLFs possess various physiological function involving in a number of cancers and other diseases. Here we perform a binary-class classification of KLFs and non-KLFs by machine learning methods.
Material and Method: The protein sequences of KLFs and non-KLFs were searched from UniProt and randomly separate them into training dataset(containing positive and negative sequences) and test dataset(containing only negative sequences), after extracting the 188-dimensional(188D) feature vectors we carry out category with four classifiers(GBDT, libSVM, RF, and k-NN). On the human KLFs, we further dig into the evolutionary relationship and motif distribution, and finally we analyze the conserved amino acid residue of three zinc fingers.
Results: The classifier model from training dataset were well constructed, and the highest specificity(Sp) was 99.83% from a library for support vector machine(libSVM) and all the correctly classified rates were over 70% for 10-fold cross-validation on test dataset. The 18 human KLFs can be further divided into 7 groups and the zinc finger domains were located at the carboxyl terminus, and many conserved amino acid residues including Cysteine and Histidine, and the span and interval between them were consistent in the three ZF domains.
Conclusion: Two classification models for KLFs prediction have been built by novel machine learning methods.
Export Options
About this article
Cite this article as:
Liao Zhijun , Wang Xinrui , Chen Xingyong and Zou Quan*, Prediction and Identification of Krüppel-Like Transcription Factors by Machine Learning Method, Combinatorial Chemistry & High Throughput Screening 2017; 20 (7) . https://dx.doi.org/10.2174/1386207320666170314094951
DOI https://dx.doi.org/10.2174/1386207320666170314094951 |
Print ISSN 1386-2073 |
Publisher Name Bentham Science Publisher |
Online ISSN 1875-5402 |
Call for Papers in Thematic Issues
Artificial Intelligence Methods for Biomedical, Biochemical and Bioinformatics Problems
Recently, a large number of technologies based on artificial intelligence have been developed and applied to solve a diverse range of problems in the areas of biomedical, biochemical and bioinformatics problems. By utilizing powerful computing resources and massive amounts of data, methods based on artificial intelligence can significantly improve the ...read more
Eco-friendly Agents for Biological Control of Pathogenic Diseases
The discovery of an alternative biological approach to disease management includes work on medicinal products derived from natural sources as a starting point for the development of eco-friendly agents for these diseases and the injuries they cause, as well as reducing human contact with hazardous chemicals and their residues. We ...read more
Emerging trends in diseases mechanisms, noble drug targets and therapeutic strategies: focus on immunological and inflammatory disorders
Recently infectious and inflammatory diseases have been a key concern worldwide due to tremendous morbidity and mortality world Wide. Recent, nCOVID-9 pandemic is a good example for the emerging infectious disease outbreak. The world is facing many emerging and re-emerging diseases out breaks at present however, there is huge lack ...read more
Exploring Spectral Graph Theory in Combinatorial Chemistry
Scope of the Thematic Issue: Combinatorial chemistry involves the synthesis and analysis of a large number of diverse compounds simultaneously. Traditional methods rely on brute force experimentation, which can be time-consuming and resource-intensive. Spectral Graph Theory, a branch of mathematics dealing with the properties of graphs in relation to the ...read more
- Author Guidelines
- Graphical Abstracts
- Fabricating and Stating False Information
- Research Misconduct
- Post Publication Discussions and Corrections
- Publishing Ethics and Rectitude
- Increase Visibility of Your Article
- Archiving Policies
- Peer Review Workflow
- Order Your Article Before Print
- Promote Your Article
- Manuscript Transfer Facility
- Editorial Policies
- Allegations from Whistleblowers
Related Articles
-
B7-H3 Immune Checkpoint Protein in Human Cancer
Current Medicinal Chemistry A Critical Review on Potential Advancements of Nanostructured Lipid Carriers in Drug Delivery
Current Nanomedicine Diffusion-Weighted Imaging in Abdominal Oncology
Current Medical Imaging Therapeutic Potential of Targeting TREM-1 in Inflammatory Diseases and Cancer
Current Pharmaceutical Design Signal Transducer and Activator of Transcription 3 Signaling Pathway: A Potential Target in Sarcoma Treatment
Current Enzyme Inhibition Telomerase as a Cancer Target. Development of New Molecules
Current Topics in Medicinal Chemistry Targeting Microtubules to Inhibit Angiogenesis and Disrupt Tumour Vasculature:Implications for Cancer Treatment
Current Cancer Drug Targets Chaperones and Their Role in Telomerase Ribonucleoprotein Biogenesis and Telomere Maintenance
Current Proteomics Ghrelin: A Gastric Peptide that Regulates Hypothalamic Control of Feeding
Current Medicinal Chemistry - Central Nervous System Agents Potentials of Polymeric Nanoparticle as Drug Carrier for Cancer Therapy: With a Special Reference to Pharmacokinetic Parameters
Current Drug Metabolism Recent Advances in Inducible Cyclooxygenase (COX-2) Inhibition.
Current Medicinal Chemistry Biotransformations of Prenylated Hop Flavonoids for Drug Discovery and Production
Current Drug Metabolism Interrelationships Among Gut Microbiota and Host: Paradigms, Role in Neurodegenerative Diseases and Future Prospects
CNS & Neurological Disorders - Drug Targets Interaction between Gastric and Upper Small Intestinal Hormones in the Regulation of Hunger and Satiety: Ghrelin and Cholecystokinin Take the Central Stage
Current Protein & Peptide Science Respiratory Failure in Cancer Patients: Non-Infectious Complications of Antineoplastic Agents for Solid Tumors
Current Respiratory Medicine Reviews Circular RNAs in Eukaryotic Cells
Current Genomics Application of Genetic Polymorphisms in DNA Repair in the Prediction of Cancer Susceptibility and Clinical Outcome
Current Pharmacogenomics Recent Advances in Fluorescent Probes for Monitoring of Hydrogen Sulfide
Current Medicinal Chemistry Understanding Heterogeneity in Supplementation Effects of Selenium in Men: A Study of Stratification Variables and Human Genetics in a Prospective Sample from New Zealand
Current Pharmacogenomics and Personalized Medicine Cancer and Phase II Drug-Metabolizing Enzymes
Current Drug Metabolism