Abstract
Aims: Based on protein sequence information, a simple and effective method was used to analyze protein sequence similarity and predict DNA-binding protein.
Background: It is absolutely necessary that we generate computational methods of low complexity to accurate infer protein structure, function, and evolution in the rapidly growing number of molecular biology data available.
Objective: It is important to generate novel computational algorithms for analyzing and comparing protein sequences with the rapidly growing number of molecular biology data available.
Methods: Based on global and local position representation with the curves of Fermat spiral and normalized moments of inertia of the curve of Fermat spiral, respectively, moreover, composition of 20 amino acids to get the numerical characteristics of protein sequences.
Results: It has been applied to analyze the similarity/dissimilarity of nine ND5 proteins, the analysis results are consistent with the biological evolution theory. Furthermore, we employ the Logistic regression with 5-fold cross-validation to establish the prediction of DNA-binding proteins model, which outperformed the DNAbinder, iDNA-prot, DNA-prot and gDNA-prot by 0.0069-0.609 in terms of F-measure, 0.293-0.898 in terms of MCC in unbalanced dataset.
Conclusion: These results show that our method, namely FermatS, is effective to compare, recognition and prediction the protein sequences.
Keywords: Fermat spiral, mass, moment of inertia, similarity/dissimilarity of species, identification of DNA-binding proteins, logistic regression.
Combinatorial Chemistry & High Throughput Screening
Title:FermatS: A Novel Numerical Representation for Protein Sequence Comparison and DNA-binding Protein Identification
Volume: 24 Issue: 10
Author(s): Yanping Zhang*, Ya Gao, Jianwei Ni, Pengcheng Chen and Xiaosheng Wang
Affiliation:
- School of Mathematics and Physics Science and Engineering, Hebei University of Engineering, Handan 056038,China
Keywords: Fermat spiral, mass, moment of inertia, similarity/dissimilarity of species, identification of DNA-binding proteins, logistic regression.
Abstract:
Aims: Based on protein sequence information, a simple and effective method was used to analyze protein sequence similarity and predict DNA-binding protein.
Background: It is absolutely necessary that we generate computational methods of low complexity to accurate infer protein structure, function, and evolution in the rapidly growing number of molecular biology data available.
Objective: It is important to generate novel computational algorithms for analyzing and comparing protein sequences with the rapidly growing number of molecular biology data available.
Methods: Based on global and local position representation with the curves of Fermat spiral and normalized moments of inertia of the curve of Fermat spiral, respectively, moreover, composition of 20 amino acids to get the numerical characteristics of protein sequences.
Results: It has been applied to analyze the similarity/dissimilarity of nine ND5 proteins, the analysis results are consistent with the biological evolution theory. Furthermore, we employ the Logistic regression with 5-fold cross-validation to establish the prediction of DNA-binding proteins model, which outperformed the DNAbinder, iDNA-prot, DNA-prot and gDNA-prot by 0.0069-0.609 in terms of F-measure, 0.293-0.898 in terms of MCC in unbalanced dataset.
Conclusion: These results show that our method, namely FermatS, is effective to compare, recognition and prediction the protein sequences.
Export Options
About this article
Cite this article as:
Zhang Yanping *, Gao Ya , Ni Jianwei , Chen Pengcheng and Wang Xiaosheng , FermatS: A Novel Numerical Representation for Protein Sequence Comparison and DNA-binding Protein Identification, Combinatorial Chemistry & High Throughput Screening 2021; 24 (10) . https://dx.doi.org/10.2174/1386207323999201117111738
DOI https://dx.doi.org/10.2174/1386207323999201117111738 |
Print ISSN 1386-2073 |
Publisher Name Bentham Science Publisher |
Online ISSN 1875-5402 |
Call for Papers in Thematic Issues
Artificial Intelligence Methods for Biomedical, Biochemical and Bioinformatics Problems
Recently, a large number of technologies based on artificial intelligence have been developed and applied to solve a diverse range of problems in the areas of biomedical, biochemical and bioinformatics problems. By utilizing powerful computing resources and massive amounts of data, methods based on artificial intelligence can significantly improve the ...read more
Eco-friendly Agents for Biological Control of Pathogenic Diseases
The discovery of an alternative biological approach to disease management includes work on medicinal products derived from natural sources as a starting point for the development of eco-friendly agents for these diseases and the injuries they cause, as well as reducing human contact with hazardous chemicals and their residues. We ...read more
Emerging trends in diseases mechanisms, noble drug targets and therapeutic strategies: focus on immunological and inflammatory disorders
Recently infectious and inflammatory diseases have been a key concern worldwide due to tremendous morbidity and mortality world Wide. Recent, nCOVID-9 pandemic is a good example for the emerging infectious disease outbreak. The world is facing many emerging and re-emerging diseases out breaks at present however, there is huge lack ...read more
Exploring Spectral Graph Theory in Combinatorial Chemistry
Scope of the Thematic Issue: Combinatorial chemistry involves the synthesis and analysis of a large number of diverse compounds simultaneously. Traditional methods rely on brute force experimentation, which can be time-consuming and resource-intensive. Spectral Graph Theory, a branch of mathematics dealing with the properties of graphs in relation to the ...read more
- Author Guidelines
- Graphical Abstracts
- Fabricating and Stating False Information
- Research Misconduct
- Post Publication Discussions and Corrections
- Publishing Ethics and Rectitude
- Increase Visibility of Your Article
- Archiving Policies
- Peer Review Workflow
- Order Your Article Before Print
- Promote Your Article
- Manuscript Transfer Facility
- Editorial Policies
- Allegations from Whistleblowers
Related Articles
-
An Update On Proficiency of Voltage-gated Ion Channel Blockers in the
Treatment of Inflammation-associated Diseases
Current Drug Targets Pyrazolo[4,3-e][1,2,4]Triazolo[1,5-c]Pyrimidine Template: Organic and Medicinal Chemistry Approach
Current Organic Chemistry The Development of Ataxia Telangiectasia Mutated Kinase Inhibitors
Mini-Reviews in Medicinal Chemistry Comparative Study on Clinical Characteristics of COVID-19 Patients with or without Digestive Symptoms in Razi Hospital, Ahvaz, Khuzestan
Endocrine, Metabolic & Immune Disorders - Drug Targets Natural Products: Key Prototypes to Drug Discovery Against Neglected Diseases Caused by Trypanosomatids
Current Medicinal Chemistry Potential Role of Rho-Associated Protein Kinase Inhibitors for Glaucoma Treatment
Recent Patents on Endocrine, Metabolic & Immune Drug Discovery Bivalent Ligands Targeting Chemokine Receptor Dimerization: Molecular Design and Functional Studies
Current Topics in Medicinal Chemistry RNA Splicing Manipulation: Strategies to Modify Gene Expression for a Variety of Therapeutic Outcomes
Current Gene Therapy Synthetic and Natural Protease Inhibitors Provide Insights into Parasite Development, Virulence and Pathogenesis
Current Medicinal Chemistry Morphological and Molecular Changes of the Myocardium After Left Ventricular Mechanical Support
Current Cardiology Reviews Treatment of Pulmonary Edema by ENaC Activators/Stimulators
Current Molecular Pharmacology Isomannide and Derivatives. Chemical and Pharmaceutical Applications
Mini-Reviews in Organic Chemistry Sodium Channel Blocking Activity and In-vivo Testing of New Phenylimidazole Derivatives
Letters in Drug Design & Discovery High Mobility Group Box Protein-1 in HIV-1 Infection: Connecting Microbial Translocation, Cell Death and Immune Activation
Current HIV Research CORONAVIRUS and COVID-19: A Systematic Review and Perspective
Current Drug Therapy Ca<sup>2+</sup>/cAMP Ratio as an Inflammatory Index
Current Hypertension Reviews 8-(Heteroaryl)phenalkyl-1-Phenyl-1,3,8-triazaspiro[4.5]decan-4-ones as Opioid Receptor Modulators
Medicinal Chemistry Editorial: Mining for Pharmacophores in Phenotypic Screens
Combinatorial Chemistry & High Throughput Screening Thwarting Coronavirus Infections by Tapping Host Targets: The ‘Greek Gift Sacrifice’ to Curb the Menace of Drug Resistances
Current Molecular Pharmacology Adjunctive Therapies in Severe Pneumonia in Critical Care Patients
Infectious Disorders - Drug Targets