Generic placeholder image

Current Bioinformatics

Editor-in-Chief

ISSN (Print): 1574-8936
ISSN (Online): 2212-392X

General Research Article

Sequence-based Identification of Allergen Proteins Developed by Integration of PseAAC and Statistical Moments via 5-Step Rule

Author(s): Yaser Daanial Khan*, Ebraheem Alzahrani, Wajdi Alghamdi and Malik Zaka Ullah

Volume 15 , Issue 9 , 2020

Page: [1046 - 1055] Pages: 10

DOI: 10.2174/1574893615999200424085947

Price: $65

Abstract

Background: Allergens are antigens that can stimulate an atopic type I human hypersensitivity reaction by an immunoglobulin E (IgE) reaction. Some proteins are naturally allergenic than others. The challenge for toxicologists is to identify properties that allow proteins to cause allergic sensitization and allergic diseases. The identification of allergen proteins is a very critical and pivotal task. The experimental identification of protein functions is a hectic, laborious and costly task; therefore, computer scientists have proposed various methods in the field of computational biology and bioinformatics using various data science approaches. Objectives: Herein, we report a novel predictor for the identification of allergen proteins.

Methods: For feature extraction, statistical moments and various position-based features have been incorporated into Chou’s pseudo amino acid composition (PseAAC), and are used for training of a neural network.

Results: The predictor is validated through 10-fold cross-validation and Jackknife testing, which gave 99.43% and 99.87% accurate results.

Conclusion: Thus, the proposed predictor can help in predicting the Allergen proteins in an efficient and accurate way and can provide baseline data for the discovery of new drugs and biomarkers.

Keywords: Allergen proteins, prediction, PseAAC, 5-step rule, statistical moments, biomarkers.

Graphical Abstract

Rights & Permissions Print Export Cite as
© 2022 Bentham Science Publishers | Privacy Policy