Generic placeholder image

Current Bioinformatics

Editor-in-Chief

ISSN (Print): 1574-8936
ISSN (Online): 2212-392X

Research Article

PREDAIP: Computational Prediction and Analysis for Anti-inflammatory Peptide via a Hybrid Feature Selection Technique

Author(s): Dan Lin, Jialin Yu, Ju Zhang, Huan He, Xinyun Guo and Shaoping Shi*

Volume 16, Issue 8, 2021

Published on: 01 June, 2021

Page: [1048 - 1059] Pages: 12

DOI: 10.2174/1574893616666210601111157

Abstract

Background: Anti-Inflammatory Peptides (AIPs) are potent therapeutic agents for inflammatory and autoimmune disorders due to their high specificity and minimal toxicity under normal conditions. Therefore, it is greatly significant and beneficial to identify AIPs for further discovering novel and efficient AIPs-based therapeutics. Recently, three computational approaches, which can effectively identify potential AIPs, have been developed based on machine learning algorithms. However, there are several challenges with the existing three predictors.

Objective: A novel machine learning algorithm needs to be proposed to improve the AIPs prediction accuracy.

Methods: This study attempts to improve the recognition of AIPs by employing multiple primary sequence-based feature descriptors and an efficient feature selection strategy. By sorting features through four enhanced minimal redundancy maximal relevance (emRMR) methods, and then attaching seven different classifiers wrapper methods based on the sequential forward selection algorithm (SFS), we proposed a hybrid feature selection technique emRMR-SFS to optimize feature vectors. Furthermore, by evaluating seven classifiers trained with the optimal feature subset, we developed the Extremely Randomized Tree (ERT) based predictor named PREDAIP for identifying AIPs.

Results: We systematically compared the performance of PREDAIP with the existing tools on independent test dataset. It demonstrates the effectiveness and power of the PREDAIP.

Conclusion: The correlation criteria used in emRMR would affect the selection results of the optimal feature subset at the SFS-wrapper stage, which justifies the necessity for considering different correlation criteria in emRMR.

Keywords: Machine learning, feature selection, enhanced minimal redundancy maximal relevance, sequential forward selection algorithm, prediction, extremely randomized tree.

Graphical Abstract

© 2024 Bentham Science Publishers | Privacy Policy