HIV-1 Protease Cleavage Site Prediction Based on Two-Stage Feature Selection Method

Author(s): Bing Niu, Xiao-Cheng Yuan, Preston Roeper, Qiang Su, Chun-Rong Peng, Jing-Yuan Yin, Juan Ding, HaiPeng Li, Wen-Cong Lu.

Journal Name: Protein & Peptide Letters

Volume 20 , Issue 3 , 2013

Become EABM
Become Reviewer

Abstract:

Knowledge of the mechanism of HIV protease cleavage specificity is critical to the design of specific and effective HIV inhibitors. Searching for an accurate, robust, and rapid method to correctly predict the cleavage sites in proteins is crucial when searching for possible HIV inhibitors. In this article, HIV-1 protease specificity was studied using the correlation-based feature subset (CfsSubset) selection method combined with Genetic Algorithms method. Thirty important biochemical features were found based on a jackknife test from the original data set containing 4,248 features. By using the AdaBoost method with the thirty selected features the prediction model yields an accuracy of 96.7% for the jackknife test and 92.1% for an independent set test, with increased accuracy over the original dataset by 6.7% and 77.4%, respectively. Our feature selection scheme could be a useful technique for finding effective competitive inhibitors of HIV protease.

Keywords: Correlation-based feature subset (CfsSubset), genetic algorithm (GA), adaboost, feature selection, HIV protease, chou’s distorted key theory, HIV inhibitor, HIV-1 protease specificity, Genetic Algorithms method, jackknife test

Rights & PermissionsPrintExport Cite as

Article Details

VOLUME: 20
ISSUE: 3
Year: 2013
Page: [290 - 298]
Pages: 9
DOI: 10.2174/0929866511320030007
Price: $65

Article Metrics

PDF: 10