A Feature and Algorithm Selection Method for Improving the Prediction of Protein Structural Class

Qianwu       Ni; Lei       Chen

Abstract

Aim and Objective: Correct prediction of protein structural class is beneficial to investigation on protein functions, regulations and interactions. In recent years, several computational methods have been proposed in this regard. However, based on various features, it is still a great challenge to select proper classification algorithm and extract essential features to participate in classification.

Material and Methods: In this study, a feature and algorithm selection method was presented for improving the accuracy of protein structural class prediction. The amino acid compositions and physiochemical features were adopted to represent features and thirty-eight machine learning algorithms collected in Weka were employed. All features were first analyzed by a feature selection method, minimum redundancy maximum relevance (mRMR), producing a feature list. Then, several feature sets were constructed by adding features in the list one by one. For each feature set, thirtyeight algorithms were executed on a dataset, in which proteins were represented by features in the set. The predicted classes yielded by these algorithms and true class of each protein were collected to construct a dataset, which were analyzed by mRMR method, yielding an algorithm list. From the algorithm list, the algorithm was taken one by one to build an ensemble prediction model. Finally, we selected the ensemble prediction model with the best performance as the optimal ensemble prediction model.

Results: Experimental results indicate that the constructed model is much superior to models using single algorithm and other models that only adopt feature selection procedure or algorithm selection procedure.

Conclusion: The feature selection procedure or algorithm selection procedure are really helpful for building an ensemble prediction model that can yield a better performance.

Keywords: Protein structural class prediction, minimum redundancy maximum relevance, feature selection, algorithm selection, ensemble classifier, optimal ensemble prediction model.

« Previous Next »

Rights & Permissions Print Cite

Article Metrics

28

8

1

Journal Information

For Authors

For Editors

For Reviewers

Explore Articles

Open Access

Open Access Articles

For Visitors

DOI https://dx.doi.org/10.2174/1386207320666170314103147	Print ISSN 1386-2073
Publisher Name Bentham Science Publisher	Online ISSN 1875-5402

Combinatorial Chemistry & High Throughput Screening

A Feature and Algorithm Selection Method for Improving the Prediction of Protein Structural Class

Abstract

Artificial Intelligence Methods for Biomedical, Biochemical and Bioinformatics Problems

Eco-friendly Agents for Biological Control of Pathogenic Diseases

Emerging trends in diseases mechanisms, noble drug targets and therapeutic strategies: focus on immunological and inflammatory disorders

Exploring Spectral Graph Theory in Combinatorial Chemistry

Combinatorial Chemistry & High Throughput Screening

A Feature and Algorithm Selection Method for Improving the Prediction of Protein Structural Class

Abstract

Call for Papers in Thematic Issues

Artificial Intelligence Methods for Biomedical, Biochemical and Bioinformatics Problems

Eco-friendly Agents for Biological Control of Pathogenic Diseases

Emerging trends in diseases mechanisms, noble drug targets and therapeutic strategies: focus on immunological and inflammatory disorders

Exploring Spectral Graph Theory in Combinatorial Chemistry

Related Journals

Related Books