A Review of Ensemble Methods in Bioinformatics

Author(s): Pengyi Yang, Yee Hwa Yang, Bing B. Zhou, Albert Y. Zomaya

Journal Name: Current Bioinformatics

Volume 5 , Issue 4 , 2010

Become EABM
Become Reviewer
Call for Editor


Ensemble learning is an intensively studied technique in machine learning and pattern recognition. Recent work in computational biology has seen an increasing use of ensemble learning methods due to their unique advantages in dealing with small sample size, high-dimensionality, and complex data structures. The aim of this article is two-fold. Firstly, it is to provide a review of the most widely used ensemble learning methods and their application in various bioinformatics problems, including the main topics of gene expression, mass spectrometry-based proteomics, gene-gene interaction identification from genome-wide association studies, and prediction of regulatory elements from DNA and protein sequences. Secondly, we try to identify and summarize future trends of ensemble methods in bioinformatics. Promising directions such as ensemble of support vector machines, meta-ensembles, and ensemble based feature selection are discussed.

Keywords: Ensemble learning, bioinformatics, microarray, mass spectrometry-based proteomics, gene-gene interaction, regulatory elements prediction, ensemble of support vector machines, meta ensemble, ensemble feature selection.

Rights & PermissionsPrintExport Cite as

Article Details

Year: 2010
Page: [296 - 308]
Pages: 13
DOI: 10.2174/157489310794072508
Price: $65

Article Metrics

PDF: 16