Background: DNA-binding proteins are very important to many biomolecular functions.
The traditional experimental methods are expensive and time-consuming, so, computational methods
that can predict whether a protein is a DNA-binding protein or not are very helpful to researchers.
Machine learning has been widely used in many research areas. Many researchers have proposed machine
learning methods for DNA-binding protein prediction, and this paper highlights their advantages
Objective: There are many computational methods that can predict DNA-binding proteins. Every
method uses different features and different classifier algorithms. In this paper, a review of these
methods is provided to find out some common procedures that can help researchers to develop more
Methods: Firstly, the information stored in the protein sequence and gene sequence is presented. That
information is the basis to find out the patterns leading to binding. Then, feature extraction methods
and classifier algorithms are discussed. At last, some commonly used benchmark datasets are analysed
and evaluated by methods.
Conclusion: In this review, we analyzed some popular computational methods to predict DNAbinding
protein. From those methods, we highlighted many features necessary to build up an accurate
DNA-binding protein classifier. This can also help researchers to build up more useful computational
tools. Currently, there are some machine learning methods with good performance in predicting DNAbinding
proteins. The performance can be improved by using different kinds of features and classifiers.