Protein subcellular localization is closely related to protein functions. Protein can work only in specific subcellular positions, so protein localization in a cell is very important in studies on cytobiology, proteomics, and drug design. Protein subcellular localization prediction based on machine learning is timely and has generated great interest in the field of bioinformatics. This paper reviews the research status of this problem in recent years from the following four aspects: protein dataset construction, features extraction of protein sequence, machine learning algorithms, and web server construction. Finally, we analyzed the challenges in predicting protein subcellular localization and identified possible future research trends.
Department of Computer Science, Xiamen University, Xiamen 361005, China.