A Similarity Searching System for Biological Phenotype Images Using Deep Convolutional Encoder-decoder Architecture

Author(s): Bizhi Wu, Hangxiao Zhang, Limei Lin, Huiyuan Wang, Yubang Gao, Liangzhen Zhao, Yi-Ping Phoebe Chen*, Riqing Chen*, Lianfeng Gu*.

Journal Name: Current Bioinformatics

Volume 14 , Issue 7 , 2019

Become EABM
Become Reviewer

Graphical Abstract:


Abstract:

Background: The BLAST (Basic Local Alignment Search Tool) algorithm has been widely used for sequence similarity searching. Analogously, the public phenotype images must be efficiently retrieved using biological images as queries and identify the phenotype with high similarity. Due to the accumulation of genotype-phenotype-mapping data, a system of searching for similar phenotypes is not available due to the bottleneck of image processing.

Objective: In this study, we focus on the identification of similar query phenotypic images by searching the biological phenotype database, including information about loss-of-function and gain-of-function.

Methods: We propose a deep convolutional autoencoder architecture to segment the biological phenotypic images and develop a phenotype retrieval system to enable a better understanding of genotype–phenotype correlation.

Results: This study shows how deep convolutional autoencoder architecture can be trained on images from biological phenotypes to achieve state-of-the-art performance in a phenotypic images retrieval system.

Conclusion: Taken together, the phenotype analysis system can provide further information on the correlation between genotype and phenotype. Additionally, it is obvious that the neural network model of image segmentation and the phenotype retrieval system is equally suitable for any species, which has enough phenotype images to train the neural network.

Keywords: Biological phenotype similarity searching, convolutional encoder-decoder architecture, segmentation of biological images, phenotypic image retrieval system, histogram, benchmark dataset.

[1]
Akiyama K, Kurotani A, Iida K, Kuromori T, Shinozaki K, Sakurai T. RARGE II: an integrated phenotype database of Arabidopsis mutant traits using a controlled vocabulary. Plant Cell Physiol 2013; 55(1): e4-.
[2]
Swarbreck D, Wilks C, Lamesch P, et al. The Arabidopsis Information Resource (TAIR): gene structure and function annotation. Nucleic Acids Res 2007; 36(Suppl_1): D1009-4.
[3]
McGinnis S, Madden TL. BLAST: at the core of a powerful and diverse set of sequence analysis tools. Nucleic Acids Res 2004; 32(Suppl_2): W20-5.
[4]
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol 1990; 215(3): 403-10.
[5]
Bengio Y. Learning deep architectures for AI. Foundations and trends® in Machine Learning 2009; 2(1): 1-127.
[6]
Masci J, Meier U, Cireşan D, Schmidhuber J. Stacked convolutional autoencoders for hierarchical feature extraction. Artificial Neural Networks and Machine Learning–ICANN 2011 2011; 52-9.
[7]
Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2015;
[8]
Badrinarayanan V, Kendall A, Cipolla R. Segnet: A deep convolutional encoder-decoder architecture for scene segmentation. IEEE trans pattern analysis and machine intelligence 2017.
[9]
Li W, Duan L, Xu D, Tsang IW-H. editors.. Text-based image retrieval using progressive multi-instance learning. Computer Vision (ICCV), 2011 IEEE International Conference on;. 2011 ; IEEE
[10]
Singhai N, Shandilya SK. A survey on: content based image retrieval systems. Int J Comput Appl 2010; 4(2): 22-6.
[11]
Luccheseyz L, Mitray S. Color image segmentation: A state-of-the-art survey. Proc Indian National Sci Acad 2001; 67(2): 207-21.
[12]
He K, Zhang X, Ren S, Sun J. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. Proceedings of the IEEE international conference on computer vision. 2015;
[13]
Krizhevsky A, Hinton GE. Using very deep autoencoders for content-based image retrieval. ESANN 2011.
[14]
Morel JB, Godon C, Mourrain P, et al. Fertile hypomorphic ARGONAUTE (ago1) mutants impaired in post-transcriptional gene silencing and virus resistance. Plant Cell 2002; 14(3): 629-39.
[15]
Singh A, Ganapathysubramanian B, Singh AK, Sarkar S. Machine learning for high-throughput stress phenotyping in plants. Trends Plant Sci 2016; 21(2): 110-24.
[16]
Peña JM, Torres-Sánchez J, Serrano-Pérez A, de Castro AI, López-Granados F. Quantifying efficacy and limits of unmanned aerial vehicle (UAV) technology for weed seedling detection as affected by sensor resolution. Sensors 2015; 15(3): 5609-26.
[17]
Seren Ü, Grimm D, Fitz J, Weigel D, Nordborg M, Borgwardt K, et al. AraPheno: a public database for Arabidopsis thaliana phenotypes. Nucleic Acids Res 2017; 45(D1): D1054-9.


Rights & PermissionsPrintExport Cite as

Article Details

VOLUME: 14
ISSUE: 7
Year: 2019
Page: [628 - 639]
Pages: 12
DOI: 10.2174/1574893614666190204150109
Price: $58

Article Metrics

PDF: 22
HTML: 2
EPUB: 1
PRC: 1