A Parallel and Distributed Computing System for Protein-Protein Interaction Literature Mining

Author(s): Hsi-Chieh Lee*, Szu-Wei Huang .

Journal Name: Current Proteomics

Volume 15 , Issue 5 , 2018

Submit Manuscript
Submit Proposal

Graphical Abstract:


Abstract:

A parallel and distributed computing mining system is proposed for finding protein-protein interaction literatures from the databases on the Internet. In the proposed system, we try to find out discriminating words for protein-protein interaction by way of text mining from the literatures. A threshold called matching-degree is also evaluated to check if a given literature might related to protein- protein interactions. Furthermore, a keypage-based search mechanism is adopted to find related papers for protein-protein interactions from a given document. The system is designed with a webbased graphical user interface and a parallel and distributed job-dispatching kernel. Experiments are conducted the experimental results indicate that by using the proposed system, it is helpful for researchers to find out protein-protein literatures from the overwhelming piece of information. Moreover, the utilization of parallel and distributed architecture makes this system scalable and the speedup and efficiency of the system are promising. With two servers, the speedup is 1.95 and with three servers the speedup is 3.97 which derive the efficiency to be 0.975 and 0.9925, respectively.

Keywords: Protein-protein interactions, parallel and distributed computing, intelligent computing, data mining, information retrieval, keypage-based search.

Rights & PermissionsPrintExport Cite as


Article Details

VOLUME: 15
ISSUE: 5
Year: 2018
Page: [344 - 351]
Pages: 8
DOI: 10.2174/1570164615666180403150355
Price: $58

Article Metrics

PDF: 10
HTML: 3