Current Bioinformatics

ISSN (Print): 1574-8936
ISSN (Online): 2212-392X

HaShRECA: Hadoop Based Short Read Error Correction Algorithm for Genome Assembly

Author(s): Muhammad Tahir, Muhammad Sardaraz, Ataul Aziz Ikram and Hassan Bajwa

Volume 10, Issue 4, 2015

Page: [469 - 475] Pages: 7

DOI: 10.2174/157489361004150922151409

Abstract

Next-generation high-throughput sequencing technologies have opened up new and challenging research opportunities. In particular, next-generation sequencers produce a massive amount of short-read data in a single run. However, this short-read data is more susceptible to errors than data from shotgun sequencing. There is therefore a pressing need for faster and more accurate statistical and computational tools to analyze it. We present HaShRECA, a new short-read error correction algorithm that is based on probabilistic analysis of potential read errors and runs on the Hadoop MapReduce framework. Experimental results show that HaShRECA is more accurate, and more time- and space-efficient, than previous algorithms.
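The abstract does not describe the internals of HaShRECA, so the following is only an illustrative sketch of how a frequency-based error screen can be phrased in the MapReduce pattern. It uses plain Python rather than Hadoop, and it substitutes a generic k-mer counting heuristic for the paper's probabilistic analysis. The names `map_read`, `reduce_counts`, `weak_kmers`, the k-mer length `K`, and the count threshold are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict

K = 4  # hypothetical k-mer length, chosen only to suit the toy reads below


def map_read(read, k=K):
    """Map step: emit a (k-mer, 1) pair for every k-mer in one read."""
    for i in range(len(read) - k + 1):
        yield read[i:i + k], 1


def reduce_counts(pairs):
    """Shuffle and reduce step: sum the emitted counts per k-mer key."""
    counts = defaultdict(int)
    for kmer, n in pairs:
        counts[kmer] += n
    return dict(counts)


def weak_kmers(counts, threshold=2):
    """Flag k-mers seen fewer than `threshold` times as possible errors.

    This cutoff is an assumption for illustration, not the paper's model.
    """
    return {kmer for kmer, n in counts.items() if n < threshold}


# Toy input: the third read differs from the others in its last base.
reads = ["ACGTACGT", "ACGTACGT", "ACGTACGA"]
pairs = [pair for read in reads for pair in map_read(read)]
counts = reduce_counts(pairs)
suspects = weak_kmers(counts)
```

In this toy run, the only k-mer flagged is the one that covers the mismatching final base of the third read. In a real Hadoop job the map and reduce functions would run on separate nodes over partitions of the read set, and the fixed threshold would be replaced by whatever statistical model the method actually uses.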

Keywords: Algorithm, genome, MapReduce, next-generation sequencing, short read errors.

© 2024 Bentham Science Publishers