Journal article
Authors list: Yu, J; Blom, J; Sczyrba, A; Goesmann, A
Publication year: 2017
Pages: 58-60
Journal: Journal of Biotechnology
Volume number: 257
ISSN: 0168-1656
eISSN: 1873-4863
Open access status: Hybrid
DOI Link: https://doi.org/10.1016/j.jbiotec.2017.02.020
Publisher: Elsevier
Abstract:
The introduction of next generation sequencing has caused a steady increase in the amounts of data that have to be processed in modern life science. Sequence alignment plays a key role in the analysis of sequencing data e.g. within whole genome sequencing or metagenome projects. BLAST is a commonly used alignment tool that was the standard approach for more than two decades, but in the last years faster alternatives have been proposed including RapSearch, GHOSTX, and DIAMOND. Here we introduce HAMOND, an application that uses Apache Hadoop to parallelize DIAMOND computation in order to scale-out the calculation of alignments. HAMOND is fault tolerant and scalable by utilizing large cloud computing infrastructures like Amazon Web Services. HAMOND has been tested in comparative genomics analyses and showed promising results both in efficiency and accuracy.
Citation Styles
Harvard Citation style: Yu, J., Blom, J., Sczyrba, A. and Goesmann, A. (2017) Rapid protein alignment in the cloud: HAMOND combines fast DIAMOND alignments with Hadoop parallelism, Journal of Biotechnology, 257, pp. 58-60. https://doi.org/10.1016/j.jbiotec.2017.02.020
APA Citation style: Yu, J., Blom, J., Sczyrba, A., & Goesmann, A. (2017). Rapid protein alignment in the cloud: HAMOND combines fast DIAMOND alignments with Hadoop parallelism. Journal of Biotechnology. 257, 58-60. https://doi.org/10.1016/j.jbiotec.2017.02.020