Parallelizing de novo assembly with heterogeneous processors

by Shuang Qiu

THESIS 2019

Ph.D. Computer Science and Engineering

vii, 120 pages : illustrations ; 30 cm

Abstract

De novo assemblers construct genome sequences from small fragments without using a reference genome. Specifically, they represent the fragments in a de Bruijn graph and traverse the graph to generate the sequence. Because constructing and traversing a large de Bruijn graph is both time- and memory-intensive, we develop UNIPAR, a parallel software package that runs this process on a cluster of GPU-equipped computers. In particular, it utilizes all processor cores in each CPU and GPU, all CPUs and GPUs in a computer node, and all computer nodes of the cluster. Furthermore, we analyze the characteristics of genome data to design a concurrent hashing algorithm for graph construction and to reduce the communication overhead in graph traversal. We further improve the overal...

Copyrighted to the author. Reproduction is prohibited without the author’s prior written consent.

Details

Collection: HKUST Electronic Theses
Degree: Ph.D.
Department: Computer Science and Engineering
Supervisors: Luo, Qiong
Authors: Qiu, Shuang
Subjects: Nucleotide sequence; Data processing; Graphic methods; Sequential analysis; Graph theory
Language: English
Call number: Thesis CSED 2019 Qiu
DOI: 10.14711/thesis-991012730762203412
