Accelerating protein structure alignment with multicore-CPUs and graphic processors

HKUST Electronic Theses

Accelerating protein structure alignment with multicore-CPUs and graphic processors

by Yishui Wu

THESIS 2016

M.Phil. Computer Science and Engineering

x, 48 pages : illustrations ; 30 cm

Abstract

Among a variety of protein analysis tools, the pairwise protein structural alignment tool is the one used to identify structural similarities between two different proteins. It is widely applied to analyze the functionalities and evolutionary relationships of proteins. With the rapid growth of new 3D protein structures and the further development of protein research work, protein structural alignment becomes more and more important in this field.

Compared with different structural alignment tools, DALIX is the one capable of calculating an optimal structural alignment based on DALI score. It outperforms DALI, one of the most popular structural alignment algorithms, in most cases when aligning relatively short proteins. However, DALIX has high time complexity, which restricts its practical application.

In this work, we design parallel versions of DALIX to accelerate it. We implement a coarse-grained parallel algorithm using OPENMP on multicore-CPUs platforms and also a fine-grained parallel algorithm using CUDA on GPUs. In both versions, we parallelize the following components of DALIX: 1) Computation of the DALI score array, 2) algorithm engineering, 3) 2-level dynamic programming and 4) updating of the Lagrangian multipliers. In particular, to better utilize the massive GPU thread parallelism, in the CUDA version of DALIX, we design a 2-level parallel algorithm for the 2-level dynamic programming procedure. We compact the decision table of dynamic programming so that it can fit into the shared memory for inter thread communication, which improves the performance. We have compared the two parallel algorithms with the serial version of DALIX on a set of protein alignments. The CUDA version is run on an NVIDIA GTX960 GPU while the OPENMP version and the serial version are executed on a platform equipped with two Intel Xeon E5620 CPUs. The results show that the CUDA version outperforms the serial version with a speedup ranging from 5.5x to 20x. The OPENMP version also has a speedup of around 7x.

[ Hide abstract ]

View Copyrighted to the author. Reproduction is prohibited without the author’s prior written consent.

Details

Collection HKUST Electronic Theses Degree M.Phil. Department Computer Science and Engineering Authors Wu, Yishui Subjects Proteins Analysis Graphics processing units Data processing Mathematical models Multiprocessors Parallel processing (Electronic computers) Language English Call number Thesis CSED 2016 WuY DOI 10.14711/thesis-b1628168

Full record

Accelerating protein structure alignment with multicore-CPUs and graphic processors

by Yishui Wu

Post a Comment Cancel reply