THESIS
2012
viii, 66 p. : ill. ; 30 cm
Abstract
Estimating Site Frequency Spectrum (SFS) from gene sequences is an important task in population genetic analysis. SFS is a statistical summary that describes the distribution of minor allele frequency (MAF) at a set of gene sites from a group of individuals. Due to the high intensity of MAF computation, current SFS estimation is limited to a population size of a couple of hundreds individuals. To scale up MAF computation for a larger population, we explore the use of graphics processors, or GPUs, as a hardware accelerator. Specifically, we develop a software package named GAMA (GPU-Accelerated Minor Allele Frequency computation). In GAMA, we design a new MAF computation algorithm, which has a lower time complexity than the state-of-the-art algorithm, and is suitable for parallelization...[
Read more ]
Estimating Site Frequency Spectrum (SFS) from gene sequences is an important task in population genetic analysis. SFS is a statistical summary that describes the distribution of minor allele frequency (MAF) at a set of gene sites from a group of individuals. Due to the high intensity of MAF computation, current SFS estimation is limited to a population size of a couple of hundreds individuals. To scale up MAF computation for a larger population, we explore the use of graphics processors, or GPUs, as a hardware accelerator. Specifically, we develop a software package named GAMA (GPU-Accelerated Minor Allele Frequency computation). In GAMA, we design a new MAF computation algorithm, which has a lower time complexity than the state-of-the-art algorithm, and is suitable for parallelization on the GPU. Also, we utilize the local memory and warp synchronization mechanism on the GPU to further improve the performance. Finally, we adopt a logarithm transformation to avoid the floating point underflow problem in the computation. With GAMA, we are able to compute MAF for up to a thousand individuals for the first time. On a server equipped with an NVIDIA Tesla C2070 GPU and an Intel Xeon E5520 2.27 GHz CPU, GAMA achieves a speedup of 47 times over realSFS, an advanced, single-threaded, CPU-based SFS estimation program on MAF computation time, and is 3.5 times faster than our optimized, 16-thread parallel implementation on the CPU.
Post a Comment