THESIS
2022
1 online resource (xii, 100 pages) : illustrations (some color)
Abstract
Performance optimizations on GPUs are not yet well understood. This thesis
discusses the principles and automation of performance optimizations on NVIDIA
GPUs, with a special focus on compute-bound kernels, and concentrates on the
abstraction layer between portable virtual instruction sets (e.g., LLVM IR,
NVIDIA PTX) and native hardware assembly.
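To make the layering concrete, the same scalar addition might appear as follows at each level; the kernel is a hypothetical example, and the PTX and SASS lines are illustrative sketches rather than actual compiler output:

    // CUDA C++ source: one level above the portable virtual ISA.
    __global__ void add(int *out, int a, int b) {
        *out = a + b;
    }

    // PTX (portable virtual ISA), sketch of the addition itself:
    //     add.s32 %r3, %r1, %r2;
    //
    // SASS (native hardware assembly, Turing-style), sketch:
    //     IADD3 R0, R1, R2, RZ;
    //
    // The PTX -> SASS step (register allocation, instruction
    // scheduling) is performed by ptxas and is the focus here.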
We first introduce the native GPU instruction set, Shader ASSembly (SASS).
Previously, the public could not customize SASS generation, as the only way to
generate SASS was through the closed-source proprietary compiler ptxas, which
hides many important optimizations, including instruction scheduling. We
develop an open-source assembler, TuringAs, that lets the public manipulate
SASS, and with it we identify new optimization opportunities at the SASS level.
For instance, using certain native SASS instructions helps reduce register
pressure, and reordering SASS instructions yields better instruction-level
parallelism and thus higher throughput. We evaluate the effectiveness of our
optimizations on Winograd convolution (a fast convolution algorithm) and
Tensor Core matrix multiplication.
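As a minimal sketch of the instruction-level-parallelism idea (a hypothetical kernel, not taken from the thesis): splitting one dependent accumulation chain into several independent chains gives the scheduler independent instructions to interleave, hiding instruction latency.

    // CUDA sketch of the ILP principle (hypothetical example).
    // A single accumulator forms one long dependency chain; four
    // independent accumulators let consecutive multiply-adds issue
    // back-to-back instead of stalling on the previous result.
    // One block assumed for brevity; tail elements are ignored.
    __global__ void dot_ilp(const float *a, const float *b,
                            float *out, int n) {
        float acc0 = 0.f, acc1 = 0.f, acc2 = 0.f, acc3 = 0.f;
        for (int i = 4 * threadIdx.x; i + 3 < n; i += 4 * blockDim.x) {
            acc0 += a[i]     * b[i];      // these four multiply-adds
            acc1 += a[i + 1] * b[i + 1];  // are mutually independent,
            acc2 += a[i + 2] * b[i + 2];  // so the scheduler is free
            acc3 += a[i + 3] * b[i + 3];  // to interleave them
        }
        atomicAdd(out, acc0 + acc1 + acc2 + acc3);
    }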
Next, we introduce our effort to automate SASS optimizations to improve
productivity. Programming directly in SASS does not scale to large numbers of
kernels or to new GPU architectures. We develop GASS, an LLVM-based compiler
that automatically translates a high-level virtual representation (i.e., LLVM
IR) into optimized SASS. We highlight our newly proposed instruction scheduler
for compute-bound deep learning kernels, our customization of the if-conversion
pass, and our algorithms for resolving data dependencies. Our evaluation shows
that the algorithms in GASS outperform LLVM's by a considerable margin and that
GASS is on par with the highly optimized proprietary compiler ptxas.
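As a sketch of what if-conversion does in general (a hypothetical example, not GASS's actual pass): a short branch is flattened into straight-line, predicated code, removing the jump from the instruction stream.

    // Hypothetical before/after view of if-conversion.
    __global__ void relu_branch(float *x, int n) {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i < n) {
            if (x[i] < 0.f)   // short, potentially divergent branch
                x[i] = 0.f;
        }
    }

    // After if-conversion the inner branch becomes a branch-free
    // select; in SASS this roughly corresponds to a predicate-setting
    // compare (e.g., FSETP) followed by predicated instructions
    // rather than a jump.
    __global__ void relu_flat(float *x, int n) {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i < n) {
            float v = x[i];
            x[i] = (v < 0.f) ? 0.f : v;
        }
    }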