Massively parallel algorithms for acyclic joins

HKUST Electronic Theses

Massively parallel algorithms for acyclic joins

by Xiao Hu

THESIS 2019

Ph.D. Computer Science and Engineering

xi, 107 pages : illustrations ; 30 cm

Abstract

Due to the rapid development of massively parallel data processing systems such as MapReduce and Spark, there have been revived interests in the theoretical computer science community to study algorithms in a massively parallel computational model. Join evaluation, as one of the central algorithmic problems in database theory, has received much more attention. In this thesis, we study how to compute join queries efficiently (with theoretical guarantees) in today’s massively parallel systems.

We give an almost complete characterization of acyclic join queries on which instance-optimality, output-optimality and worst-case optimality can be achieved respectively. We first give an instance-optimal algorithm for r-hierarchical joins and prove that instance-optimality cannot be achieved...[ Read more ]

View Copyrighted to the author. Reproduction is prohibited without the author’s prior written consent.

Details

Collection HKUST Electronic Theses Degree Ph.D. Department Computer Science and Engineering Supervisors Yi, Ke Authors Hu, Xiao Subjects SQL (Computer program language) Mathematical models Query languages (Computer science) Electronic data processing Language English Call number Thesis CSED 2019 HuX DOI 10.14711/thesis-991012757468003412

Full record

Massively parallel algorithms for acyclic joins

by Xiao Hu

Post a Comment Cancel reply