THESIS
2019
ix, 40 pages : illustrations ; 30 cm
Abstract
Dataflow is a prevailing programming paradigm for processing data in a distributed fashion. When programming dataflow applications, programmers express a data processing task as a dataflow graph, with vertices representing specific operations and edges representing input/output relations or dataflow dependencies between operations. When deployed inside a data center with a large number of available processors, the dataflow graph is partitioned and placed onto different processors for improved processing throughput. For graph edges that cross partition boundaries, the distributed execution engine needs to transfer data chunks between processors. Increasing data volumes and the growing processing power of individual processors often turn the inter-processor links into a communication bottleneck, resulting in severe performance degradation for distributed dataflow applications.
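
To make the partitioning concrete, the following small C sketch (an illustration for this abstract, not code from the thesis) places four hypothetical operations on two processors and flags the edges whose endpoints land on different processors; those are exactly the edges the execution engine must serve over the network.

/* Minimal sketch: a partitioned dataflow graph and the edges that
 * induce inter-processor communication. Placement is hypothetical. */
#include <stdio.h>

#define NUM_VERTICES 4
#define NUM_EDGES    3

typedef struct { int src, dst; } Edge;

int main(void) {
    /* Vertex i (an operation) runs on processor placement[i]. */
    int placement[NUM_VERTICES] = {0, 0, 1, 1};

    /* Dataflow dependencies between operations (vertex indices). */
    Edge edges[NUM_EDGES] = {{0, 1}, {1, 2}, {2, 3}};

    for (int i = 0; i < NUM_EDGES; i++) {
        int p_src = placement[edges[i].src];
        int p_dst = placement[edges[i].dst];
        if (p_src != p_dst)
            printf("edge %d->%d crosses partitions: network transfer %d -> %d\n",
                   edges[i].src, edges[i].dst, p_src, p_dst);
        else
            printf("edge %d->%d is local to processor %d\n",
                   edges[i].src, edges[i].dst, p_src);
    }
    return 0;
}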
In recent years, Remote Direct Memory Access (RDMA) has become widely deployed in data centers as an alternative to the Transmission Control Protocol (TCP). RDMA offers ultra-low latency and CPU-bypass networking to application programmers. Existing applications, however, are often designed around a socket-based software stack that manages application buffers separately from networking buffers and copies memory between them when sending and receiving data. With large application buffers (up to hundreds of megabytes), the cost of such copies adds non-trivial overhead to the end-to-end communication pipeline. In this work, we design a zero-copy transport for distributed dataflow applications that unifies application and networking buffer management and eliminates unnecessary memory copies. Our prototype on top of TensorFlow shows a 2.43x performance improvement over the gRPC-based transport and a 1.21x improvement over an alternative RDMA transport that uses separate buffers and memory copies.
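
The contrast between the two buffer-management schemes can be sketched with the libibverbs API. The snippet below is a hedged illustration, not the thesis's implementation: queue-pair setup, completion polling, and error handling are omitted, and the protection domain `pd`, queue pair `qp`, and pre-registered networking buffer `net_mr` are assumed to be initialized elsewhere. The copy-based path copies the application buffer into a registered networking buffer before posting a send, while the zero-copy path registers the application buffer itself so the NIC can DMA directly from it.

/* Sketch only: contrasts copy-based and zero-copy send paths. */
#include <infiniband/verbs.h>
#include <string.h>
#include <stdint.h>

static int post_send(struct ibv_qp *qp, void *addr, uint32_t len, uint32_t lkey) {
    struct ibv_sge sge = {
        .addr   = (uintptr_t)addr,
        .length = len,
        .lkey   = lkey,
    };
    struct ibv_send_wr wr = {
        .sg_list    = &sge,
        .num_sge    = 1,
        .opcode     = IBV_WR_SEND,
        .send_flags = IBV_SEND_SIGNALED,
    };
    struct ibv_send_wr *bad_wr;
    return ibv_post_send(qp, &wr, &bad_wr);
}

/* Copy-based path: the application buffer is first copied into a
 * separately registered networking buffer. With buffers of hundreds
 * of megabytes, this memcpy dominates the communication pipeline. */
int send_with_copy(struct ibv_qp *qp, struct ibv_mr *net_mr,
                   const void *app_buf, uint32_t len) {
    memcpy(net_mr->addr, app_buf, len);          /* the extra copy */
    return post_send(qp, net_mr->addr, len, net_mr->lkey);
}

/* Zero-copy path: the application buffer itself is registered with
 * the NIC, so the hardware DMAs directly from it and no copy occurs. */
int send_zero_copy(struct ibv_pd *pd, struct ibv_qp *qp,
                   void *app_buf, uint32_t len) {
    struct ibv_mr *mr = ibv_reg_mr(pd, app_buf, len, IBV_ACCESS_LOCAL_WRITE);
    if (!mr)
        return -1;
    /* Deregistration (ibv_dereg_mr) is elided in this sketch. */
    return post_send(qp, app_buf, len, mr->lkey);
}

In practice a zero-copy transport would also cache memory registrations across sends, since ibv_reg_mr is itself expensive; the sketch above elides that detail.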