Multi-server multi-function distributed computation

Malak, Derya

Talk at University of Illinois Chicago, 1 October 2024, Chicago, IL, USA

Large-scale distributed computing systems, such as MapReduce, Spark, or distributed deep networks, are critical for parallelizing the execution of computational tasks. Nevertheless, a struggle between computation and communication complexity lies at the heart of distributed computing. The work presented studies the communication cost for a multi-server, multi-task distributed computation framework, and does so for a broad class of functions and data statistics. Considering the framework where a user seeks the computation of multiple complex (conceivably non-linear) tasks from a set of distributed servers, we establish communication cost upper bounds for a variety of data statistics, function classes and data placements across the servers. To do so, we proceed to apply, for the first time here, Korner’s characteristic graph approach — which is known to capture the structural properties of data and functions — to the framework of multi-server multi-task distributed computing. Going beyond the general expressions, and in order to offer clearer insight, we also consider the well-known scenario of cyclic dataset placement and linearly separable functions over the binary field, in which case our approach exhibits considerable gains over the state of the art. Similar gains are identified for multi-linear functions.

Detail

BIBTEX

Type:

Talk

City:

Chicago

Date:

2024-10-01

Department:

Systèmes de Communication

Eurecom Ref:

7891

© EURECOM. Personal use of this material is permitted. The definitive version of this paper was published in Talk at University of Illinois Chicago, 1 October 2024, Chicago, IL, USA and is available at :