ALCOMFT-TR-01-70

ALCOM-FT
 

Ben H.H. Juurlink, Petr Kolman, Friedhelm Meyer auf der Heide and Ingo Rieping
Optimal Broadcast on Parallel Locality Models
Paderborn. Work package 2. May 2001.
Abstract: In this paper matching upper and lower bounds for broadcast on general purpose parallel computation models that exploit network locality are proven. These models try to capture both the general purpose properties of models like the PRAM or BSP on the one hand, and to exploit network locality of special purpose models like meshes, hypercubes, etc., on the other hand. They do so by charging a cost l(|i-j|) for a communication between processors i and j, where l is a suitably chosen latency function.

An upper bound T(p) = \sumi=0log log p 2i · l(p1/2i) on the runtime of a broadcast on a p processor H-PRAM is given, for an arbitrary latency function l(k).

The main contribution of the paper is a matching lower bound, holding for all latency functions in the range from l(k)=Omega(log k / log log k) to l(k)=O(log2 k). This is not a severe restriction since for latency functions l(k) = O(log k/ log1+epsilon log (k)) with arbitrary epsilon > 0, the runtime of the algorithm matches the trivial lower bound Omega(log p) and for l(k) = Theta (log1+epsilon k) or l(k)=Theta(k^epsilon), the runtime matches the other trivial lower bound Omega(l(p)). Both upper and lower bounds apply for other parallel locality models like Y-PRAM, D-BSP and E-BSP, too.

Postscript file: ALCOMFT-TR-01-70.ps.gz (128 kb).

System maintainer Gerth Stølting Brodal <gerth@cs.au.dk>