New Anticipatory Load Balancing Strategies for Parallel A* Algorithms

نویسندگان

  • Nihar R. Mahapatra
  • Shantanu Dutt
چکیده

In this paper, we develop load balancing strategies for scalable high-performance parallel A* algorithms suitable for distributed-memory machines. In parallel A* search, ineeciencies such as processor starvation and search of non-essential spaces (search spaces not explored by the sequential algorithm) grow with the number of processors P used, thus restricting its scalability. To alleviate this eeect, we propose a novel parallel startup phase and an eecient dynamic load balancing strategy called the quality equalizing (QE) strategy. Our new parallel startup scheme executes optimally in (logP) time and, in addition, achieves good initial load balance. The QE strategy employs near-neighbor quantitative and qualitative load balancing schemes to achieve load balance. These schemes utilize anticipatory mechanisms to detect and correct load imbalance before its actual occurrence; such mechanisms are particularly useful at lower work densities (the ratio of the problem size to P) and for lower granularity applications. The QE strategy possesses certain unique load balancing properties that enable it to signiicantly reduce starvation and non-essential work, and that make its performance robust across applications with diierent cost distributions for search-space nodes. Consequently, we obtain a highly scalable parallel A* algorithm with an almost-linear speedup. The startup and load balancing schemes were employed in parallel A* algorithms to solve the Traveling Salesman Problem on an nCUBE2 hypercube multicomputer. The QE strategy yields average speedup improvements of about 20-185% and 15-120% at low and intermediate work densities, respectively, over three well-known load balancing methods|the round-robin (RR), the random communication (RC) and the neighborhood averaging (NA) strategies. The average speedup observed on 1024 processors is about 985, representing a very high eeciency of 0:96. We also tested the eeect of including an anticipatory qualitative load balancing scheme in the QE strategy and found that it reduces the average execution time by 3:32% and 8:77% on 1 256 and 512 processors, respectively, at lower work densities. Finally, we present analytical and empirical results on the scalability of parallel A* algorithms in terms of the isoeeciency metric. Our analytical results include (1) a (P: log P) lower bound on the isoeeciency function of any parallel A* algorithm, and (2) a general expression for the upper bound on the isoeeciency function of our parallel A* algorithm using the QE strategy on any topology|for the hypercube and 2-D mesh ar-chitectures the upper bounds on the isoeeciency function are found to be (P: log 2 P) and (P: p …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parleda: a Library for Parallel Processing in Computational Geometry Applications

ParLeda is a software library that provides the basic primitives needed for parallel implementation of computational geometry applications. It can also be used in implementing a parallel application that uses geometric data structures. The parallel model that we use is based on a new heterogeneous parallel model named HBSP, which is based on BSP and is introduced here. ParLeda uses two main lib...

متن کامل

Scalable Load Balancing

In this paper, we develop load balancing strategies for scalable high-performance parallel A* algorithms suitable for distributed-memory machines. In parallel A* search, ineeciencies such as processor starvation and search of nonessential spaces (search spaces not explored by the sequential algorithm) grow with the number of processors P used, thus restricting its scalability. To alleviate this...

متن کامل

Load Balancing Strategies for Parallel SAMR Algorithms

Highly resolved solutions of partial differential equations are important in many areas of science and technology nowadays. Only adaptive mesh refinement methods reduce the necessary work sufficiently allowing the calculation of realistic problems. Blockstructured SAMR methods are well-suited for the time-explicit computation of large-scale dynamical problems, but still require parallelization ...

متن کامل

Load balancing of irregular parallel divide-and-conquer algorithms in group-SPMD programming environments

We study strategies for local load balancing of irregular parallel divide-andconquer algorithms such as Quicksort and Quickhull in SPMD-parallel environments such as MPI and Fork that allow to exploit nested parallelism by dynamic group splitting. We propose two new local strategies, repivoting and serialisation, and develop a hybrid local load balancing strategy, which is calibrated by paramet...

متن کامل

Dynamic Load Balancing Strategies for Parallel Computers

This paper deals with the problem of load balancing of parallel applications. Based on the study of recent work in the area, we propose a general taxonomy for describing and classifying the growing number of different load balancing techniques. This gives a thorough overview of different algorithms, helping designers to compare and choose the most suitable strategy for a given application. To i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994