Experiences with Mesh-like computations using Prediction Binary Trees

Authors

  • Gennaro Cordasco
  • Biagio Cosenza
  • Rosario De Chiara
  • Ugo Erra
  • Vittorio Scarano
Abstract

In this paper we exploit the temporal coherence among successive phases of a computation to implement a load-balancing technique for mesh-like computations mapped onto a cluster of processors. A key concept on which the load-balancing scheme is built is a Predictor component, which is in charge of estimating the imbalance between successive phases. Using this information, our method partitions the computation into balanced tasks through the Prediction Binary Tree (PBT). At each new phase, the current PBT is updated by using each task's computing time in the previous phase as its cost estimate for the next phase. The PBT is designed so that it balances the load across the tasks and also reduces dependency among processors, for higher performance. Dependency is reduced by using rectangular tiles of the mesh with an almost-square shape (i.e., one dimension is at most twice the other). Reducing dependency makes it possible to reduce inter-processor communication or to exploit local dependencies among tasks (such as data locality). Furthermore, we provide two heuristics that take advantage of data locality. Our strategy has been assessed on a significant problem, Parallel Ray Tracing. Our implementation shows good scalability and improves performance both on cheaper commodity clusters and on high-performance clusters with low-latency networks. We report different measurements showing that task granularity is a key point for the performance of our decomposition/mapping strategy.
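The idea described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the names `Tile`, `build`, and `update`, the rule of halving the longer side, the merge-and-split rebalancing step, and the `max_moves` bound are all assumptions made for illustration. The sketch only shows how a binary tree of almost-square tiles might be rebalanced when the previous phase's measured times are used as next-phase cost estimates.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Tile:
    """A rectangular tile of the mesh; leaves of the tree are the tasks."""
    x: int
    y: int
    w: int
    h: int
    cost: float = 1.0                  # estimated cost for the next phase
    left: Optional["Tile"] = None
    right: Optional["Tile"] = None

    def is_leaf(self) -> bool:
        return self.left is None

    def split(self) -> None:
        # Assumption: halve the longer side, so a tile whose sides are
        # powers of two stays almost square (ratio at most 2).
        half = self.cost / 2
        if self.w >= self.h:
            cut = self.w // 2
            self.left = Tile(self.x, self.y, cut, self.h, half)
            self.right = Tile(self.x + cut, self.y, self.w - cut, self.h, half)
        else:
            cut = self.h // 2
            self.left = Tile(self.x, self.y, self.w, cut, half)
            self.right = Tile(self.x, self.y + cut, self.w, self.h - cut, half)

    def merge(self) -> None:
        # Collapse two leaf children back into a single task.
        self.cost = self.left.cost + self.right.cost
        self.left = self.right = None


def leaves(t: Tile) -> List[Tile]:
    return [t] if t.is_leaf() else leaves(t.left) + leaves(t.right)


def mergeable(t: Tile) -> List[Tile]:
    """Internal nodes whose two children are both leaves."""
    if t.is_leaf():
        return []
    if t.left.is_leaf() and t.right.is_leaf():
        return [t]
    return mergeable(t.left) + mergeable(t.right)


def build(width: int, height: int, tasks: int) -> Tile:
    """Initial tree: with no timings yet, tile area stands in for cost."""
    root = Tile(0, 0, width, height, cost=float(width * height))
    while len(leaves(root)) < tasks:
        max(leaves(root), key=lambda t: t.cost).split()
    return root


def update(root: Tile, measured: List[float], max_moves: int = 64) -> None:
    """Use previous-phase times as estimates, then rebalance the tree.

    `measured` follows the order of leaves(root). The number of tasks is
    kept constant: each move merges the cheapest sibling pair and splits
    the most expensive leaf.
    """
    for leaf, seconds in zip(leaves(root), measured):
        leaf.cost = seconds
    for _ in range(max_moves):
        hot = max(leaves(root), key=lambda t: t.cost)
        pairs = [p for p in mergeable(root)
                 if p.left is not hot and p.right is not hot]
        if not pairs or min(hot.w, hot.h) < 2:
            return
        cold = min(pairs, key=lambda p: p.left.cost + p.right.cost)
        if cold.left.cost + cold.right.cost >= hot.cost:
            return                      # no move would improve the balance
        cold.merge()
        hot.split()
```

For example, `root = build(64, 64, 8)` followed by `update(root, times)` with one measured time per leaf keeps eight tasks while moving tile boundaries toward the expensive region. The sketch assumes mesh sides that are powers of two; other sizes can produce tiles that violate the almost-square bound after integer halving.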


Similar resources

Bagging-Like Effects for Decision Trees and Neural Nets in Protein Secondary Structure Prediction

In the Third Critical Assessment of Techniques for Protein Structure Prediction ("CASP-3") contest, the best performance was obtained with a classifier that uses neural networks, a window size of fifteen around a given amino acid, and a training set of about 299,186 amino acids. We set out to investigate the possibility of obtaining better performance by using a bagging-like committee of binary de...


Time-Optimal Tree Computations on Sparse Meshes

The main goal of this work is to fathom the suitability of the mesh with multiple broadcasting architecture (MMB) for some tree-related computations. We view our contribution at two levels: on the one hand we exhibit time lower bounds for a number of tree-related problems on the MMB. On the other hand, we show that these lower bounds are tight by exhibiting time-optimal tree algorithms on the M...


Comparison of Accelerating Techniques for Discontinuity Meshing

Creating an appropriate mesh is one of demanding tasks of many global illumination algorithms. Discontinuity meshing proved to diminish artifacts caused by other meshing strategies. Since naive discontinuity meshing would produce a great amount of geometric computations, accelerating techniques are usually involved. In this paper, we present results obtained with help of k-discrete orientation ...


Link-disjoint Embedding of Complete Binary Trees into 3d-meshes with Dimension-ordered Routing

This paper is concerned with the problem of embedding complete binary trees into 3-dimensional meshes using dimension-ordered routing with the primary concern of minimizing the link congestion. The authors showed that a complete binary tree with 2 1 nodes can be embedded into a 3-dimensional mesh with optimum size, 2 nodes, if the link congestion is two [7]. (More precisely, the link congestio...


MATHEMATICAL ENGINEERING TECHNICAL REPORTS A Parallel Tree Contraction Algorithm on Non-Binary Trees

Parallel tree contraction is an important framework to develop efficient parallel algorithms on trees. Parallel tree contraction gives an appropriate scheduling for parallel computations on trees, and the scheduling brings efficient parallel algorithms to us. While there are many studies for efficient algorithms of parallel tree contraction and implementation of various parallel computations ba...



Journal:
  • Scalable Computing: Practice and Experience

Volume 10

Publication date: 2009