Parallel Join Algorithms based on Parallel B+-trees

Authors

  • Jianzhong Li
  • Wen-jun Sun
  • Yingshu Li
Abstract

Within the last several years, a number of parallel algorithms for the join operation have been proposed. However, almost none of them take advantage of the underlying parallel storage structures or data declustering methods of the operand relations. This paper introduces the concept of a parallel join algorithm that is aware of the parallel storage structure or declustering method. A parallel B+-tree index is presented, and two classes of parallel join algorithms that take advantage of the underlying parallel B+-tree index are proposed and analyzed. First, the parallel B+-tree index is described. A parallel B+-tree index of a relation consists of a set of improved B+-tree indices (IB-trees for short) distributed among the processing nodes of a parallel computing system. Each IB-tree is built on one partition of the relation, where the relation is partitioned by one of three strategies: range partitioning, hash partitioning, or round-robin partitioning. Second, two classes of parallel join algorithms based on the parallel B+-tree index are presented and analyzed. One is based on the range-partition strategy; the other is based on the hash-partition strategy. It is shown that the parallel execution times of the algorithms are linearly proportional to max{N/P, M/P}, where N and M are the numbers of tuples in the operand relations and P is the number of processing nodes. Finally, the two classes of parallel join algorithms are compared in practice with the well-known parallel join algorithms: parallel hybrid-hash join, parallel Grace-hash join, and parallel merge-sort join. Theoretical and experimental results show that the algorithms based on the parallel B+-tree are very efficient when at least one operand relation has a parallel B+-tree index on the join attributes.
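As a rough illustration of the hash-partition idea in the abstract, the following Python sketch simulates P nodes in a single process. It is not the paper's algorithm: the names `hash_partition`, `SortedIndex`, and `index_join` are hypothetical, and a sorted list searched with `bisect` stands in for each node's local IB-tree. Because both relations are partitioned with the same hash function on the join attribute, matching tuples always land on the same node, so each node probes only its own local index.

```python
from bisect import bisect_left, bisect_right


def hash_partition(tuples, key, p):
    """Assign each tuple to one of p nodes by hashing its join attribute."""
    parts = [[] for _ in range(p)]
    for t in tuples:
        parts[hash(t[key]) % p].append(t)
    return parts


class SortedIndex:
    """Stand-in for one node's local IB-tree (assumption: a sorted list
    with binary search, not a real B+-tree)."""

    def __init__(self, tuples, key):
        entries = sorted(((t[key], t) for t in tuples), key=lambda e: e[0])
        self.keys = [k for k, _ in entries]
        self.rows = [t for _, t in entries]

    def lookup(self, k):
        """Return all indexed tuples whose join attribute equals k."""
        lo = bisect_left(self.keys, k)
        hi = bisect_right(self.keys, k)
        return self.rows[lo:hi]


def index_join(r, s, r_key, s_key, p):
    """Equi-join r and s; r carries the (simulated) parallel index."""
    r_indices = [SortedIndex(part, r_key) for part in hash_partition(r, r_key, p)]
    s_parts = hash_partition(s, s_key, p)
    result = []
    # Each loop iteration models the independent work of one node.
    for node in range(p):
        for s_t in s_parts[node]:
            for r_t in r_indices[node].lookup(s_t[s_key]):
                result.append((r_t, s_t))
    return result


if __name__ == "__main__":
    r = [(1, "a"), (2, "b"), (2, "c"), (5, "d")]
    s = [(2, "x"), (5, "y"), (7, "z")]
    for pair in sorted(index_join(r, s, 0, 0, p=3)):
        print(pair)
```

If the hash function spreads tuples evenly, each simulated node handles roughly N/P indexed tuples and M/P probing tuples, which is the quantity the abstract's max{N/P, M/P} bound refers to.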

Similar resources

Parallel Generation of t-ary Trees

A parallel algorithm for generating t-ary tree sequences in reverse B-order is presented. The algorithm generates t-ary trees as 0-1 sequences, and each 0-1 sequence is generated in constant average time O(1). The algorithm is executed on a CREW SM SIMD model, and is adaptive and cost-optimal. Prior to the discussion of the parallel algorithm, a new sequential generation with O(1) average time ...


Parallel Pointer-Based Join Algorithms in Memory-mapped Environments

Three pointer-based parallel join algorithms are presented and analyzed for environments in which secondary storage is made transparent to the programmer through memory mapping. Buhr, Goel, and Wai [11] have shown that data structures such as B-Trees, R-Trees and graph data structures can be implemented as efficiently and effectively in this environment as in a traditional environment using exp...


Execution in a Parallel Main-Memory Environment

In this paper, the performance and characteristics of the execution of various join-trees on a parallel DBMS are studied. The results of this study are a step in the direction of the design of a query optimization strategy that is fit for parallel execution of complex queries. Among others, synchronization issues are identified to limit the performance gain from parallelism. A new hash-join alg...


Two Strategies Based on Meta-Heuristic Algorithms for Parallel Row Ordering Problem (PROP)

Proper arrangement of the facility layout is a key issue in management that influences the efficiency and profitability of manufacturing systems. The Parallel Row Ordering Problem (PROP) is a special case of the facility layout problem and consists of looking for the best location of n facilities while similar facilities (facilities that have some characteristics in common) should be arranged in a row ...


A New Parallel Matrix Multiplication Method Adapted on Fibonacci Hypercube Structure

The objective of this study was to develop a new optimal parallel algorithm for matrix multiplication that can run on a Fibonacci Hypercube structure. Most of the popular algorithms for parallel matrix multiplication cannot run on the Fibonacci Hypercube structure; therefore, a method that can run on all structures, especially the Fibonacci Hypercube structure, is necessary for parallel matr...


Publication date: 2001