Disk Allocation Methods for Parallelizing Grid Files
Abstract
The grid file [1] is a well-known access method for multi-dimensional and spatial data. The response time needed to process path and range queries on a grid file can be improved significantly by distributing the data pages over multiple disks. This paper explores the disk allocation methods used to allocate the data pages of a grid file among a set of disks that can be accessed in parallel. Given N disks, a perfect allocation speeds up the processing of each query by a factor of N in this environment. The paper shows that no disk allocation is perfect for the set of all orthogonal range queries, even on uniformly distributed read-only data. We then introduce two families of allocation methods, namely the Linear allocation method and the Lattice allocation method, which are perfect for a large collection of interesting path queries (rows, columns, diagonals, anti-diagonals) and range queries (small rectangles) on an interesting set of data distributions. We address the issues in extending disk allocation methods to general data distributions with random updates. Finally, we provide experimental results on the performance of the proposed methods and other well-known disk allocation methods on different query sets, data distributions and data set sizes.
Areas: Performance evaluation, spatial databases, parallel databases.
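To make the notion of a "perfect" allocation concrete, the following is a minimal sketch and not the paper's own definition. It assumes a linear mapping of the form disk(x, y) = (a*x + b*y) mod N, which the abstract does not spell out; the function names, the coefficients a and b, and the 5-disk setup are all hypothetical. A query is treated as perfect when the busiest disk reads no more than ceil(|Q| / N) pages.

```python
from collections import Counter
from math import ceil


def linear_disk(x, y, n_disks, a=1, b=2):
    """Assumed linear mapping: grid cell (x, y) -> disk (a*x + b*y) mod n_disks."""
    return (a * x + b * y) % n_disks


def response_time(cells, n_disks, a=1, b=2):
    """Parallel response time: the largest number of pages any single disk reads."""
    load = Counter(linear_disk(x, y, n_disks, a, b) for x, y in cells)
    return max(load.values())


def is_perfect(cells, n_disks, a=1, b=2):
    """True when the busiest disk reads only ceil(|Q| / n_disks) pages."""
    cells = list(cells)
    return response_time(cells, n_disks, a, b) == ceil(len(cells) / n_disks)


N = 5  # hypothetical number of disks
row = [(x, 3) for x in range(N)]
column = [(2, y) for y in range(N)]
diagonal = [(i, i) for i in range(N)]
anti_diagonal = [(i, N - 1 - i) for i in range(N)]

for name, query in [("row", row), ("column", column),
                    ("diagonal", diagonal), ("anti-diagonal", anti_diagonal)]:
    print(name, is_perfect(query, N))
```

With a=1, b=2 and N=5, the values a, b, a+b and a-b are all coprime with N, so each of the four path queries touches every disk exactly once. Choosing a=1, b=1 instead puts an entire anti-diagonal on one disk, which illustrates why the coefficients matter under this assumed form.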
Similar resources
Evaluation of Disk Allocation Methods for Parallelizing Spatial Queries on Grid Files
Spatial Database Systems are characterized by large amounts of geometric and geographic data. Query response times in these systems are crucial, since these systems are often used interactively for decision support systems. The Grid file[1] is a well-known spatial access method that has great potential for parallelism, which reduces the response time of spatial queries for time-critical on-line...
jParalize – a simple, free and lightweight tool for parallelizing Matlab calculations on multicores and in clusters
We present a very simple, reliable, efficient and free tool for parallelizing calculations under Matlab in multicore and cluster environments. It does not use any compilers, MEX files, disk files, etc. It is compatible with the old Paralize package, but allows the involved cores/machines to do other jobs when a server is not busy.
jPar - a simple, free and lightweight tool for parallelizing Matlab calculations on multicores and in clusters
We present a very simple, free tool for parallelizing calculations under Matlab in multicore and cluster environments. After the installation it does not use any compilers, MEX files, disk files, etc. It is compatible with the old Paralize package, but allows the involved cores/machines to do other jobs when a worker core/machine is not busy.
An Optimal Disk Allocation Strategy for Partial Match Queries on Non-Uniform Cartesian Product Files
The disk allocation problem addresses the issue of how to distribute a file onto several disks to maximize the concurrent disk accesses in response to a partial match query. In the past, this problem has been studied for binary as well as for p-ary cartesian product files. In this paper, we propose a disk allocation strategy for non-uniform cartesian product files by a coding theoretic approach...
A Comparison of FFS Disk Allocation Policies
The 4.4BSD file system includes a new algorithm for allocating disk blocks to files. The goal of this algorithm is to improve file clustering, increasing the amount of sequential I/O when reading or writing files, thereby improving file system performance. In this paper we study the effectiveness of this algorithm at reducing file system fragmentation. We have created a program that artificiall...