Cache optimization for CPU-GPU heterogeneous processors
Abstract
Microprocessors combining CPU and GPU cores using a common last-level cache pose new challenges to cache management algorithms. Since GPU cores feature much higher data access rates than CPU cores, the majority of the available cache space will be used by GPU applications, leaving only very limited cache capacity for CPU applications, which may be disadvantageous for overall system performance. This paper introduces a novel cache management algorithm that aims at determining an optimal split of cache capacity between CPU and GPU applications.
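The abstract does not describe the algorithm itself, so the following is only a minimal sketch of what "determining a split of cache capacity" can look like, not the paper's method. It assumes a way-partitioned last-level cache and assumes that a miss count is already known for each side at every possible way allocation. The function name `best_way_split`, the miss-curve inputs, and the weights are all hypothetical, introduced for illustration.

```python
from typing import Sequence, Tuple


def best_way_split(
    cpu_misses: Sequence[float],
    gpu_misses: Sequence[float],
    total_ways: int,
    cpu_weight: float = 1.0,
    gpu_weight: float = 1.0,
) -> Tuple[int, int]:
    """Return (cpu_ways, gpu_ways) minimizing the weighted miss count.

    Assumption (not from the paper): cpu_misses[w] and gpu_misses[w] give
    the number of misses each side would see with w cache ways, for
    w = 0 .. total_ways. Each side always receives at least one way.
    """
    if total_ways < 2:
        raise ValueError("need at least two ways to split")
    if len(cpu_misses) <= total_ways or len(gpu_misses) <= total_ways:
        raise ValueError("miss curves must cover 0..total_ways")

    best_cost = None
    best_split = (1, total_ways - 1)
    # Exhaustive search over every split; cheap because way counts are small.
    for cpu_ways in range(1, total_ways):
        gpu_ways = total_ways - cpu_ways
        cost = (cpu_weight * cpu_misses[cpu_ways]
                + gpu_weight * gpu_misses[gpu_ways])
        if best_cost is None or cost < best_cost:
            best_cost = cost
            best_split = (cpu_ways, gpu_ways)
    return best_split


# Illustrative, made-up miss curves: the CPU workload benefits strongly from
# extra ways, while the streaming-like GPU workload barely benefits.
cpu_curve = [100, 50, 20, 10, 9]
gpu_curve = [200, 190, 185, 183, 182]
split = best_way_split(cpu_curve, gpu_curve, total_ways=4)
```

With these made-up curves the search gives most of the ways to the CPU side, which matches the abstract's concern that a GPU left unchecked would occupy capacity it gains little from.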
Similar resources
Techniques for Shared Resource Management in Systems with Throughput Processors
The continued growth of the computational capability of throughput processors has made throughput processors the platform of choice for a wide variety of high performance computing applications. Graphics Processing Units (GPUs) are a prime example of throughput processors that can deliver high performance for applications ranging from typical graphics applications to general-purpose data parall...
Optimization and Scheduling of Applications in a Heterogeneous CPU-GPU Environment
With the emergence of General Purpose computation on GPU (GPGPU) and corresponding programming frameworks (OpenCL, CUDA), more applications are being ported to use GPUs as a co-processor to achieve performance that could not be accomplished using just the traditional processors. However, programming the GPUs is not a trivial task and depends on the experience and knowledge of the individual pro...
In-Memory Data Analytics on Coupled CPU-GPU Architectures
In the big data era, in-memory data analytics is an effective means of achieving high performance data processing and realizing the value of data in a timely manner. Efforts in this direction have been spent on various aspects, including in-memory algorithmic designs and system optimizations. In this paper, we propose to develop the next-generation in-memory relational database processing techn...
Design space exploration of on-chip ring interconnection for a CPU-GPU heterogeneous architecture
Incorporating a GPU architecture into CMP, which is more efficient with certain types of applications, is a popular architecture trend in recent processors. This heterogeneous mix of architectures will use an on-chip interconnection to access shared resources such as last-level cache tiles and memory controllers. The configuration of this on-chip network will likely have a significant impact on ...
Performance Analysis of Join Algorithms on GPUs
Implementing database operations on parallel platforms has gained a lot of momentum in the past decade, due to the increasing popularity of many-core processors. A number of studies have shown the potential of using GPUs to speed up database operations. In this paper, we present empirical evaluations of a state-of-the-art work published in SIGMOD’08 on GPU-based join processing. In particular, su...