Intelligent Memory Manager: towards Improving the Locality Behavior of Allocation-intensive Applications

نویسنده

  • Mehran Rezaei
چکیده

and linked data structured) applications has led to a large number of research trends. Memory performance due to the cache misses in these applications continues to lag in terms of execution cycles as ever increasing CPU-Memory speed gap continues to grow. Sophisticated prefetcing techniques, data relocations, and multithreaded architec-tures have tried to address memory latency. These techniques are not completely successful since they require either extra hardware/software in the system or special properties in the applications. Software needed for prefetching and data relocation strategies, aimed to improve cache performance, pollutes the cache so that the technique itself becomes counter-productive. On the other hand, extra hardware complexity needed in multithreaded architectures decelerates CPU's clock, since " Simpler is Faster ". This dissertation, directed to seek the cause of poor locality behavior of allocation-intensive applications, studies allocators and their impact on the cache performance of these applications. Our study concludes that service functions, in general, and memory management functions, in particular, entangle with application's code and become the major cause of cache pollution. In this dissertation, we present a novel technique that transfers the allocation and de-allocation functions entirely to a separate processor residing in chip with DRAM (Intelligent Memory Manager). Our empirical results show that, on average, 60% of the cache misses caused by allocation and de-allocation service functions are eliminated using our technique. We also show that internal fragmentation, extra memory over-allocated by the allocators, counters special locality of applications. We introduce " hybrid, " an exact fit allocator, which results in 25% cache miss reduction due to minimizing the internal fragmentation. Moreover, this work indicates that external fragmentation, inability to use the existing free space, indirectly affects the execution performance. We propose address ordered and segregrated binary tree allocators that exhibit high storage utilization and moderate execution performance to compare with existing allocators. ii ACKNOWLEDGMENTS In the name of Allah, the most merciful, the most compassionate. All praise is due to Allah. My thanks, first and foremost, belongs to Almighty Allah, who has given me wisdom, courage, and strength. I am filled with joy and excitement. Now, when I look back at the last six years of my life, the thoughts and feelings in my heart are only about the countless people who contributed to my Ph.D. work in variety of ways. Two individuals have had the main role in guiding and supporting me throughout my Ph.D. work. Krishna …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Intelligent Memory Manager Eliminates Cache Pollution Due to Memory Management Functions

In this work, we show that data-intensive and frequently-used service functions such as memory allocation and de-allocation entangle with application’s working set and become a major cause for cache misses. We present our technique that transfers the allocation and de-allocation functions entirely to a separate processor residing on chip with DRAM (Intelligent Memory Manager). The results manif...

متن کامل

Intelligent memory manager: Reducing cache pollution due to memory management functions

In this work, we show that data-intensive and frequently-used service functions such as memory allocation and deallocation entangle with application s working set and become a major cause for cache misses. We present our technique that transfers the allocation and de-allocation functions executions from main CPU to a separate processor residing on chip with DRAM (Intelligent Memory Manager). Th...

متن کامل

Custom Memory Allocation for Free Improving Data Locality with Container-Centric Memory Allocation

We propose a novel container-centric memory allocation scheme. In this scheme, the container’s semantics guide the memory allocation, which results in data locality improvement and execution time reduction. The container-centric allocation provides the benefits of custom memory allocation, with the portability advantage. Applications need not change a single line of code, but rather change the ...

متن کامل

Predicting Lifetimes in Dynamically Allocated Memory

Predictions oflifetimes of dynamically allocated objects can be used to improve time and space efficiency of dynamic memory management in computer programs. Barrett and Zorn [1993] used a simple lifetime predictor and demonstrated this improvement on a variety of computer programs. In this paper, we use decision trees to do lifetime prediction on the same programs and show significantly better ...

متن کامل

The Characterization of Data Intensive Memory Workloads on Distributed PIM Systems

Processing-In-Memory (PIM) circumvents the von Neumann bottleneck by combining logic and memory (typically DRAM) on a single die. This work examines the memory system parameters for constructing PIM based parallel computers which are capable of meeting the memory access demands of complex programs that exhibit low reuse and non uniform stride accesses. The analysis uses the Data Intensive Syste...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004