Techniques for Cache and Memory Simulation Using Address Reference Traces

نویسنده

  • Mark A. Holliday
چکیده

Simulation using address reference traces is one of the primary methods for the performance evaluation of the memory hierarchy of computer systems. In this paper we survey the techniques used in such a simulation. In both the uniprocessor and shared-memory multiprocessor cases, the issues can be divided into trace collection, trace storage, and trace usage. Trace collection can employ several hardware or software methods. Common concerns are that the collection method capture all of the address references of interest, that the execution overhead of the collection method is not excessive, and that the trace is of adequate length. The increasing size of caches heightens the adequate length concern. Trace storage is of concern because of the large size of traces. Techniques for trace compression and trace reduction have been developed. Trace usage is of concern because of the length of a simulation. Under some circumstances it is possible to evaluate multiple cache sizes in a single pass of the trace. For multiprocessor traces it is also possible to simulate the trace in parallel to achieve speedup. In the multiprocessor case, the global trace problem arises because environment-dependent address changes prevent the adjustment of traces collected in one environment from re ecting a di erent environment. A relatively new technique, inline simulation, attempts to avoid a number of the problems associated with traditional trace-driven simulation. Index Terms address reference traces, trace-driven simulation, survey, inclusion property, trace reduction, one-pass simulation, parallel traces, global trace problem, inline simulation. 2

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

PSnAP: Accurate Synthetic Address Streams through Memory Profiles

Memory address traces are an important information source; they drive memory simulations for performance modeling, systems design and application tuning. For long running applications, the direct use of an address trace is complicated by its size. Previous attempts to reduce address trace size incurred a substantial penalty with respect to trace accuracy. We propose a novel method of memory pro...

متن کامل

Memory Performance Profiling via Sampled Performance Monitor Event Traces

Memory performance can be studied, process behavior can be characterized, and application performance can be improved through the use of sampled performance monitor event traces. As an example, this paper demonstrates how sampled traces of the TPC-C benchmark executed on eightand 32-processor configurations of the IBM eServer pSeries 690 (p690) are analyzed to identify the resolution sites of l...

متن کامل

Memory Characterization of Embedded Applications

One method commonly used by computer architects to study application memory behavior is trace driven simulation of synthetic address traces. Leveraging synthetic traces reduces the time needed to gather the diverse set of application traces needed for effective testing, and provides programmers with knowledge to tune their programs for better runtime performance. In this paper, we present a tec...

متن کامل

BACH: BYU Address Collection Hardware, The Collection of Complete Traces

Trace driven simulation is an important tool for computer systems performance analysis and prediction, but its accuracy decreases when incomplete or inaccurate traces are used for input. Nevertheless, many memory hierarchy simulation studies have been published which rely on such traces. In this paper we describe BACH, a hardware monitor developed to capture long, accurate, and complete traces ...

متن کامل

CMP$im: A Binary Instrumentation Approach to Modeling Memory Behavior of Workloads on CMPs CMP$im: A Binary Instrumentation Approach to Modeling Memory Behavior of Workloads on CMPs

Chip multiprocessors are the next attractive point in the design space of future high performance processors. There is a growing need for simulation methodologies to determine the memory system requirements of emerging workloads in a reasonable amount of time. To explore the design space of a CMP memory hierarchy, this paper presents the use of binary instrumentation as an alternative to execut...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Int. Journal in Computer Simulation

دوره 1  شماره 

صفحات  -

تاریخ انتشار 1991