Stratified sampling of execution traces: Execution phases serving as strata

نویسندگان

  • Heidar Pirzadeh
  • Sara Shanian
  • Abdelwahab Hamou-Lhadj
  • Luay Alawneh
  • Arya Shafiee
چکیده

The understanding of the behavioural aspects of a software system is an important enabler for many reverse engineering activities. The behaviour of software is typically represented in the form of execution traces. Traces, however, can be overwhelmingly large. To reduce their size, sampling techniques, especially the ones based on random sampling, have been extensively used. Random sampling, however, may result in samples that are not representative of the original trace. In this paper, we propose a trace sampling technique that not only reduces the size of a trace but also results in a sample that is representative of the original trace by ensuring that the desired characteristics of an execution are distributed similarly in both the sampled and the original trace. Hence, the insights gained from analyzing the sample trace could be extrapolated to the original execution trace. Our approach is based on stratified sampling instead of random sampling and uses the concept of execution phases as strata. We define an execution phase as a part of a trace that represents a specific task of the traced system. We also present an approach for the automatic detection of execution phases from a trace. Finally, we show the effectiveness of our sampling technique through two case studies.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

PASS: Program Structure Aware Stratified Sampling for Statistically Selecting Instruction Traces and Simulation Points

As modeled microarchitectures become more complex and the size of benchmark program keeps increasing, simulating a complete program with various input sets is practically infeasible within a given time and computation resource budget. A common approach is to simulate only a subset of representative parts of the program selected from the complete program execution. SimPoint [1,2] and SMARTS [10]...

متن کامل

Detecting Program Execution Phases Using Heuristic Search

Understanding a program from its execution traces is extremely difficult because a trace consists of thousands to millions of events, such as method calls, object creation and destruction, etc. Nonetheless, execution traces can provide valuable information, once abstracted from their low-level events. We propose to identify feature-level phases based on events collected from traces of the progr...

متن کامل

Using Dictionary Compression Algorithms to Identify Phases in Program Traces

Program execution traces record the sequences of events or functions that are encountered during a program execution. They can provide valuable insights into the run-time behaviour of software systems and form the basis for dynamic analysis techniques. Execution traces of large software systems can be huge, incorporating hundreds of thousands of elements, rendering them difficult to interpret a...

متن کامل

Stratified Programming Integrated Development Environment (SPIDER)

This paper describes the functionality required for a development environment that supports stratified programming (SP), a novel software development method that we have proposed recently [1, 2]. In this paper we discuss the case when program strata are controlled outside the programming language and present the main features of the SPIDER environment for strata creation and SP program executio...

متن کامل

Explaining the Dimensions and Consequences of Geography-Political Execution of Health Promotion Plan in Kohgiluyeh and Boyerahmad Province

  Introduction: The purpose of this study is to explain the dimensions and implications of the political geography of the implementation of the Health System Development Plan in Kohgiluyeh and Boyer-Ahmad Province. Methods: The research method was a survey and data gathering tool was a structured questionnaire. Sample size was determined by the modified Cochran formula (400). The research data...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Sci. Comput. Program.

دوره 78  شماره 

صفحات  -

تاریخ انتشار 2013