Multidimensional Blocking in UPC

نویسندگان

  • Christopher Barton
  • Calin Cascaval
  • George Almási
  • Rahul Garg
  • José Nelson Amaral
  • Montse Farreras
چکیده

Partitioned Global Address Space (PGAS) languages offer an attractive, high-productivity programming model for programming large-scale parallel machines. PGAS languages, such as Unified Parallel C (UPC), combine the simplicity of shared-memory programming with the efficiency of the messagepassing paradigm by allowing users control over the data layout. PGAS languages distinguish between private, shared-local, and shared-remote memory, with shared-remote accesses typically much more expensive than shared-local and private accesses, especially on distributed memory machines where sharedremote access implies communication over a network. In this paper we present a simple extension to the UPC language that allows the programmer to block shared arrays in multiple dimensions. We claim that this extension allows for better control of locality, and therefore performance, in the language. We describe an analysis that allows the compiler to distinguish between local shared array accesses and remote shared array accesses. Local shared array accesses are then transformed into direct memory accesses by the compiler, saving the overhead of a locality check at runtime. We present results to show that locality analysis is able to significantly reduce the number of shared accesses.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

UPC Implementation of the Sparse Triangular Solve and NAS FT pdfsubject

One of the main trends in modern high performance computing is the use of clusters of SMPs instead of the large monolithic machines to solve scientific codes. Clusters have recently received a lot of traction in the community through the availability of low-cost high performance interconnects. These interconnects allow not only high performance, but good scalability as well. One of the main dra...

متن کامل

Optimizing Partitioned Global Address Space Programs for Cluster Architectures

Optimizing Partitioned Global Address Space Programs for Cluster Architectures by Wei-Yu Chen Doctor of Philosophy in Computer Science University of California, Berkeley Professor Katherine A. Yelick, Chair Unified Parallel C (UPC) is an example of a partitioned global address space language for high performance parallel computing. This programming model enables application to be written in a s...

متن کامل

Comparison of Single, Averaged, and Pooled Urine Protein:Creatinine Ratios in Proteinuric Dogs Undergoing Medical Treatment

BACKGROUND Monitoring urine protein:creatinine ratios (UPC) in dogs with protein-losing nephropathy (PLN) is challenging because of day-to-day variation in UPC results. HYPOTHESIS/OBJECTIVES Determine whether single, averaged, or pooled samples from PLN dogs receiving medical treatment yield comparable UPCs, regardless of degree of proteinuria. ANIMALS Twenty-five client-owned PLN dogs rece...

متن کامل

Effect of Eight Weeks of Aerobic Progressive Training with Capsaicin on Changes in PGC-1α and UPC-1 Expression in Visceral Adipose Tissue of Obese Rats With Diet

Objective: Decreased physical activity coupled with increased High‐Fat Diet (HFD) intake prompts obesity. Current research suggests that changing White Adipose Tissue (WAT) to brown promotes energy expenditure to counter obesity. The purpose of this study was to investigate the effects of aerobic Progressive training and Capsaicin (Cap) on Peroxisome proliferator-activated receptor gamma coacti...

متن کامل

UPC-SPIN: A Framework for the Model Checking of UPC Programs

This paper presents a method supported by a software framework for the model checking of Unified Parallel C (UPC) programs. The proposed framework includes a front-end compiler that generates finite models of UPC programs in the modeling language of the SPIN model checker. The model generation is based on a set of sound abstraction rules that transform the UPC synchronization primitives to sema...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007