نتایج جستجو برای: openmp

تعداد نتایج: 2294  

2015
Ananya Muddukrishna Peter A. Jonsson Mats Brorsson

Programmers struggle to understand performance of task-based OpenMP programs since profiling tools only report thread-based performance. Performance tuning also requires task-based performance in order to balance per-task memory hierarchy utilization against exposed task parallelism. We provide a cost-effective method to extract detailed task-based performance information from OpenMP programs. ...

2011
Besar Wicaksono Ramachandra C. Nanjegowda Barbara M. Chapman

Developing shared memory parallel programs using OpenMP is straightforward, but getting good performance in terms of speedup and scalability can be difficult. This paper demonstrates the functionality of a collector-based dynamic optimization framework called DARWIN that uses collected performance data as feedback to affect the behavior of the program through the OpenMP runtime, thus able to op...

2014
Xiaohui Chen Marc Moreno Maza Sushek Shekar Priya Unnikrishnan

We present MetaFork, a metalanguage for multithreaded algorithms based on the fork-join concurrency model and targeting multicore architectures. MetaFork is implemented as a source-to-source compilation framework allowing automatic translation of programs from one concurrency platform to another. The current version of this framework supports CilkPlus and OpenMP. We evaluate the benefits of the...

2006
Barbara M. Chapman Lei Huang Haoqiang Jin Gabriele Jost Bronis R. de Supinski

OpenMP provides a portable programming interface for shared memory parallel computers (SMPs). Although this interface has proven successful for small SMPs, it requies greater flexibility in light of the steadily growing size of individual SMPs and the recent advent of multithreaded chips. In this paper, we describe two application development experiences that exposed these expressivity problems...

2002
Rudolf Eigenmann Jay Hoeflinger Robert H. Kuhn David A. Padua Ayon Basumallik Seung-Jai Min Jiajing Zhu

This paper presents an overview of an ongoing NSFsponsored project for the study of runtime systems and compilers to support the development of efficient OpenMP parallel programs for distributed memory systems. The first part of the paper discusses a prototype compiler, now under development, that will accept OpenMP and will target TreadMarks, a Software Distributed Shared Memory System (SDSM),...

2017
Ali Alqazzaz Zijun Han

OpenMP has become a very successful user-model for developing parallel applications. However, there are still some challenges in terms of OpenMP interoperability with other programming APIs as well as within itself. In this paper, we introduce some extensions to the OpenMP runtime library related to the interoperability problem. Also, we evaluate and compare the performance of the different wai...

Journal: :CoRR 2015
John M. Campbell R. Keith Ellis Walter T. Giele

We report on our findings modifying MCFM using OpenMP to implement multi-threading. By using OpenMP, the modified MCFM will execute on any processor, automatically adjusting to the number of available threads. We modified the integration routine VEGAS to distribute the event evaluation over the threads, while combining all events at the end of every iteration to optimize the numerical integrati...

2012
Spiros N. Agathos Nikolaos D. Kallimanis Vassilios V. Dimakopoulos

In this work we present a highly efficient implementation of OpenMP tasks. It is based on a runtime infrastructure architected for data locality, a crucial prerequisite for exploiting the NUMA nature of modern multicore multiprocessors. In addition, we employ fast work-stealing structures, based on a novel, efficient and fair blocking algorithm. Synthetic benchmarks show up to a 6-fold increase...

2009
Ashay Rane Dan Stanzione

The Hybrid method of parallelization (using MPI for internode communication and OpenMP for intra-node communication) seems a natural fit for the way most clusters are built today. It is generally expected to help programs run faster due to factors like availability of greater bandwidth for intra-node communication. However, optimizing hybrid applications for maximum speedup is difficult primari...

2017
Sara Royuela Xavier Martorell Eduardo Quiñones Luís Miguel Pinho

The safety-critical real-time embedded domain increasingly demands the use of parallel architectures to fulfill performance requirements. Such architectures require the use of parallel programming models to exploit the underlying parallelism. This paper evaluates the applicability of using OpenMP, a widespread parallel programming model, with Ada, a language widely used in the safety-critical d...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید