نتایج جستجو برای: gpu

تعداد نتایج: 10995  

2014
Yue Gu Jian-Hua Gu Xing-She Zhou

Remote procedure call (RPC) is a simple, transparent and useful paradigm for providing communication between two processes across a network. The compute unified device architecture (CUDA) programming toolkit and runtime enhance the programmability of the graphics processing unit (GPU) and make GPU more versatile in high performance computing. The current researches mainly focus on the accelerat...

2016
Mochi Xue Kun Tian Yaozu Dong Jiacheng Ma Jiajun Wang Zhengwei Qi Bingsheng He Haibing Guan

With increasing GPU-intensive workloads deployed on cloud, the cloud service providers are seeking for practical and efficient GPU virtualization solutions. However, the cutting-edge GPU virtualization techniques such as gVirt still suffer from the restriction of scalability, which constrains the number of guest virtual GPU instances. This paper introduces gScale, a scalable GPU virtualization ...

Journal: :Journal of Systems and Software 2016
Wookhyun Han Hoon Sung Chwa Hwidong Bae Hyosu Kim Insik Shin

Multi-GPUs appear as an attractive platform to speed up data-parallel GPGPU computation. The idea of split-and-merge execution has been introduced to accelerate the parallelism of multiple GPUs even further. However, it has not been explored before how to exploit such an idea for real-time multi-GPU systems properly. This paper presents an open-source real-time multi-GPU scheduling framework, c...

Journal: :CoRR 2013
Teng Li Vikram K. Narayana Tarek A. El-Ghazawi

The High Performance Computing (HPC) field is witnessing a widespread adoption of Graphics Processing Units (GPUs) as co-processors for conventional homogeneous clusters. The adoption of prevalent SingleProgram Multiple-Data (SPMD) programming paradigm for GPU-based parallel processing brings in the challenge of resource underutilization, with the asymmetrical processor/co-processor distributio...

2005
Stefan Schenke Burkhard C. Wünsche Joachim Denzler

Volume segmentation is an important part of any medical image analysis framework used for diagnoses, treatment planning and biomedical modelling and visualisation. Recent advances in modern graphics hardware have made it possible to perform general purpose computing on the GPU. In this paper we survey and analyse the current state-of-the-art of GPU-based volume segmentation algorithms. Limitati...

2011
Shinpei Kato Scott Brandt Yutaka Ishikawa

The graphics processing unit (GPU) is becoming a very powerful platform to accelerate graphics and data-parallel compute-intensive applications. It significantly outperforms traditional multi-core processors in performance and energy efficiency. Its application domains also range widely from embedded systems to high-performance computing systems. However, operating systems support is not adequa...

2008
Stephen W. Abell John Jaehwan Lee

Abell, Stephen W. MSECE, Purdue University, August 2013. Parallel Acceleration of Deadlock Detection and Avoidance Algorithms on GPUs. Major Professor: Dr. John Jaehwan Lee. Current mainstream computing systems have become increasingly complex. Most of which have Central Processing Units (CPUs) that invoke multiple threads for their computing tasks. The growing issue with these systems is resou...

2012
Gert-Jan van den Braak Cedric Nugteren Bart Mesman Henk Corporaal

Voting algorithms, such as histogram and Hough transforms, are frequently used algorithms in various domains, such as statistics and image processing. Algorithms in these domains may be accelerated using GPUs. Implementing voting algorithms efficiently on a GPU however is far from trivial due to irregularities and unpredictable memory accesses. Existing GPU implementations therefore target only...

2012
Jiadong Wu Weiming Shi Bo Hong

With their high computation throughput and outstanding performance-per-watt figures, the graphics processing units (GPU) are becoming increasingly important for high-performance computing (HPC) systems. Existing GPU execution environment restricts the GPU usage to local host node. This is suitable for standalone computer nodes, but becomes inefficient for HPC systems that consist of a large num...

2010
Christian R. Trott Lars Winterfeld Paul S. Crozier

We present a GPU implementation of LAMMPS, a widely-used parallel molecular dynamics (MD) software package, and show 5x to 13x single node speedups versus the CPU-only version of LAMMPS. This new CUDA package for LAMMPS also enables multi-GPU simulation on hybrid heterogeneous clusters, using MPI for inter-node communication, CUDA kernels on the GPU for all methods working with particle data, a...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید