نتایج جستجو برای: multi gpu
تعداد نتایج: 473736 فیلتر نتایج به سال:
با توجه به اینکه در سال¬های اخیر پردازنده¬های چند هسته¬ای و gpu های چند هسته¬ای به عنوان ابزار مقرون به صرفه¬ای برای استفاده در سیستم¬های مختلف مناسب بوده¬اند، امروزه کامپیوتر¬های رو میزی، لپ¬تاپ¬ها و ابر رایانه¬ها و محیط¬های ابری که شامل پردازنده¬های چند هسته¬ای cpu و gpu می¬باشند بسیار رایج هستند. در نتیجه ارائه¬ی سیستم عامل¬هایی برای محاسبات که بر روی cpu و gpu اجرا شوند مورد توجه بسیار زیاد...
We have ported the numerical factorization and triangular solve phases of sparse direct solver STRUMPACK to GPU . implements LU using multifrontal algorithm, which performs most its operations in dense linear algebra on so-called frontal matrices various sizes. Our implementation off-loads these operations, as well scatter–gather between matrices. For larger matrices, our relies vendor librarie...
We provide timing results for common linear algebra subroutines across BLAS (Basic Linear Algebra Subprograms) and GPU (Graphics Processing Unit)-based implementations. Several BLAS implementations are compared. The first is the unoptimised reference BLAS which provides a baseline to measure against. Second is the Atlas tuned BLAS, configured for single-threaded mode. Third is the development v...
In this paper we introduce Bohrium, a runtimesystem for mapping array-operations onto a number of different hardware platforms, from multi-core systems to clusters and GPU enabled systems. As a result, the Bohrium runtime system enables NumPy code to utilize CPU, GPU, and Clusters. Bohrium integrates seamlessly into NumPy through the implicit data parallelization of array operations, which are ...
In this paper, we present an acceleration strategy for Smoothed Particle Hydrodynamics (SPH) on multi-GPU platform. For single-GPU, we first use a neighborhood search algorithm of compacting cell index combined with spatial domain characteristics. For multi-GPU, we focus on the changing patterns of SPH's computational time. Simple dynamic load balancing algorithm works well because the computat...
Recent work has demonstrated that the use of programmable GPUs can be advantageous during relational query processing on analytical workloads. In this paper, we take a closer look at graph problems such as finding all triangles and all four-cliques of a graph. In particular, we present two different join algorithms for the GPU. The first is an implementation of Leapfrog-Triejoin (LFTJ), a recen...
In this work we present an adaptive multi-GPU Exchange Monte Carlo method designed for the simulation of the 3D Random Field Model. The algorithm design is based on a twolevel parallelization scheme that allows the method to scale its performance in the presence of faster and GPUs as well as multiple GPUs. The set of temperatures is adapted according to the exchange rate observed from short tri...
With the development of satellite remote sensing technology, satellite remote sensing data obtained by the amount will increase rapidly. Consequently, the process of Wallis transformation is faced with such challenges as large data size, high intensity, high computational complexity and large computational quantity, and so on. A fast algorithm and efficient implementation of Wallis filtering ba...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید