Performance analysis of massively parallel programs for graphics processing units
نویسندگان
چکیده
Any modern Graphics Processing Unit (graphics card) is a good platform to run massively parallel programs. Still, we lack tools observe and measure performance characteristics of GPU-based software. We state that due complex memory hierarchy thou- sands execution threads the all issues are about efficient use graphics card hierarchy. propose GPGPUSim simulator, previously used mostly for architecture validation, validation CUDA-based program. provide examples which show how simulation analysis
منابع مشابه
Massively parallel chemical potential calculation on graphics processing units
Oneand two-stage free energy methods are common approaches for calculating the chemical potential from a molecular dynamics or Monte Carlo molecular simulation trajectory. Although these methods require significant amounts of CPU time spent on post-simulation analysis, this analysis step is wellsuited for parallel execution. In this work, we implement this analysis step on graphics processing u...
متن کاملParallel Genetic Programming on Graphics Processing Units
In program inference, the evaluation of how well a candidate solution solves a certain task is usually a computationally intensive procedure. Most of the time, the evaluation involves either submitting the program to a simulation process or testing its behavior on many input arguments; both situations may turn out to be very time-consuming. Things get worse when the optimization algorithm needs...
متن کاملRigid body constraints realized in massively-parallel molecular dynamics on graphics processing units
a r t i c l e i n f o a b s t r a c t Molecular dynamics (MD) methods compute the trajectory of a system of point particles in response to a potential function by numerically integrating Newton's equations of motion. Extending these basic methods with rigid body constraints enables composite particles with complex shapes such as anisotropic nanoparticles, grains, molecules, and rigid proteins t...
متن کاملAlgorithmic performance studies on graphics processing units
We report on our experience with integrating and using graphics processing units (GPUs) as fast parallel floatingpoint co-processors to accelerate two fundamental computational scientific kernels on the GPU: sparse direct factorization and nonlinear interior-point optimization. Since a full re-implementation of these complex kernels is typically not feasible, we identify the matrix-matrix multi...
متن کاملStrategies for Parallel Ant Colony Optimization on Graphics Processing Units
Ant colony algorithms are known to have a significant ability of finding high-quality solutions in a reasonable time [2]. However, the computational time of these methods is seriously compromised when the current instance of the problem has a high dimension and/or is hard to solve. In this line, a significant amount of research has been done in order to reduce computation time and improve the s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Problemy programmirovaniâ
سال: 2022
ISSN: ['1727-4907']
DOI: https://doi.org/10.15407/pp2022.03-04.051