Partial parallelization of VMEC system
نویسنده
چکیده
A C K N O W L E D G M E N T S ........................................................................................... vi 1 In tro d u ctio n ... 1 2 Parallel P rocessin g ............................................................................. 3 2.1 Parallel C o m p u te rs ................................... 3 2.2 Parallel C o m p u ta tio n s ................................................................................... 4 2.2.1 D ata P a ra l le l is m .................................................................................. 5 2.2.2 Shared M em o ry .......................................................................... 5 2.2.3 Message Passing ............................................................................ . 6 2.3 Performance Issues ................................................................................... 7 3 H igh Perform ance F ortran ..................................................................................... 8 3.1 Basics of High Performance F o r tr a n ............................................................. 8 3.1.1 Fortran 9 0 ............................................................................................... 9 3.1.2 Compiler D irectives.............................................................................. 10 3.1.3 Parallelism Features ........................................................................... 11 3.2 D ata Mapping . . ......................................................................................... 12 3.2.1 DISTRIBUTE directive .................................................................... 13 3.2.2 ALIGN d i r e c t iv e .................................................................................. 13 3.2.3 TEMPLATE d i r e c t iv e ............................ 14 3.2.4 PROCESSOR directive ............................................ 15 3.2.5 D ata Mapping for Procedure A rg u m e n ts ...................................... 15 3.3 D ata Parallelism ............................................................................................. 17 3.3.1 FORALL statement ........................................................................... 17
منابع مشابه
Parallel Strategies for the VMEC Program
The magnetohydrodynamic equilibrium problem in magnetic confinement fusion devices is solved by using the Variational Moments Equilibrium Code (VMEC). The sequential version of the code is computationally very expensive and may take several days of CPU time. In this work we present two parallelization techniques for the code. First, an automatic tool has been used as a guidance for the parallel...
متن کاملParallelization of Rich Models for Steganalysis of Digital Images using a CUDA-based Approach
There are several different methods to make an efficient strategy for steganalysis of digital images. A very powerful method in this area is rich model consisting of a large number of diverse sub-models in both spatial and transform domain that should be utilized. However, the extraction of a various types of features from an image is so time consuming in some steps, especially for training pha...
متن کاملAutomatic Parallelization by Pattern-Matching
We present the top{down design of a new system which performs automatic parallelization of numerical Fortran 77 or C source programs for execution on distributed{memory message { passing multi-processors such as e.g. the INTEL iPSC860 or the TMC CM{5. The key idea is a high{level pattern{matching approach which in some useful way permits partial restructuring of a wide class of numerical progra...
متن کاملEfficient parallelization of the genetic algorithm solution of traveling salesman problem on multi-core and many-core systems
Efficient parallelization of genetic algorithms (GAs) on state-of-the-art multi-threading or many-threading platforms is a challenge due to the difficulty of schedulation of hardware resources regarding the concurrency of threads. In this paper, for resolving the problem, a novel method is proposed, which parallelizes the GA by designing three concurrent kernels, each of which running some depe...
متن کامل