Evaluating Intel’s Many Integrated Core Architecture for Climate Science
نویسندگان
چکیده
We evaluate Intel’s Many Integrated Core (MIC) architecture for climate science. We present preliminary performance results with a general circulation model (HOMME) and a cloud-resolving physics model (CRM). The results show promise in parallel scalability. However, single-thread performance needs further improvement. Keywords-multi-core; many-core; HPC; parallel computing; climate; geoscience
منابع مشابه
Automatic Transformations for Effective Parallel Execution on Intel Many Integrated Core
We demonstrate in this work the potential effectiveness of a source-to-source framework for automatically optimizing a sub-class of affine programs on the Intel Many Integrated Core Architecture. Data locality is achieved through complex and automated loop transformations within the polyhedral framework to enable parallel tiling, and the resulting tiles are processed by an aggressive automatic ...
متن کاملSelf-adaptive Multiprecision Preconditioners on Multicore and Manycore Architectures
Based on the premise that preconditioners needed for scientific computing are not only required to be robust in the numerical sense, but also scalable for up to thousands of light-weight cores, we argue that this two-fold goal is achieved for the recently developed self-adaptive multi-elimination preconditioner. For this purpose, we revise the underlying idea and analyze the performance of impl...
متن کاملFirst experiences with the Intel MIC architecture at LRZ
With the rapidly growing demand for computing power new accelerator based architectures have entered the world of high performance computing since around 5 years. In particular GPGPUs have recently become very popular, however programming GPGPUs using programming languages like CUDA or OpenCL is cumbersome and errorprone. Trying to overcome these difficulties, Intel developed their own Many Int...
متن کاملSplotch: porting and optimizing for the Xeon Phi
With the increasing size and complexity of data produced by large scale numerical simulations, it is of primary importance for scientists to be able to exploit all available hardware in heterogenous High Performance Computing environments for increased throughput and efficiency. We focus on the porting and optimization of Splotch, a scalable visualization algorithm, to utilize the Xeon Phi, Int...
متن کاملAchieving Portable High Performance for Iterative Solvers on Accelerators
Many supercomputers, clusters, and workstations today are equipped with accelerators such as graphics processing units (GPUs) and Intel’s many-integrated core architecture (MIC). While their highly parallel architectures are very efficient for dense linear algebra operations, particularly those which are compute-bound rather than limited by memory bandwidth, their use for iterative solvers such...
متن کامل