نتایج جستجو برای: parallel architectures

تعداد نتایج: 268182  

2008
Ankit Jain

We have developed pOSKI: the Parallel Optimized Sparse Kernel Interface – an autotuning framework to optimize Sparse Matrix Vector Multiply (SpMV) performance on emerging shared memory multicore architectures. Our autotuning methodology extends previous work done in the scientific computing community targeting serial architectures. In addition to previously explored parallel optimizations, we f...

2005
Frank Hannig Hritam Dutta Alexey Kupriyanov Jürgen Teich Rainer Schaffer Sebastian Siegel Renate Merker Ronan Keryell Bernard Pottier Daniel Chillet Daniel Ménard Olivier Sentieys

In this paper, we introduce a methodology for the systematic mapping, evaluation, and exploration of massively parallel processor architectures that are designed for special purpose applications in the world of embedded computers. The investigated class of computer architectures can be described by massively parallel networked processing elements that, using today’s hardware technology, may be ...

2009
Tilman Küstner Josef Weidendorfer Jasmine Schirmer Tobias Klug Carsten Trinitis Sybille Ziegler

The efficient use of multicore architectures for sparse matrixvector multiplication (SpMV) is currently an open challenge. One algorithm which makes use of SpMV is the maximum likelihood expectation maximization (MLEM) algorithm. When using MLEM for positron emission tomography (PET) image reconstruction, one requires a particularly large matrix. We present a new storage scheme for this type of...

1997
Dominik HENRICH Thomas HÖNIGER

Abstract – This paper presents the different possibilities for parallel processing in robot control architectures. At the beginning, we shortly review the historic development of control architectures. Then, a list of requirements for control architectures is set up from a parallel processing point of view. As our main topic, we identify the levels of parallel processing in robot control archit...

1993
Joachim Diederich Ah Chung Tsoi

Joachim Diederich, Queensland University of Technology (Brisbane), started with a brief introduction to connectionist modeling and parallel machines. Neural network modeling can be done on various levels of abstraction. On a low level of abstraction, a simulator can support the definition and simulation of "compartmental models," chemical synapses, dendritic trees etc., i.e. explicit computatio...

2001
Lasse Natvig

BSPlab is a simulation environment for studying the interplay between hardware and software in parallel computing. It offers the BSPlib parallel programming library and is based on Bulk Synchronous Parallel (BSP) computing [1], [2]. BSPlab contains a set of high-level performance models of parallel architectures. It can be used as a tool for architectural level design space exploration of BSP c...

Journal: :ICGA Journal 1995
Lewis Stiller

This thesis describes techniques for the design of parallel programs that solve well-structured problems with inherent symmetry. Part I demonstrates the reduction of such problems to generalized matrix multiplication by a group-equivariant matrix. Fast techniques for this multiplication are described, including factorization, orbit decomposition, and Fourier transforms over nite groups. Our alg...

Journal: :IEEE Trans. Computers 1999
Mark W. Goudreau Kevin J. Lang Satish Rao Torsten Suel Thanasis Tsantilas

The Bulk-Synchronous Parallel (BSP) model was proposed by Valiant as a standard interface between parallel software and hardware. In theory, the BSP model has been shown to allow the asymptotically optimal execution of architecture-independent software on a variety of architectures. Our goal in this work is to experimentally examine the practical use of the BSP model on current parallel archite...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید