Memory Centric Interconnection Mechanism for Message Passing in Parallel Systems

نویسندگان

  • Yamin Li
  • Sanli Li
  • Wanming Chu
چکیده

The Interconnection Network (IN) connecting computing nodes in parallel systems has become one of the key research issues in parallel computer architecture. Traditionally, the INs in parallel systems for message passing have been built on the basis of logic circuits with different topology structures. Currently, the bandwidth of data transmission for message passing in available parallel systems reaches the magnitude of Gbps. In this paper, we investigate a kind of interconnection network based on multiport memory for message passing parallel systems, which is termed Memory Centric Interconnection Mechanism (MCIM). The memory is a Multi-Port Fast Static Memory (MPFSRAM) which acts as mailbox for message passing. With considerably simpler Arbitration and Selection Units, MCIM can fully utilize the bandwidth of MPFSRAM for parallel data transfer to achieve the bandwidth of dozen Gbps for message passing with pipelined mode operation of data sending and receiving, meanwhile with much less complexity than that of the IN based on logic circuits, such as routers with hundreds of lines for each link in conventional Mesh structure parallel systems. Furthermore, the buffers in the mail box provide flexible measures for scheduling the message passing. This paper describes the principle of MCIM, the mechanism of data sending/receiving operations, the arbitration/selection of communication path, and the scheduling flowchart of message passing in the buffer. The paper also gives the experimental simulation results of the MCIM performance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Message-Passing Distributed Memory Parallel Algorithm for a Dual-Code Thin Layer, Parabolized Navier-Stokes Solver

In this study, the results of parallelization of a 3-D dual code (Thin Layer, Parabolized Navier-Stokes solver) for solving supersonic turbulent flow around body and wing-body combinations are presented. As a serial code, TLNS solver is very time consuming and takes a large part of memory due to the iterative and lengthy computations. Also for complicated geometries, an exceeding number of grid...

متن کامل

Implementation of production systems on message passing computers : techniques, simulation results and analysis

The two important parallel architecture classes are the shared-memory architectures and the message-passing architectures. In the past researchers working on the parallel implementations of production systems have focused either on shared-memory multiprocessors or on special-purpose architectures. Message-passing computers have not been studied. The main reasons have been the large message-pass...

متن کامل

Interconnection Topologies and Routing for Parallel Processing Systems

The major aims of this work is to give a comparative survey of static interconnection topologies and to discuss their properties with respect to their use as interconnection topolo gies in message passing multi computer systems i e each processing element has its own local memory there is no common memory and the processing elements communicate via message passing To this end it was necessary t...

متن کامل

OpenMP: An Industry- Standard API for Shared- Memory Programming

46 1070-9924/98/$10.00 © 1998 IEEE IEEE COMPUTATIONAL SCIENCE & ENGINEERING Application developers have long recognized that scalable hardware and software are necessary for parallel scalability in application performance. Both have existed for some time in their lowest common denominator form, and scalable hardware—as physically distributed memories connected through a scalable interconnection...

متن کامل

OpenMP : An Industry - Standard API for Shared - Memory Programming LEONARDO

46 1070-9924/98/$10.00 © 1998 IEEE IEEE COMPUTATIONAL SCIENCE & ENGINEERING Application developers have long recognized that scalable hardware and software are necessary for parallel scalability in application performance. Both have existed for some time in their lowest common denominator form, and scalable hardware—as physically distributed memories connected through a scalable interconnection...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003