Execution Based Evaluation of MINs for Cache-Coherent Multiprocessors
نویسندگان
چکیده
In this paper, performance of multistage interconnection network with wormhole routing and packet switching has been evaluated for cache-coherent shared-memory multiproces-sors. The evaluation is based on execution-driven simulation using various applications. The traac in cache-coherent systems is very diierent from the traac in message-passing environment and is characterized by traac bursts, one-to-many and many-to-one traac, and small xed length messages. The performance of packet switching and wormhole routing has been evaluated for diierent buuer sizes. The comparison of execution time between packet-switched and wormhole-routed network for same amount of buuer space per switch shows that wormhole-routed network provides signiicant advantage over packet-switched network. We have also evaluated wormhole networks with virtual channels for variable number of virtual channels and it buuers per channel. The study shows that for wormhole routing 4 virtual channels per link and 4 it buuers per channel is the best connguration in most of the cases.
منابع مشابه
Analysis of Interconnection Networks for Cache Coherent Multiprocessors with Scientific Applications
Interconnection networks, such as, shared bus and multistage interconnection networks (MINs) are very suitable for the design of shared memory multiprocessors. The existing analytical models of these networks are based on unrealistic synthetic workload for simplicity of the analyses. Also, they consider the networks in isolation without incorporating other architectural details of a multiproces...
متن کاملExecution Based Evaluation of Multistage Interconnection Networks for Cache-Coherent Multiprocessors
In this paper, performance of multistage interconnection network with wormhole routing and packet switching has been evaluated for cache-coherent shared-memory multiprocessors. The traac in cache-coherent systems is characterized by traac bursts, one-to-many and many-to-one traac, and small xed length messages. The evaluation is based on execution-driven simulation using various applications. T...
متن کاملImpact of Switch Design on the Application Performance of Cache-Coherent Multiprocessors
In this paper, the effect of switch design on the application performance of cache-coherent non-uniform memory access (CC-NUMA) multiprocessors is studied in detail. Wormhole routing and cut-through switching are evaluated for these shared-memory multiprocessors that employ multistage interconnection network (MIN) and full map directory-based cache coherence protocol. The switch design also con...
متن کاملThe MINC (Multistage Interconnection Network with Cache Control Mechanism) Chip
Although bus connected multiprocessors have been widely used as high-end workstations or servers, the number of connected processors is strictly limited by the maximum bandwidth of the shared bus. Instead of them, a switch connected multiprocessor which uses a crossbar or Multistage Interconnection Networks(MINs) for connecting processors and memory modules is a hopeful candidate. However, in s...
متن کاملA Preliminary Evaluation of Cache-miss-initiated Prefetching Techniques in Scalable Multiprocessors
Prefetching is an important technique for reducing the average latency of memory accesses in scalable cache-coherent multiprocessors. Aggressive prefetching can signiicantly reduce the number of cache misses, but may introduce bursty network and memory traac, and increase data sharing and cache pollution. Given that we anticipate enormous increases in both network bandwidth and latency, we exam...
متن کامل