Exploring Optimal Cost-Performance Designs for Raw Microprocessors

نویسندگان

  • Csaba Andras Moritz
  • Donald Yeung
  • Anant Agarwal
چکیده

The semiconductor industry roadmap projects that advances in VLSI technology will permit more than one billion transistors on a chip by the year 2010. The MIT Raw microprocessor is a proposed architecture that strives to exploit these chip-level resources by implementing thousands of tiles, each comprising a processing element and a small amount of memory, coupled by a static two-dimensional interconnect. A compiler partitions ne-grain instruction-level parallelism across the tiles and statically schedules inter-tile communication over the interconnect. Because Raw microprocessors fully expose their internal hardware structure to the software, they can be viewed as a gigantic FPGA with coarse-grained tiles, in which software orchestrates communication over static interconnections. One open challenge in Raw architectures is to determine their optimal grain size and balance. The grain size is the area of each tile, and the balance is the proportion of area in each tile devoted to memory, processing, communication, and I/O. If the total chip area is xed, more area devoted to processing will result in a higher processing power per node, but will lead to a fewer number of tiles. This paper presents an analytical framework using which designers can reason about the design space of Raw microprocessors. Based on an architectural model and a VLSI cost analysis, the framework computes the performance of applications, and uses an optimization process to identify designs that will execute these applications most cost-e ectively. Although the optimal machine con gurations obtained vary for di erent applications, problem sizes and budgets, the general trends for various applications are similar. Accordingly, for the applications studied, assuming an 1 billion logic transistor equivalent area, we recommend building a Raw chip with approximately 1000 tiles, 30 words/cycle global I/O, 20Kbytes of local memory per node, 3-4 words/cycle local communication bandwidth, and single-issue processors. This con guration will give performance near the global optimum for most applications.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SimpleFit: A Framework for Analyzing Design Trade-Offs in Raw Architectures

ÐThe semiconductor industry roadmap projects that advances in VLSI technology will permit more than one billion transistors on a chip by the year 2010. The MIT Raw microprocessor is a proposed architecture that strives to exploit these chip-level resources by implementing thousands of tiles, each comprising a processing element and a small amount of memory, coupled by a static two-dimensional i...

متن کامل

Design Verification of Advanced Pipelined Machines

We have developed formal verification techniques for the high-level design of pipelined microprocessors. Our technique represents the instruction stream in such a way that pipeline properties, such as RAW hazards do not exist, can be directly expressed. This approach allows us to prove the correctness of the machine design incrementally, which reduces the computational cost of the verification....

متن کامل

A Design Environment for Counterflow Pipeline Synthesis

The Counterflow Pipeline (CFP) organization may be a good target for synthesis of application-specific microprocessors for embedded systems because it has a regular and simple structure. This paper describes a design environment for tailoring CFP's to an embedded application to improve performance. Our system allows exploring the design space of all possible CFP's for a given embedded applicati...

متن کامل

OPTIMAL PERFORMANCE-BASED SEISMIC DESIGN OF COMPOSITE BUILDING FRAMES WITH RC COLUMNS AND STEEL BEAMS

Composite RCS building frames integrate reinforced concrete columns with structural steel beams to provide an efficient solution for the design and construction of earthquake-resisting structures. In this paper, an optimization framework is developed for performance-based seismic design of planar RCS moment resisting frames. The objective functions are defined as minimizing the construction cos...

متن کامل

RISC Microprocessors and Scienti c Computing

This paper discusses design features in currently available RISC microprocessors that result in less-than-optimal sustained performance on large-scale scienti c calculations. Recommendations for future designs are suggested. The author is with the Numerical Aerodynamic Simulation (NAS) Systems Division at NASA Ames Research Center, Mo ett Field, CA 94035.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998