Optimizing the Use of Static Buffers for DMA on a CELL Chip

Authors

  • Tong Chen
  • Zehra Sura
  • Kathryn M. O'Brien
  • John K. O'Brien
Abstract

The CELL architecture has one Power Processor Element (PPE) core, and eight Synergistic Processor Element (SPE) cores that have a distinct instruction set architecture of their own. The PPE core accesses memory via a traditional caching mechanism, but each SPE core can only access memory via a small 256K software-controlled local store. The PPE cache and SPE local stores are connected to each other and main memory via a high bandwidth bus. Software is responsible for all data transfers to and from the SPE local stores. To hide the high latency of DMA transfers, data may be prefetched into SPE local stores using loop blocking transformations and static buffers. We find that the performance of an application can vary depending on the size of the buffers used, and whether a single-, double-, or triple-buffer scheme is used. Constrained by the limited space available for data buffers in the SPE local store, we want to choose the optimal buffering scheme for a given space budget. Also, we want to be able to determine the optimal buffer size for a given scheme, such that using a larger buffer size results in negligible performance improvement. We develop a model to automatically infer these parameters for static buffering, taking into account the DMA latency and transfer rates, and the amount of computation in the application loop being targeted. We test the accuracy of our prediction model using a research prototype compiler developed on top of the IBM XL compiler infrastructure.
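
The double-buffer scheme mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' compiler output or their prediction model. It only shows the general shape of a blocked loop with two static buffers, where the transfer for the next block is issued before the current block is processed. The names `dma_get` and `blocked_sum_double_buffered` are hypothetical, and the DMA transfer is simulated with a synchronous list copy; on real SPE hardware the transfer would be asynchronous and the loop would wait on its completion before reading the buffer.

```python
def dma_get(dst, src, start, count):
    """Hypothetical stand-in for an asynchronous DMA get.

    Copies src[start:start+count] into the front of dst and returns count.
    Here the copy is synchronous; real hardware would overlap it with compute.
    """
    dst[:count] = src[start:start + count]
    return count


def blocked_sum_double_buffered(main_memory, block_size):
    """Sum main_memory block by block using two fixed-size static buffers."""
    n = len(main_memory)
    if n == 0:
        return 0

    buffers = [[0] * block_size, [0] * block_size]  # the two static buffers
    counts = [0, 0]                                 # valid elements per buffer
    total = 0

    # Prologue: fetch the first block into buffer 0.
    cur = 0
    counts[cur] = dma_get(buffers[cur], main_memory, 0, min(block_size, n))

    for start in range(0, n, block_size):
        nxt = 1 - cur
        next_start = start + block_size

        # Issue the transfer for the next block before computing on this one.
        if next_start < n:
            counts[nxt] = dma_get(buffers[nxt], main_memory, next_start,
                                  min(block_size, n - next_start))

        # (Real hardware: wait here for the current buffer's transfer to finish.)
        total += sum(buffers[cur][:counts[cur]])

        cur = nxt  # swap buffer roles for the next iteration

    return total
```

A single-buffer variant would fetch and compute on the same buffer in sequence, and a triple-buffer variant would rotate among three buffers in the same way. The space budget is split across however many buffers the scheme uses, which is the trade-off the abstract describes.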


Similar resources

Optimization of Material Removal Rate in Electrical Discharge Machining Alloy on DIN1.2080 with the Neural Network and Genetic Algorithm

The electrical discharge machining process is one of the most applicable non-traditional machining methods for removing material from electrically conductive workpieces, where achieving good part quality together with a high material removal rate is very important. Due to the rapid and widespread use of alloy DIN1.2080 in different industries such as molding, lathe tools, reamers, broaching, cutting guillot...

Full text

An Optimization on the DIN1.2080 Alloy in the Electrical Discharge Machining Process Using ANN and GA

The Electrical Discharge Machining (EDM) process is one of the most widely used machining methods. It is used to form parts that conduct electricity. This method has been used for hard materials, and it therefore requires selecting correct values for the parameters that strongly affect the machining quality of parts. Reaching the optimum condition for machining the DIN1.2080 alloy (D3) is v...

Full text

Segment Gating for Static Energy Reduction with Introspective Networks-on-Chip Undergraduate Honors Thesis

Chip multiprocessors (CMPs) have emerged as a primary vehicle for overcoming the limitations of uniprocessor scaling, with power constraints now representing a key factor of CMP design. Recent studies have shown that the on-chip interconnection network (NOC) can consume as much as 36% of overall chip power. To date, researchers have employed several techniques to reduce power consumption in the...

Full text

Non-Blocking Routers Design Based on West First Routing Algorithm & MZI Switches for Photonic NoC

For the first time, 4- and 5-port optical routers are designed using the West First (WF) routing algorithm for use in optical networks-on-chip. Using the WF algorithm allows the designed routers to provide non-blocking routing in a photonic network-on-chip. These routers are not only based on high-speed Mach-Zehnder switches (which have a higher bandwidth and more thermal tolerance than mi...

Full text


Publication date: 2006