Value Prediction Design for High-Frequency Microprocessors
نویسندگان
چکیده
This paper studies value predictor design under table access latency and energy constraints for high-frequency, wide-issue microprocessors. Previous value prediction efforts make generous assumptions regarding table sizes and access conditions, while ignoring prediction latencies and energy issues. Our work shows that the latency of a high-performance value predictor cannot be completely hidden by the early stages of the instruction pipeline as previously assumed, and causes noticeable performance degradation versus unconstrained value prediction. This work describes and compares three varieties of value prediction strategies: At-Fetch, Post-Decode, and Decoupled. Decoupled value prediction excels at overcoming the high-frequency table access constraints but with high energy consumption requirements. A load-only value predictor can greatly reduce energy consumption but at the cost of performance. We propose a value prediction approach that combines the latency-friendly approach of Decoupled value prediction with a more energyefficient implementation. The traditional PC-indexed prediction tables are removed and replaced by updates of buffered prediction traces. This latency and energy aware form of accounting for and distributing the value predictions leads to a 58%-95% reduction in dynamic energy versus common high-performance value prediction hardware while still maintaining high performance.
منابع مشابه
Improving achievable ILP through value prediction and program profiling
This paper explores the possibility of using program profiling to enhance the efficiency of value prediction. Value prediction attempts to eliminate true-data dependencies by predicting the outcome values of instructions at run-time and executing true-data dependent instructions based on that prediction. So far, all published techniques in this area have examined hardware-only value prediction ...
متن کاملHigh-Performance Microprocessor Design
Three generations of Alpha microprocessors have been designed using a proven custom design methodology. The performance of these microprocessors was optimized by focusing on high-frequency design. The Alpha instruction set architecture facilitates high clock speed, and the chip organization for each generation was carefully chosen to meet critical paths. Digital has developed six generations of...
متن کاملThe Impact of Value Mispeculation on Branch Resolution in Out-of-Order Superscalar Microprocessors
As contemporary microprocessor design attempt to expose additional instruction level parallelism (ILP), data value speculation has become a mainstream microarchitectural feature. Many different value speculation techniques have been proposed. The goals are to both achieve a high prediction rate, while predicting values for critical instructions. One key issue is how value-based speculation affe...
متن کاملThe Impact of Value Misspeculation on Branch Resolution in Out - of - Order Superscalar Microprocessors
As contemporary microprocessor design attempt to expose additional instruction level parallelism (ILP), data value speculation has become a mainstream microarchitectural feature. Many different value speculation techniques have been proposed. The goals are to both achieve a high prediction rate, while predicting values for critical instructions. One key issue is how value-based speculation affe...
متن کاملHigh-Performance Microprocessor Design
Three generations of Alpha microprocessors have been designed using a proven custom design methodology. The performance of these microprocessors was optimized by focusing on high-frequency design. The Alpha instruction set architecture facilitates high clock speed, and the chip organization for each generation was carefully chosen to meet critical paths. Digital has developed six generations of...
متن کامل