CMOS floating-point unit for the S/390 Parallel Enterprise Server G4
نویسندگان
چکیده
The S/390@ floating-point unit (FPU) on the fourth-generation (G4) CMOS microprocessor chip has been implemented in a CMOS technology with a 0.20-pm effective channel length and has been demonstrated at more than 400 MHz. The microprocessor chip is 17.35 by 17.30 mm in size, and one copy of the FPU including the dataflow and control flow but not including the FPR register file is 5.3 by 4.7 mm in size. There are two copies on the chip for error-detection purposes only; both copies execute the same instruction stream and are checked against each other. The highperformance implementation has a throughput of one instruction per cycle and an average latency of three execution cycles, yielding approximately 70 MFLOPS at 300 MHz on the Linpack benchmark. Currently, the G4 FPU is the highest-performance S/390 CMOS FPU with fault tolerance. It uses several innovative and high-performance algorithms not commonly found in S/390 FPUs or other FPUs, such as a radix-8 Booth multiplier, a Goldschmidt division and squareroot algorithm, techniques for updating the exponent in parallel with normalization, and avoidance of the remainder comparison in quadratically converging division and squareroot algorithms. Also demonstrated is a practical design technique for designing control flow into the dataflow and early floorplanning techniques.
منابع مشابه
PLL modeling and verification in a cycle-simulation environment
Recent advances in technology, computer architecture, and automated design environments have ushered in a new era of computer design in which large complex servers such as the S/390 G5 Parallel Enterprise ServerTM can be delivered with times to market once reserved for low-end systems such as single-user workstations and personal computers. Yet, the time to market is inversely proportional to c...
متن کاملS/390 Parallel Enterprise Server CMOS Cryptographic Coprocessor
As the Internet becomes the basis for electronic commerce, and as more businesses automate their data processing operations, the potential for unauthorized disclosure of sensitive data increases. On-line databases are becoming increasingly large and complex. Sensitive data is transmitted on communication lines and often stored off-line. As a result, the efficient, economical protection of enter...
متن کاملArchitecture and software support in IBM S/390 Parallel Enterprise Servers for IEEE Floating-Point arithmetic
IEEE Binary Floating-Point is an industrystandard architecture. The IBM System/360 hexadecimal floating-point architecture predates the IEEE standard and has been carried forward through the System/370 to current System/390 processors. The growing importance of industry standards and floatingpoint combined to produce a need for IEEE Floating-Point on System/390. At the same time, customer inves...
متن کاملThe S/390 G5/G6 binodal
The IBM S/390 fifth-generation CMOS-based server (more commonly known as the G5) produced a dramatic improvement in system-level performance in comparison with its predecessor, the G4. Much of this improvement can be attributed to an innovative approach to the cache and memory hierarchy: the binodal cache architecture. This design features shared caching and very high sustainable bandwidths at ...
متن کاملFunctional verification of the z990 superscalar, multibook microprocessor complex
verification of the z990 superscalar, multibook microprocessor complex D. G. Bair S. M. German W. D. Wollyung E. J. Kaminski, Jr. J. Schafer M. P. Mullen W. J. Lewis R. Wisniewski J. Walter S. Mittermaier V. Vokhshoori R. J. Adkins M. Halas T. Ruane U. Hahn This paper describes the verification methods and techniques that were established to verify the microarchitecture and architectural correc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IBM Journal of Research and Development
دوره 41 شماره
صفحات -
تاریخ انتشار 1997