An Architecture for Runtime Evaluation of SoC Reliability
Abstract
This paper presents an architecture for evaluating the reliability of a system-on-chip (SoC) at runtime while accounting for the system’s redundancy. We propose integrating an autonomic layer into the SoC to detect the chip’s current condition and trigger appropriate countermeasures. In the autonomic layer, error counters count the errors that occur within a fixed time interval. The counter values are accumulated into a global register that represents the system’s reliability. The accumulation takes the series and parallel composition of the system into account.
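The abstract does not state the accumulation rule itself. As a minimal sketch, assuming the standard reliability block diagram formulas (a series composition multiplies the component reliabilities, a parallel composition multiplies the component failure probabilities), the idea of folding per-counter readings into one global value could look like the following. The class names, the `operations` field, and the per-interval estimate `1 - errors / operations` are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass
class Counter:
    """Leaf block: one error counter read out at the end of a fixed interval."""
    errors: int      # errors counted during the interval
    operations: int  # hypothetical: operations observed in the same interval

    def reliability(self) -> float:
        # Assumed estimate: fraction of operations that completed without error.
        if self.operations == 0:
            return 1.0
        return 1.0 - min(self.errors, self.operations) / self.operations


@dataclass
class Series:
    """All parts must work: R = product of R_i."""
    parts: List["Block"]

    def reliability(self) -> float:
        r = 1.0
        for part in self.parts:
            r *= part.reliability()
        return r


@dataclass
class Parallel:
    """Redundant parts, one is enough: R = 1 - product of (1 - R_i)."""
    parts: List["Block"]

    def reliability(self) -> float:
        q = 1.0
        for part in self.parts:
            q *= 1.0 - part.reliability()
        return 1.0 - q


Block = Union[Counter, Series, Parallel]

# Hypothetical system: two redundant cores in parallel, in series with a shared bus.
core_a = Counter(errors=10, operations=1000)
core_b = Counter(errors=20, operations=1000)
bus = Counter(errors=1, operations=1000)
system = Series([Parallel([core_a, core_b]), bus])

# Software stand-in for the value held in the global register.
global_reliability = system.reliability()
print(f"{global_reliability:.6f}")
```

In this sketch the redundant core pair scores higher than either core alone, while the non-redundant bus bounds the overall value, which is the effect that accounting for series and parallel composition is meant to capture.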
Similar resources
A Fuzzy Partial Ordering Approach for QoS-based Selection of Web Services
With the development of Service-Oriented Computing (SOC), more and more functionally similar Web services are deployed over the Internet. Web service selection has therefore become a crucial issue for making SOC more applicable. Troublesome Web services affect the reliability of the whole SOC application that invokes them. Therefore, when choosing Web services, not only the functional att...
Embedded Memory Test Strategies and Repair
The demand for self-testing increases in proportion to memory size in a System on Chip (SoC). Memories normally occupy the majority of an SoC's area. Because the density of embedded memories keeps increasing, SoC designs need a self-testing mechanism. Therefore, this research study focuses on this problem and introduces a smooth solution for self-testing. In the proposed m...
Who Watches the Watchmen? Protecting Operating System Reliability Mechanisms
We present the design and initial evaluation of a resilient operating system architecture that leverages HW architectures combining few resilient with many non-resilient CPU cores. To this end, we build our system around a Reliable Computing Base (RCB) consisting of those software components that must work for reliable operation, and run the RCB on the resilient cores. The remainder of the syst...
Flexible Redundancy in Robust Processor Architecture
This paper proposes a reliable processor architecture that dynamically adapts the amount of protection to the characteristics of an individual chip and its runtime behavior. This architecture uses fine-grain redundancy, voltage scaling and timing speculation to adapt to variation and tolerate timing, soft and hard errors. The goal is to provide reliability with a minimum of resources. Our evalu...
A Methodology for Reliability Enhancement of Nanometer-Scale Digital Systems Based on a-priori Functional Fault-Tolerance Analysis
This paper presents a new approach for monitoring and estimating device reliability of nanometer-scale devices prior to fabrication. A four-layer architecture exhibiting a large immunity to permanent as well as random failures is used. A complete tool for a-priori functional fault tolerance analysis was developed. It is a statistical Monte Carlo based tool that induces different failure models,...