Empirical testing of Open Source Operating System Reliability: Failure analysis by co-relating event logs and crashes
Abstract
Open source operating systems offer many advantages over proprietary systems for studying reliability. They are in widespread use in a variety of specialized versions, yet all are accessible at the source code level for analysis and modification. In this paper we look at failure analysis within distributions of the Linux operating system. Event logs are vital for failure analysis, but our experiments so far have shown that the event logs of Linux leave a lot to be desired. We describe our exploration of the event logs and the experiments we performed to study failure analysis through them. We then introduce enhancements to the Linux kernel and describe the advances in failure analysis that these enhancements make possible. The enhancements are in turn tested through a set of experiments. We analyze the results of these experiments and suggest a few paths to enhance Linux event logs for better failure analysis.
Similar resources
Failure Mode and Effect Analysis Power Plant Boiler
Demand for electricity is increasing, and the government has now involved third parties in electricity provision, so investors compete to build infrastructure to supply it. Thermal power is one source that has a fast break-even point compared to other resources, which makes it more attractive to investors despite all the forms of pollution it causes. A form of heat pow...
Reliability Analysis of Redundant Repairable System with Degraded Failure
This investigation deals with the transient analysis of a machine repair system consisting of M operating units under the care of a single repairman. To improve the system reliability/availability, Y warm standby and S cold standby units are provided to replace the failed units. When all spares are in use, the failure of units occurs in a degraded fashion. In such situation ...
Simple Event Correlator for real-time security log monitoring
When it comes to the security of an IT system, event logs play a crucial role. Today, many applications, operating systems, network devices and other system components are capable of writing security-related event messages to log files. The BSD syslog protocol is an event logging standard supported by the majority of OS and network equipment vendors, which allows one to set up a central log server...
Bayes Networks and Fault Tree Analysis Application in Reliability Estimation (Case Study: Automatic Water Sprinkler System)
In this study, the application of Bayes networks and fault tree analysis in reliability estimation has been investigated. Fault tree analysis is one of the most widely used methods for estimating reliability. In recent years, a method called "Bayes Network" has been used, which is a dynamic method, and information about the probable failure of the system components will be updated according to...
Reliability Analysis of Three Elements Series and Parallel Systems under Time-varying Fuzzy Failure Rate
Reliability is the most important performance issue in the engineering design process, but in real-world problems there are limitations to using conventional reliability. Fuzzy logic has proved to be effective in expressing uncertainties in different fields, including reliability engineering. In this paper, for both the series and parallel systems composed of three identical or differe...