An Analysis of Reproducibility and Non-Determinism in HEP Software and ROOT Data
Abstract
Reproducibility is an essential component of the scientific method. To validate a computational result or to extend it, it should be possible to re-run the published computation and verify that it produces the same results. However, reproducing a computational result is surprisingly difficult: non-determinism and other factors may make it impossible to get the same result, even when running the same code on the same machine on the same day. We explore this problem in the context of HEP codes and data, showing three high-level methods for dealing with non-determinism in general: 1) domain-specific methods; 2) domain-specific comparisons; and 3) virtualization adjustments. Using a CMS workflow with output data stored in ROOT files, we use these methods to prevent, detect, and eliminate some sources of non-determinism. We observe improved determinism using pre-determined random seeds, a predictable progression of system timestamps, and fixed process identifiers. Unfortunately, some sources of non-determinism remain even when all three methods are combined. Where non-determinism is unavoidable, hierarchical data comparisons allow us to ignore it appropriately. We conclude that there is still room for improvement, and identify directions that can be taken in each method to make an experiment more reproducible.
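Two of the ideas in the abstract, pre-determined random seeds and hierarchical comparisons that skip unavoidable non-determinism, can be illustrated with a small sketch. This is not the paper's tooling and does not read ROOT files: nested Python dictionaries stand in for the output data, and the names `run_workflow`, `diff`, and `IGNORED_KEYS`, as well as the `timestamp` and `pid` fields, are hypothetical choices made for illustration only.

```python
import math
import random

# Hypothetical field names: which fields are unavoidably non-deterministic
# depends on the actual workflow and is an assumption here.
IGNORED_KEYS = {"timestamp", "pid"}


def run_workflow(seed, pid, timestamp):
    """Toy stand-in for one workflow run; returns nested output data."""
    rng = random.Random(seed)  # a pre-determined seed makes the draws repeatable
    return {
        "meta": {"timestamp": timestamp, "pid": pid},
        "events": {"energy": [rng.gauss(100.0, 15.0) for _ in range(5)]},
    }


def diff(a, b, path="", rel_tol=0.0):
    """Walk two nested structures and return the paths at which they differ."""
    if isinstance(a, dict) and isinstance(b, dict):
        out = []
        for key in sorted(set(a) | set(b)):
            if key in IGNORED_KEYS:
                continue  # skip fields declared unavoidably non-deterministic
            child = f"{path}/{key}"
            if key not in a or key not in b:
                out.append(child)
            else:
                out.extend(diff(a[key], b[key], child, rel_tol))
        return out
    if isinstance(a, list) and isinstance(b, list):
        if len(a) != len(b):
            return [path]
        out = []
        for i, (x, y) in enumerate(zip(a, b)):
            out.extend(diff(x, y, f"{path}[{i}]", rel_tol))
        return out
    if isinstance(a, float) and isinstance(b, float):
        return [] if math.isclose(a, b, rel_tol=rel_tol) else [path]
    return [] if a == b else [path]


if __name__ == "__main__":
    first = run_workflow(seed=42, pid=1001, timestamp=1.0)
    second = run_workflow(seed=42, pid=2002, timestamp=2.0)
    print(diff(first, second))  # differing paths, excluding ignored fields
```

With the same seed, the two runs differ only in the ignored metadata, so the comparison reports no differences; changing the seed makes the differing leaves show up by path. A real comparison would need domain knowledge to decide which fields may be ignored and what tolerance, if any, is acceptable.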
Similar resources
A First Look at Reproducibility and Non-Determinism in CMS Software and ROOT Data
Reproducibility is an essential component of the scientific process. Including software and data with a published paper is a good step towards reproducible research. However, the presence of non-determinism in a scientific workflow can make validating results very difficult even between two runs on the same machine, the same day, and using the exact same command and parameters. But for reproduc...
Social and Ethical Responsibility against Social Determinism
Background: Because of the prominent role of ethical responsibility in social and individual life, it is regarded as an important issue in the field of ethics. Social responsibility, defined as a sub-factor of ethical responsibility, holds a similarly important place. What is commonly discussed in debates about responsibility is the “human will”. Responsibility is based largely on free will. In th...
Adherence patterns of diarrheagenic Escherichia coli strains isolated from children with gastrointestinal diarrhea and matched controls in a Hep-2 cell adhesion assay
In the present study, carbon species including organic carbon (OC), elemental carbon (EC), and water-soluble organic carbon (WSOC) concentration in PM2.5 were assessed at an urban site of Tehran, Iran during March to June 2014. The PM2.5 samples were collected using an frmOMNI™ Ambient Air Sampler. Thermal gravimetric analysis (TGA) was used to analyze OC and EC. The results showed that PM2.5 ...
Dual role of CdSe quantum dots for simultaneous separation and spectrofluorimetric ultrasensitive determination of heparin
The present study offers a new method based on CdSe quantum dots (QDs) for simultaneous separation and determination of trace levels of heparin (Hep) in human serum samples. In this technique, CdSe QDs perform two different functions in Hep analysis process. Mercaptoacetic acid-capped red CdSe QDs (λex=690 nm) are conjugated to Hep and the Hep-QD conjugation is then used as an extraction tool ...