Enhancing reproducibility in scientific computing: Metrics and registry for Singularity containers
نویسندگان
چکیده
Here we present Singularity Hub, a framework to build and deploy Singularity containers for mobility of compute, and the singularity-python software with novel metrics for assessing reproducibility of such containers. Singularity containers make it possible for scientists and developers to package reproducible software, and Singularity Hub adds automation to this workflow by building, capturing metadata for, visualizing, and serving containers programmatically. Our novel metrics, based on custom filters of content hashes of container contents, allow for comparison of an entire container, including operating system, custom software, and metadata. First we will review Singularity Hub's primary use cases and how the infrastructure has been designed to support modern, common workflows. Next, we conduct three analyses to demonstrate build consistency, reproducibility metric and performance and interpretability, and potential for discovery. This is the first effort to demonstrate a rigorous assessment of measurable similarity between containers and operating systems. We provide these capabilities within Singularity Hub, as well as the source software singularity-python that provides the underlying functionality. Singularity Hub is available at https://singularity-hub.org, and we are excited to provide it as an openly available platform for building, and deploying scientific containers.
منابع مشابه
Singularity: Scientific containers for mobility of compute
Here we present Singularity, software developed to bring containers and reproducibility to scientific computing. Using Singularity containers, developers can work in reproducible environments of their choosing and design, and these complete environments can easily be copied and executed on other platforms. Singularity is an open source initiative that harnesses the expertise of system and softw...
متن کاملAn optimization model for management of empty containers in distribution network of a logistics company under uncertainty
In transportation via containers, unbalanced movement of loaded containers forces shipping companies to reposition empty containers. This study addresses the problem of empty container repositioning (ECR) in the distribution network of a European logistics company, where some restrictions impose decision making in an uncertain environment. The problem involves dispatching empty contain...
متن کاملThe Impact of Runtime Metrics Collection on Adaptive Mobile Applications
This thesis proposes, implements, and evaluates in terms of correctness and performance, an online metrics collection strategy to assist mobile application adaptation with the support of a mobile object framework. The solution is based upon an abstract representation of the system, which holds containers aggregating metrics for each specific component comprising a mobile object framework, inclu...
متن کاملAre new metrics pushing out traditional measurement tools of academic communications and scholarship?
Introduction: Universities are the centers of interactions and communications and a basic of scientific products and publications in the country. Scientific production and publications are surveyed and evaluated for various reasons. Evaluation criteria with regard to the location, type and circumstances are different and are occasionally changed and new. In this study, we survey the new metrics...
متن کاملIdiomatic and Reproducible Software Builds using Containers for Reliable Computing
Containers as the unit of application delivery are the ‘next big thing’ in the software development world. They enable developers to create an executable image containing an application bundled with all its dependencies which a user can run inside a controlled environment with virtualized resources. Complex workflows for business-critical applications and research environments require a high de...
متن کامل