An ISA-TAB-Nano based data collection framework to support data-driven modelling of nanotoxicology
Authors
Abstract
Analysis of trends in nanotoxicology data and the development of data-driven models for nanotoxicity are facilitated by reporting data in a standardised electronic format. ISA-TAB-Nano has been proposed as such a format. However, building useful datasets according to this format requires a variety of issues to be addressed, including exactly which (meta)data to report and how to report them. The current article discusses some of the challenges associated with the use of ISA-TAB-Nano and presents a set of resources designed to facilitate the manual creation of ISA-TAB-Nano datasets from the nanotoxicology literature. These resources were developed within the context of the NanoPUZZLES EU project and include data collection templates, corresponding business rules that extend the generic ISA-TAB-Nano specification, and Python code to facilitate parsing and integration of these datasets within other nanoinformatics resources. The use of these resources is illustrated by a "Toy Dataset" presented in the Supporting Information. The strengths and weaknesses of the resources are discussed along with possible future developments.
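As a rough illustration of what parsing a spreadsheet-style dataset and applying a simple business rule can look like, the following minimal Python sketch reads tab-delimited text and flags rows with an empty required field. It is not the NanoPUZZLES code: the column names, the sample rows, and the helper functions `read_tab_delimited` and `rows_missing_value` are all hypothetical, chosen only for illustration.

```python
import csv
import io

# Hypothetical tab-delimited content. The column names and values are
# illustrative only; they are not taken from the ISA-TAB-Nano specification
# or from the NanoPUZZLES templates.
SAMPLE = (
    "Sample Name\tMaterial\tMeasurement Value\n"
    "s1\tTiO2\t21.5\n"
    "s2\tZnO\t\n"
)


def read_tab_delimited(text):
    """Parse tab-delimited text into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(text), delimiter="\t"))


def rows_missing_value(rows, column):
    """Return the 1-based indices of data rows where `column` is empty."""
    return [
        index
        for index, row in enumerate(rows, start=1)
        if not (row.get(column) or "").strip()
    ]


rows = read_tab_delimited(SAMPLE)
missing = rows_missing_value(rows, "Measurement Value")
print(len(rows), missing)
```

A check of this kind is one plausible way a rule such as "this field must not be empty" could be enforced during manual data collection; the actual rules and parsing logic are defined by the resources the article describes.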
Similar resources
ISA-TAB-Nano: A Specification for Sharing Nanomaterial Research Data in Spreadsheet-based Format
BACKGROUND AND MOTIVATION: The high-throughput genomics communities have been successfully using standardized spreadsheet-based formats to capture and share data within labs and among public repositories. The nanomedicine community has yet to adopt similar standards to share the diverse and multi-dimensional types of data (including metadata) pertaining to the description and characterization of...
Nano(Q)SAR: Challenges, pitfalls and perspectives.
Regulation for nanomaterials is urgently needed, and the drive to adopt an intelligent testing strategy is evident. Such a strategy will not only provide economic benefits but will also reduce moral and ethical concerns arising from animal testing. For regulatory purposes, such an approach is promoted by REACH, particularly the use of quantitative structure-activity relationships [(Q)SAR] as a ...
The Stem Cell Commons: an exemplar for data integration in the biomedical domain driven by the ISA framework.
Comparisons of stem cell experiments at both molecular and semantic levels remain challenging due to inconsistencies in results, data formats, and descriptions among biomedical research discoveries. The Harvard Stem Cell Institute (HSCI) has created the Stem Cell Commons (stemcellcommons.org), an open, community-based approach to data sharing. Experimental information is integrated using the In...
An ISA-Tab specification for protein titration data exchange
Data curation presents a challenge to all scientific disciplines to ensure public availability and reproducibility of experimental data. Standards for data preservation and exchange are central to addressing this challenge: the Investigation-Study-Assay Tabular (ISA-Tab) project has developed a widely used template for such standards in biological research. This paper describes the application ...
Debt Collection Industry: Machine Learning Approach
Businesses are increasingly interested in how big data, artificial intelligence, machine learning, and predictive analytics can be used to increase revenue, lower costs, and improve their business processes. In this paper, we describe how we have developed a data-driven machine learning method to optimize the collection process for a debt collection agency. Precisely speaking, we create a frame...