An Online Validator for Provenance: Algorithmic Design, Testing, and API
نویسندگان
چکیده
Provenance is a record that describes the people, institutions, entities, and activities involved in producing, influencing, or delivering a piece of data or a thing. The W3C Provenance Working group has just published the prov family of specifications, which include a data model for provenance on the Web. The working group introduces a notion of valid prov document whose intent is to ensure that a prov document represents a consistent history of objects and their interactions that is safe to use for the purpose of reasoning and other kinds of analysis. Valid prov documents satisfy certain definitions, inferences, and constraints, specified in prov-constraints. This paper discusses the design of ProvValidator, an online service for validating provenance documents according to prov-constraints. It discusses the algorithmic design of the validator, the complexity of the algorithm, how we demonstrated compliance with the standard, and its rest api.
منابع مشابه
Validata: A tool for testing profile conformance
Validata is an online web application for validating a dataset description expressed in RDF against a community profile expressed as a Shape Expression (ShEx). Additionally it provides an API for programmatic access to the validator. Validata is capable of being used for multiple community agreed standards, e.g. DCAT, the HCLS community profile, or the Open PHACTS guidelines, and there are curr...
متن کاملProvStore: A Public Provenance Repository
ProvStore is the first online public provenance repository supporting the new PROV standards by W3C. It allows users and applications to store and (optionally) publish the provenance of their data on the Web. Provenance documents can be transformed, visualized, and shared in various serializations, with all the functionality also available to automated applications via a RESTful API (OAuth supp...
متن کاملCMLLite: a design philosophy for CML
CMLLite is a collection of definitions and processes which provide strong and flexible validation for a document in Chemical Markup Language (CML). It consists of an updated CML schema (schema3), conventions specifying rules in both human and machine-understandable forms and a validator available both online and offline to check conformance. This article explores the rationale behind the change...
متن کاملAnalytic Provenance Datasets: A Data Repository of Human Analysis Activity and Interaction Logs
Datasets are available online at https://research.arch.tamu.edu/analytic-provenance/datasets/ for research purposes. Abstract We present an analytic provenance data repository that can be used to study human analysis activity, thought processes, and software interaction with visual analysis tools during exploratory data analysis. We conducted a series of user studies involving exploratory data ...
متن کاملIPAPI: Designing an Improved Provenance API
We investigate the main limitations imposed by existing provenance systems in the development of provenanceaware applications. In the case of disclosed provenance APIs, most of those limitations can be traced back to the inability to integrate provenance from different sources, layers and of different granularities into a coherent view of data production. We consider possible solutions in the d...
متن کامل