Managing Derived Data in the Gaea Scienti c DBMS
نویسندگان
چکیده
One important aspect of scienti c data management is metadata management. Metadata is information about data (e.g., content, source, processing applied, precision). One kind of metadata which needs special attention is the data derivation information, i.e., how data are generated. In our application domain of geographical information systems (GIS) and global change research, we view scienti c objects according to three di erent extents: spatial, temporal, and derivation. While the spatial and temporal extents have been studied and formal semantics to those extents proposed, derivation semantics have been ignored. This paper presents a framework for capturing and managing scienti c data derivation histories as implemented in the Gaea scienti c database management system. We focus on how Gaea handles metadata and propose to extend current semantic modeling and object-oriented technology with special constructs: concepts, processes, and tasks. Concepts are used to capture entity sets with imprecise de nitions. A process captures the derivation procedure of a speci c scienti c object class, while a task is the instance representing the derivation of a scienti c data object. We believe that this framework, useful for GIS and global change studies, generalizes well to other scienti c elds.
منابع مشابه
Managing Derived Data in the Gaea Scientific DBMS
One important aspect of scientific data management is metadata management. Metadata is information about data (e.g., content, source, processing applied, precision). One kind of metsdata which needs special attention is the data derivation information, i.e., how data are generated. In our application domain of geographical information systems (GIS) and global change research, we view scientific...
متن کاملDistributed Database Management for Scienti c Data Analysis
Scienti c databases have recently become a challenging research area for a number of reasons: 1) the amount of data stored in scienti c databases is rapidly increasing, with orders of magnitude increases on the horizon, 2) the data are becoming increasing complex, as more complicated data structures and data relationships must be captured, 3) there is a need to integrate incompatible data forma...
متن کاملObject-Relational Queries into Multidimensional Databases with the Active Data Repository
As computational power and storage capacity increase, processing and analyzing large volumes of multi-dimensional datasets play an increasingly important role in many domains of scienti c research. Scienti c applications that make use of very large scienti c datasets have several important characteristics: datasets consist of complex data and are usually multi-dimensional; applications usually ...
متن کاملThe Gaea System: A Spatio-Temporal Database System for Global Change Studies
The Gaea system is a spatio-temporal database management system under development at Worcester Polytechnic Institute. Gaea is intended to provide advanced data management and analysis to geographical information systems (GIS) for global change studies. We present the objectives and long-term vision of the Gaea project, describe the Gaea system architecture and discuss the current state of devel...
متن کاملTioga: Providing Data Management Support for Scienti c Visualization Applications
We present a user interface paradigm for database management systems motivated by scienti c visualization applications. Our graphical user interface includes a \boxes and arrows" notation for database access and a ight simulator model of movement through information space. We also provide means to specify a hierarchy of abstracts of data of di erent types and resolutions. In addition, multiple ...
متن کامل