Aggregate Evaluability in Statistical Databases

نویسندگان

  • Francesco M. Malvestuto
  • Marina Moscarini
چکیده

Usually a statistical database contains many summary tables representing the distribution of the same statistical variable over the classes of as many partitions of a certain universe of objects. Existing query systems allow only queries on single tables. Indeed, in most cases additional queries can be evaluated by combining the information contained in similar tables in a suitable way. attribute” [ 14,201) relat.ed to a given universe of 0bject.s or individuals, partitioned according to a set of (category) attributes, referred to as the scheme of the table. Example 1. Untuerse: Soviet people in the year 1959. Variable: Population (1000 individuals). Scheme: {Sex, Schooling, Part,y-Membership} (the data is obtained by processing data from Bishop et al. [4]). In order to improve the responsiveness of the database and allow an integrated use of the stored informat.ion, we propose to inform t,he database system of the relationship among the partitions adopted in the tables. Such a relationship, called zntersection dependency, states which classes of the partitions have a nonempty intersection and can be represented by a uniform multipartite hypergraph, called intersection hypergraph. On the grounds of the algebraic properties of the intel Jection hypergraph and under the assumption of data additivity, we shall provide a characteriration of evaluable queries, which allows us to define polynomial-time procedures both for testing evaluability and for evaluating queries. Table: Distribution of the soviet populatiion by schooling, sex and party (1000 individuals) 1959 Sex / Schooling Party-Membership Yes No

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Model for Representing Statistical Objects

In this paper the structure and the semantic properties of the entities stored in databases, whose data are only aggregate-type data, are defined and discussed. This choice is justified by the wide spread use of aggregate data without the corresponding raw data (i.e. micro-data, such as census data). Aggregate data are often derived by applying statistical aggregation (e.g. sum, count) and stat...

متن کامل

Aggregate and Grouping Functions in Object Oriented Databases

E cient evaluation of aggregate functions in object oriented databases OODB can have considerable impact on performance in many application areas like geographic information systems and statistical and scienti c databases The problem with current systems is ine cient execution of aggregate functions with large data volumes and lack of exibility it is not possible to extend the systems with new ...

متن کامل

Applying OLAP Pre-Aggregation Techniques to Speed Up Query Processing in Raster-Image Databases

Aggregate functions are particularly useful when dealing with extremely large volumes of data. In business and statistical databases, aggregate queries have been leveraged by powerful methods such as On-Line Analytical Processing (OLAP). In contrast, current technology for raster image databases is lagging behind. A comparative study between raster image and business databases has shown similar...

متن کامل

STORM: A Statistical Object Representation Model

For the last several years, a number of researchers have been interested in the various problems which arise when modelling aggregate-type data [1st SDBM], [2nd SDBM], [3rd SSDBM], [Rafanelli 89]. Since aggregate data is often derived by applying statistical aggregation (e.g. SUM, COUNT) and statistical analysis functions over micro-data [Wong 84] the aggregate data bases are also called "stati...

متن کامل

Operators for Multidimensional Aggregate Data

Copyright © 2003, Idea Group Inc. Copying or distributing in print or electronic forms without written permission of Idea Group Inc. is prohibited. ABSTRACT In this chapter the author proposes the different approaches for defining operators able to manipulate this multidimensional structure. In particular, he initially considers operators for multidimensional aggregate data which extend relatio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1989