Adding Structure to Unstructured Data

Authors

  • Peter Buneman
  • Susan B. Davidson
  • Mary F. Fernández
  • Dan Suciu
Abstract

We develop a new schema for unstructured data. Traditional schemas resemble the type systems of programming languages. For unstructured data, however, the underlying type may be much less constrained and hence an alternative way of expressing constraints on the data is needed. Here, we propose that both data and schema be represented as edge-labeled graphs. We develop notions of conformance between a graph database and a graph schema and show that there is a natural and efficiently computable ordering on graph schemas. We then examine certain subclasses of schemas and show that schemas are closed under query applications. Finally, we discuss how they may be used in query decomposition and optimization.

Comments: Postprint version. Published in Lecture Notes in Computer Science, International Conference on Database Theory, Volume 1186, 1997, pages 336-350. Publisher URL: http://dx.doi.org/10.1007/3-540-62222-5_55

This conference paper is available at ScholarlyCommons: http://repository.upenn.edu/db_research/35
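
The abstract's idea of checking a graph database against a graph schema can be illustrated with a small sketch. It rests on assumptions that the abstract does not spell out: conformance is modeled here as a simulation relation between the two graphs, schema edges carry predicates over labels, and all names (`conforms`, `DataGraph`, `SchemaGraph`, the sample graphs) are hypothetical. It is meant as an illustration of the idea, not as the authors' algorithm.

```python
from typing import Callable, Dict, List, Tuple

# Data graph: node -> list of (edge label, target node).
# Every node, including leaves, must appear as a key.
DataGraph = Dict[str, List[Tuple[str, str]]]
# Schema graph: node -> list of (predicate on labels, target node).
SchemaGraph = Dict[str, List[Tuple[Callable[[str], bool], str]]]


def conforms(db: DataGraph, db_root: str,
             schema: SchemaGraph, schema_root: str) -> bool:
    """Return True if db_root is simulated by schema_root (assumed definition)."""
    # Start from the full relation and delete pairs that break the
    # simulation condition until nothing changes (greatest fixpoint).
    sim = {(u, s) for u in db for s in schema}
    changed = True
    while changed:
        changed = False
        for (u, s) in list(sim):
            for label, v in db[u]:
                # Each data edge needs a schema edge whose predicate accepts
                # the label and whose target still simulates the data target.
                if not any(pred(label) and (v, t) in sim
                           for pred, t in schema[s]):
                    sim.discard((u, s))
                    changed = True
                    break
    return (db_root, schema_root) in sim


# Hypothetical example data: one paper with a title and a year.
db = {
    "root": [("paper", "p1")],
    "p1": [("title", "t1"), ("year", "y1")],
    "t1": [],
    "y1": [],
}
# A loose schema allows any label below a paper ...
loose = {
    "S0": [(lambda l: l == "paper", "S1")],
    "S1": [(lambda l: True, "S2")],
    "S2": [],
}
# ... while a strict schema allows only title or author edges.
strict = {
    "S0": [(lambda l: l == "paper", "S1")],
    "S1": [(lambda l: l in {"title", "author"}, "S2")],
    "S2": [],
}

print(conforms(db, "root", loose, "S0"))
print(conforms(db, "root", strict, "S0"))
```

The fixpoint loop is the simplest way to compute a simulation and runs in polynomial time; faster algorithms exist, but the naive version keeps the condition being checked visible.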


Similar resources

Enhancing Search with Structure

Keyword search has traditionally focussed on retrieving documents in ranked order, given simple keyword queries. Similarly, work on keyword queries on structured data has focussed on retrieving closely connected pieces of data that together contain given query keywords. In recent years, there has been a good deal of work that attempts to go beyond the above paradigms, to improve search experien...


Nonparametric Regression Estimation under Kernel Polynomial Model for Unstructured Data

Nonparametric estimation (NE) of the kernel polynomial regression (KPR) model is a powerful tool for visually depicting the effect of covariates on the response variable when the data are unstructured and heterogeneous. In this paper we introduce a KPR model that is a mixture of nonparametric regression models with a bootstrap algorithm, which is considered in a heterogeneous and unstructured framewo...


Investigating the Amount of Forces Caused by Solitary Waves on Coastal Walls Using OpenFOAM Software

Coastal walls (dykes) are one method of protecting the coast against erosion and the destructive forces of waves. The purpose of this study is to simulate the collision of waves with a coastal dyke and compare the results with a laboratory model. The open-source software OpenFOAM and the k-ω SST turbulence model were used to simulate the amount of wave consumed by the coastal dyke. Taking i...


In-depth Interactive Visual Exploration for Bridging Unstructured and Structured Document Content

Semi-structured data refers to the combination of unstructured and structured data. Unstructured data is free text in natural language, while structured data is typically stored in tables and follows a data schema. Recent statistics show that 80% of the data generated in the last two years is unstructured. However, one interesting observation is that free text usually comes along with some s...


The Effect of Adding Alginate Natural Polymer on the Structure of Polyvinyl Alcohol Biocompatible Nanofibers in Electrospinning Process

Background: To preserve the environment and support sustainable development, the use of natural and renewable resources is now a priority for industry. The high performance and specific structure of nano-biocompatible materials have attracted researchers. In this research, alginate polymer, which is generally obtained from marine sources such as algae, was added to polyvinyl alcohol nanofi...



Publication date: 1997