Incremental View Maintenance for Property Graph Queries
نویسنده
چکیده
Graph processing challenges are common in modern database systems, with the property graph data model gaining widespread adoption [29]. Due to the novelty of the eld, graph databases and frameworks typically provide their own query language, such as Cypher for Neo4j [27], Gremlin for TinkerPop [28] and GraphScript for SAP HANA [24]. These languages often lack a formal background for their data model and semantics [1]. To address this, the openCypher initiative [21] aims to standardise a subset of the Cypher language, for which it currently provides grammar speci cation and a set of acceptance tests to allow vendors to implement their openCypher compatible engine. Incremental view maintenance has been used for decades in relational database systems [4]. In the graph domain, numerous use cases rely on complex queries and require low latency, including nancial fraud detection, source code analysis [32] and checking integrity (or well-formedness) constraints in databases [30]. While these could bene t from incremental evaluation, currently no property graph system provides incremental views. Our research investigates the incremental view maintenance for openCypher queries. A key challenge is that the property graph data model includes lists and maps, and queries can return arbitrarily nested data structures. We propose three desirable properties for an incremental property graph query engine: (IVM) incremental view maintenance, (FGN) ne granularity update operations on nested data structures, (ORD) ordering. Previous research showed that IVM and FGN is possible [19]. However, as stated in [8], "incremental view maintenance [IVM] strategies for data models that preserve order [ORD] remain an open problem to date". While removing support for ordering might seem a plausible workaround, it would pose serious limitations: (1) queries that require top-k results are common [17] and (2) even more importantly, Cypher handles paths as an alternating list of vertices and edges, which must be kept ordered. Therefore, we investigate the following research question: Which practical fragment of the openCypher language is incrementally maintainable?
منابع مشابه
Incremental View Maintenance for Property Graph eries
Graph processing challenges are common in modern database systems, with the property graph data model gaining widespread adoption [29]. Due to the novelty of the eld, graph databases and frameworks typically provide their own query language, such as Cypher for Neo4j [27], Gremlin for TinkerPop [28] and GraphScript for SAP HANA [24]. These languages often lack a formal background for their data ...
متن کاملIncremental View Maintenance for Deductive Graph Databases Using Generalized Discrimination Networks
Nowadays, graph databases are employed when relationships between entities are in the scope of database queries to avoid performance-critical join operations of relational databases. Graph queries are used to query and modify graphs stored in graph databases. Graph queries employ graph pattern matching that is NP-complete for subgraph isomorphism. Graph database views can be employed that keep ...
متن کاملIncremental Maintenance for Materialized Views over Semistructured Data
Semistructured data is not strictly typed like relational or object-oriented data and may be irregular or incomplete. It often arises in practice, e.g., when heterogeneous data sources are integrated or data is taken from the World Wide Web. Views over semistructured data can be used to lter the data and to restructure (or provide structure to) it. To achieve fast query response time, these vie...
متن کاملViewDF: Declarative incremental view maintenance for streaming data
In this paper, we present ViewDF: a declarative framework for incremental maintenance of materialized views over append-only streaming data. The main component of the proposed framework is the View Delta Function (ViewDF), which declaratively specifies how to update a view when a new batch of data arrives. We describe and experimentally evaluate a prototype system based on this idea, which allo...
متن کاملافزایش سرعت نگهداری افزایشی دید با استفاده از الگوریتم فاخته
Data warehouse is a repository of integrated data that is collected from various sources. Data warehouse has a capability of maintaining data from various sources in its view form. So, the view should be maintained and updated during changes of sources. Since the increase in updates may cause costly overhead, it is necessary to update views with high accuracy. Optimal Delta Evaluation method is...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1712.04108 شماره
صفحات -
تاریخ انتشار 2017