Prefix Path Streaming: a New Clustering Method for XML Twig Pattern Matching
نویسندگان
چکیده
Searching for all occurrences of a twig pattern in a XML document is an important operation in XML query processing. Recently a class of holistic twig pattern matching algorithms has been proposed. Compared with the prior approaches, the holistic method avoids generating large intermediate results which do not contribute to the final answer. The method is CPU and I/O optimal when twig patterns only have ancestor-descendant relationships.The holistic twig-pattern matching method proposed earlier [1] operates on element streams which cluster all XML elements with the same tag name together. In this paper we introduce a clustering method called Prefix Path Streaming (PPS) and new holistic twig pattern matching algorithms based on PPS. PPS clusters elements of XML documents according to the paths from root to the elements. This clustering approach avoids unnecessary scanning of irrelevant portion of XML documents.More importantly, we develop optimal algorithms based on PPS streaming which can process a large class of twig patterns consisting of both ancestor-descendant and parent-child relationships.
منابع مشابه
Evaluation of Twig Pattern Queries for Streaming Xml Data Using Lineage Encoding
In this paper, we propose an energy and latency efficient XML dissemination scheme for the mobile computing. It describes a novel unit structure called G-node for streaming XML data in the wireless environment. It exploits the profit of the structure indexing and attributes summarization that may integrate relevant XML elements into a group. It provides a way for choosy access of their attribut...
متن کاملXML Dissemination Scheme for Mobile Computing Based on Lineage Encoding
In wireless environments, broadcasting is an efficient and scalable method to broadcast information to a massive number of clients. We propose an energy and latency efficient XML dissemination scheme for the wireless mobile computing environments. This paper presents a novel unit structure called G-node for streaming XML data in the wireless system. It applies the benefits of the structure inde...
متن کاملDissemination of Xml Data in Wireless Environment Supporting Twig Pattern Queries
The main aim of this paper is to improve energy and latency efficiency of XML dissemination scheme for the mobile computing, which is based on Lineage Encoding, G-node and scheduling algorithm for streaming XML data in the wireless environment. In this paper we propose a new broadcasting scheduling algorithm Frequently Access First (FAF) which effectively organize XML data on wireless channels....
متن کاملStreamTX: extracting tuples from streaming XML data
We study the problem of extracting flattened tuple data from streaming, hierarchical XML data. Tuple-extraction queries are essentially XML pattern queries with multiple extraction nodes. Their typical applications include mapping-based XML transformation and integrated (set-based) processing of XML and relational data. Holistic twig joins are known for the optimal matching of XML pattern queri...
متن کاملAn Well-Organised Wireless XML Streaming Supporting Twig Pattern Queries using Lineage Encoding
In this paper, we propose an energy and latency efficient XML dissemination scheme for the mobile computing. It describes a novel unit structure called Gnode for streaming XML data in the wireless environment. It exploits the profit of the structure indexing and attributes summarization that may integrate relevant XML elements into a group. It provides a way for choosy access of their attribute...
متن کامل