Robust and scalable content-and-structure indexing

نویسندگان

چکیده

Abstract Frequent queries on semi-structured hierarchical data are Content-and-Structure (CAS) that filter items based their location in the structure and value for some attribute. We propose Robust Scalable (RSCAS) index to efficiently answer CAS big data. To get an is robust against with varying selectivities, we introduce a novel dynamic interleaving merges path dimensions of composite keys balanced manner. store interleaved our trie-based RSCAS index, which supports wide range queries, including wildcards descendant axes. implement as log-structured merge tree scale it data-intensive applications high insertion rate. illustrate RSCAS’s robustness scalability by indexing from Software Heritage (SWH) archive, world’s largest, publicly available source code archive.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust Content-based Image Indexing

In this paper we present a robust information integration approach to identifying images of persons in large collections, such as the web. The underlying system relies on combining content analysis, which involves face detection and recognition, with context analysis which involves extraction of text or HTML features. Two aspects are explored to test the robustness of this approach: Sensitivity...

متن کامل

Content-Scalable Analysis for Video Indexing and Retrieval

Video content-scalability for video indexing and retrieval is proposed. Recently, the demand for content-based multimedia applications is increasing even beyond the capabilities of best-effort transmission networks. Therefore, the trend is toward constructing a content-oriented multimedia server that is capable of handling high volumes of content as well as of fulfilling high performance and va...

متن کامل

the underlying structure of language proficiency and the proficiency level

هدف از انجام این تخقیق بررسی رابطه احتمالی بین سطح مهارت زبان خارجی (foreign language proficiency) و ساختار مهارت زبان خارجی بود. تعداد 314 زبان آموز مونث و مذکر که عمدتا دانشجویان رشته های زبان انگلیسی در سطوح کارشناسی و کارشناسی ارشد بودند در این تحقیق شرکت کردند. از لحاظ سطح مهارت زبان خارجی شرکت کنندگان بسیار با هم متفاوت بودند، (75 نفر سطح پیشرفته، 113 نفر سطح متوسط، 126 سطح مقدماتی). کلا ...

15 صفحه اول

Content-based Watermarking for Indexing Using Robust Segmentation

In this paper, a novel approach to image indexing is presented using content-based watermarking. Some concepts associated with the application of watermarking to image indexing are discussed and a segmentation algorithm, appropriate for content-based watermarking, is presented. The segmentation algorithm is applied on reduced images and derives the exact same objects when performed on either th...

متن کامل

ShapeFit and ShapeKick for Robust, Scalable Structure from Motion

We introduce a new method for location recovery from pairwise directions that leverages an efficient convex program that comes with exact recovery guarantees, even in the presence of adversarial outliers. When pairwise directions represent scaled relative positions between pairs of views (estimated for instance with epipolar geometry) our method can be used for location recovery, that is the de...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: The Vldb Journal

سال: 2022

ISSN: ['0949-877X', '1066-8888']

DOI: https://doi.org/10.1007/s00778-022-00764-y