Building workload-independent storage with VT-trees
نویسندگان
چکیده
As the Internet and the amount of data grows, the variability of data sizes grows too—from small MP3 tags to large VM images. With applications using increasingly more complex queries and larger data-sets, data access patterns have become more complex and randomized. Current storage systems focus on optimizing for one band of workloads at the expense of other workloads due to limitations in existing storage system data structures. We designed a novel workload-independent data structure called the VT-tree which extends the LSM-tree to efficiently handle sequential and file-system workloads. We designed a system based solely on VT-trees which offers concurrent access to data via file system and database APIs, transactional guarantees, and consequently provides efficient and scalable access to both large and small data items regardless of the access pattern. Our evaluation shows that our user-level system has 2–6.6× better performance for random-write workloads and only a small average overhead for other workloads.
منابع مشابه
The ν-stable Lévy Motion in Heavy-traffic Analysis of Queueing Models with Heavy-tailed Distributions
For the GI/G/1 queueing model with heavy-tailed serviceand arrival time distributions and traffic a < 1 the limiting distribution of the contracted actual waiting time ∆(a)w has been derived for ∆(a) ↓ 0 for a ↑ 1, see [2]. In the present study we consider the workload process {vt, t > 0}, when properly scaled, i.e. ∆(a)vτ/∆1(a) for a ↑ 1 with ∆1(a) = ∆(a)(1 − a). We further consider the noise ...
متن کاملBF-Tree: Approximate Tree Indexing
The increasing volume of time-based generated data and the shift in storage technologies suggest that we might need to reconsider indexing. Several workloads like social and service monitoring often include attributes with implicit clustering because of their time-dependent nature. In addition, solid state disks (SSD) (using flash or other low-level technologies) emerge as viable competitors of...
متن کاملDetermine the most suitable Allometric equations for Estimating Above-ground Biomass of the Juniperus excelsa
Today, modeling and determination of allometric equations of forest trees, especially Junipers trees, are very important for determination of biological status and carbon storage capacity of forest species. The aim of this study was to determine the most suitable allometric equations for estimating the biomass of leaf, sub branch, main branch, trunk, and biomass of total Juniperus excelsa tr...
متن کاملAssociation between different Non-Invasively Derived Thresholds with Lactate Threshold during graded incremental exercise
We compared lactate threshold (TLac) with non-invasive markers of an aerobic-anaerobic transition; namely, ventilatory (VT) and tissue saturation index (TSIT) thresholds. While identification of a breakpoint in blood lactate concentration ([BLa]) is common for determination of an aerobic-anaerobic transition, non-invasive measures, VT and NIRS, have also received attention as a means of determi...
متن کاملIntegrity Checking in Cryptographic File Systems with Constant Trusted Storage
In this paper we propose two new constructions for protecting the integrity of files in cryptographic file systems. Our constructions are designed to exploit two characteristics of many file-system workloads, namely low entropy of file contents and high sequentiality of file block writes. At the same time, our approaches maintain the best features of the most commonly used algorithm today (Merk...
متن کامل