Coordinated Placement and Replacement for Large-Scale Distributed Caches

Authors

  • Madhukar R. Korupolu
  • Michael Dahlin
Abstract

In a large-scale information system such as a digital library or the web, a set of distributed caches can improve their effectiveness by coordinating their data placement decisions. Using simulation, we examine three practical cooperative placement algorithms including one that is provably close to optimal, and we compare these algorithms to the optimal placement algorithm and several cooperative and non-cooperative replacement algorithms. We draw five conclusions from these experiments: (1) cooperative placement can significantly improve performance compared to local replacement algorithms particularly when the size of individual caches is limited compared to the universe of objects; (2) although the amortizing placement algorithm is only guaranteed to be within 14 times the optimal, in practice it seems to provide an excellent approximation of the optimal; (3) in a cooperative caching scenario, the recent greedy-dual local replacement algorithm performs much better than the other local replacement algorithms; (4) our hierarchical-greedy-dual replacement algorithm yields further improvements over the greedy-dual algorithm especially when there are idle caches in the system; and (5) a key challenge to coordinated placement algorithms is generating good predictions of access patterns based on past accesses.
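
As an illustration of the greedy-dual local replacement policy named in conclusion (3), the following is a minimal sketch of the standard greedy-dual rule for a single cache. It is not taken from the paper: the class name `GreedyDualCache`, the `access` method, the unit-size objects, and the caller-supplied `cost` (for example, the cost of fetching the object from the nearest other copy) are all assumptions made for illustration.

```python
class GreedyDualCache:
    """Sketch of greedy-dual replacement for unit-size objects (assumed setup)."""

    def __init__(self, capacity):
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self.capacity = capacity
        # Running offset equal to the credit of the most recently evicted object.
        # Adding it to new credits is equivalent to subtracting the evicted
        # credit from every object that remains cached.
        self.inflation = 0.0
        self.credit = {}  # key -> current credit

    def access(self, key, cost):
        """Record an access to `key` with fetch cost `cost`; return True on a hit."""
        hit = key in self.credit
        if not hit and len(self.credit) >= self.capacity:
            # Evict the object with the smallest credit (ties: earliest inserted).
            victim = min(self.credit, key=self.credit.get)
            self.inflation = self.credit[victim]
            del self.credit[victim]
        # On a hit or a fresh insert, reset the credit to the full fetch cost.
        self.credit[key] = self.inflation + cost
        return hit
```

Under this rule an object that is expensive to re-fetch survives several evictions of cheaper objects, while any object that is not accessed again eventually becomes the minimum-credit victim as the offset grows.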

Similar resources

Coordinated resource scheduling in a large scale virtual power plant considering demand response and energy storages

Virtual power plant (VPP) is an effective approach to aggregate distributed generation resources under a central control. This paper introduces a mixed-integer linear programming model for optimal scheduling of the internal resources of a large scale VPP in order to maximize its profit. The proposed model studies the effect of a demand response (DR) program on the scheduling of the VPP. The pro...

Coordinated Placement and Replacement for Grid-Based Hierarchical Web Caches

Web caching has been well accepted as a viable method for saving network bandwidth and reducing user access latency. To provide cache sharing on a large scale, hierarchical web caching has been widely deployed to improve the scalability of content dissemination through the World Wide Web. In this paper, we present GHC, a grid-based hierarchical web caching architecture, for constructing efficie...

RCache: Design and Analysis of Scalable, Fault Tolerant Multimedia Stream Caching Schemes

In the current Internet, web content is increasingly being cached closer to the end-user to reduce network and web server load and therefore improve performance and user perceived quality. Existing web caching systems typically cache entire web documents and attempt to keep them consistent with the origin server. This approach works well for text and images; for bandwidth intensive multimedia d...

Dynamic Placement of Content Replicas in Distributed Multimedia Systems

We present in this article a multimedia content replica management system, able to satisfy users' constraints and preferences, while limiting the load of the servers and the proxy-caches. Our goal is to optimize the placement of a limited number of replicas via mechanisms of duplication and migration. Our system allows owners to keep control over their multimedia contents even with large scale ...

Adaptive Dynamic Data Placement Algorithm for Hadoop in Heterogeneous Environments

Hadoop MapReduce framework is an important distributed processing model for large-scale data intensive applications. The current Hadoop and the existing Hadoop distributed file system’s rack-aware data placement strategy in MapReduce in the homogeneous Hadoop cluster assume that each node in a cluster has the same computing capacity and the same workload is assigned to each node. Default Hadoop d...


Journal:
  • IEEE Trans. Knowl. Data Eng.

Volume 14, Issue 

Pages -

Publication date: 2002