Thin-client Web access patterns: Measurements from a cache-busting proxy

نویسنده

  • Terence Kelly
چکیده

This paper describes a new technique for measuring Web client request patterns and analyzes a large client trace collected using the new method. In this approach a modified proxy intercepts requests and serves all responses to clients marked uncacheable, effectively disabling browser caches and allowing the proxy to record requests that would otherwise result in silent browser cache hits. WebTV Networks used a “cache-busting proxy” to collect an unusually large and detailed anonymized Web client trace in September 2000. It contains over 347 million requests for over 36 million documents by over 37,000 clients and spans 16 days. By most measures it is two orders of magnitude larger than existing Web client traces. We compare cache-busting proxies with conventional client instrumentation and use the WebTV trace to explore browser cache performance, reference locality, and document aliasing. We present the aggregate browser cache success function (hit rate vs. cache size) of the entire client population and discuss design implications for memoryand bandwidth-constrained Web clients. For the workload studied, eliminating redundant data transfers would increase browser cache hit rates by 35% to 45% over their current levels. A simple and practical technique for eliminating redundant transfers is described. Document sharing across client reference streams is so strong that the hit rate of a shared proxy cache could exceed 57% even if browser caches were infinitely large.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Rapid, Trace-Driven Simulation of the Performance of Web Caching Proxies

We have designed and validated a rapid, accurate simulation method for evaluating the performance of Web proxy cache replacement algorithm designs. We model the client-proxy-server system by combining a linear model of client-proxy response times with real measurements of proxy-server response times. We experimentally validate the model’s accuracy on the Apache proxy server. Our method should b...

متن کامل

بهینه‌سازی اجرا و پاسخ صفحات وب در فضای ابری با روش‌های پیش‌پردازش، مطالعه موردی سامانه‌های وارنیش و انجینکس

The response speed of Web pages is one of the necessities of information technology. In recent years, renowned companies such as Google and computer scientists focused on speeding up the web. Achievements such as Google Pagespeed, Nginx and varnish are the result of these researches. In Customer to Customer(C2C) business systems, such as chat systems, and in Business to Customer(B2C) systems, s...

متن کامل

Association Rule-Based Data Mining Agents for Personalized Web Caching

Proxy web caching is commonly implemented to decrease web access latency, internet bandwidth costs and origin web server load. We propose a transparent shareable proxy caching methodology in which the proxy caches maintain a continuously optimal performance with a significant improvement in the cache hit ratio without requiring any additional overhead at the client or at the routers. This appro...

متن کامل

A Top Approach to Prefetching on the Web

In the World Wide Web bottlenecks close to popular servers are very common These bottlenecks can be attributed to the servers lack of computing power and the network tra c induced by the increased number of access requests One way to eliminate these bottlenecks is through the use of caching However several recent studies suggest that the maximum hit rate achievable by any caching algorithm is j...

متن کامل

The Measured Access Characteristics of World-Wide-Web Client Proxy Caches

The growing popularity of the World Wide Web is placing tremendous demands on the Internet. A key strategy for scaling the Internet to meet these increasing demands is to cache data near clients and thus improve access latency and reduce network and server load. Unfortunately, research in this area has been hampered by a poor understanding of the locality and sharing characteristics of Web-clie...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computer Communications

دوره 25  شماره 

صفحات  -

تاریخ انتشار 2002