Kraken: Leveraging Live Traffic Tests to Identify and Resolve Resource Utilization Bottlenecks in Large Scale Web Services

نویسندگان

  • Kaushik Veeraraghavan
  • Justin Meza
  • David Chou
  • Wonho Kim
  • Sonia Margulis
  • Scott Michelson
  • Rajesh Nishtala
  • Daniel Obenshain
  • Dmitri Perelman
  • Yee Jiun Song
چکیده

Modern web services such as Facebook are made up of hundreds of systems running in geographicallydistributed data centers. Each system needs to be allocated capacity, configured, and tuned to use data center resources efficiently. Keeping a model of capacity allocation current is challenging given that user behavior and software components evolve constantly. Three insights motivate our work: (1) the live user traffic accessing a web service provides the most current target workload possible, (2) we can empirically test the system to identify its scalability limits, and (3) the user impact and operational overhead of empirical testing can be largely eliminated by building automation which adjusts live traffic based on feedback. We build on these insights in Kraken, a new system that runs load tests by continually shifting live user traffic to one or more data centers. Kraken enables empirical testing by monitoring user experience (e.g., latency) and system health (e.g., error rate) in a feedback loop between traffic shifts. We analyze the behavior of individual systems and groups of systems to identify resource utilization bottlenecks such as capacity, load balancing, software regressions, performance tuning, and so on, which can be iteratively fixed and verified in subsequent load tests. Kraken, which manages the traffic generated by 1.7 billion users, has been in production at Facebook for three years and has allowed us to improve our hardware utilization by over 20%.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance Measurement and Improvement of Healthcare Service Using Discrete Event Simulation in Bahir Dar Clinic

This paper deals with the service performance analysis and improvement using discrete event simulation has been used. The simulation of the health care has been done by arena master development 14-version software. The performance measurement for this study are patients output, service rate, service efficiency and it is directly related to waiting time of patients in each service station, work ...

متن کامل

Semantic Constraint and QoS-Aware Large-Scale Web Service Composition

Service-oriented architecture facilitates the running time of interactions by using business integration on the networks. Currently, web services are considered as the best option to provide Internet services. Due to an increasing number of Web users and the complexity of users’ queries, simple and atomic services are not able to meet the needs of users; and to provide complex services, it requ...

متن کامل

Integrating Engineering Web Services with Distributed Data Flows and Mobile Classes

This paper describes a software paradigm for composing engineering web services. Software modules have been designed and implemented to facilitate the construction and integration of web services. The paradigm employs a distributed data flow approach that supports direct data exchanges among web services, and thus avoid many performance bottlenecks attending centralized processing. The performa...

متن کامل

RDF Ontology (Re-)Engineering through Large-scale Data Mining

As Linked Open Data originates from various sources, leveraging well-defined ontologies aids integration. However, oftentimes the utilization of RDF vocabularies by data publishers differs from the intended application envisioned by ontology engineers. Especially in largescale datasets as presented in the Billion Triple Challenge a significant divergence between vocabulary specification and usa...

متن کامل

A procedure for Web Service Selection Using WS-Policy Semantic Matching

In general, Policy-based approaches play an important role in the management of web services, for instance, in the choice of semantic web service and quality of services (QoS) in particular. The present research work illustrates a procedure for the web service selection among functionality similar web services based on WS-Policy semantic matching. In this study, the procedure of WS-Policy publi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016