RLQ: Workload Allocation With Reinforcement Learning in Distributed Queues

نویسندگان

چکیده

Distributed workload queues are nowadays widely used due to their significant advantages in terms of decoupling, resilience, and scaling. Task allocation worker nodes distributed queue systems is typically simplistic (e.g., Least Recently Used) or uses hand-crafted heuristics that require task-specific information task resource demands expected time execution). When such not available node capabilities homogeneous, the existing placement strategies may lead unnecessarily large execution timings usage costs. In this work, we formulate problem Markov Decision Process framework, which an agent assigns tasks resource, receives a numerical reward signal upon completion. Our adaptive learning-based solution, Reinforcement Learning based Queues ( xmlns:xlink="http://www.w3.org/1999/xlink">RLQ ), implemented integrated with popular Celery queuing system for Python. We compare against traditional solutions using both synthetic real traces. On average, workloads, reduces cost by approximately 70%, factor at least 3×, waiting almost 7×. Using traces, observe improvement about 20% cost, around 70% time, reduction 20× time. also strategy inspired E-PVM, state-of-the-art solution Google's Borg cluster manager, showing able outperform it five out six scenarios.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multicast Routing in Wireless Sensor Networks: A Distributed Reinforcement Learning Approach

Wireless Sensor Networks (WSNs) are consist of independent distributed sensors with storing, processing, sensing and communication capabilities to monitor physical or environmental conditions. There are number of challenges in WSNs because of limitation of battery power, communications, computation and storage space. In the recent years, computational intelligence approaches such as evolutionar...

متن کامل

Temporal-Difference Reinforcement Learning with Distributed Representations

Temporal-difference (TD) algorithms have been proposed as models of reinforcement learning (RL). We examine two issues of distributed representation in these TD algorithms: distributed representations of belief and distributed discounting factors. Distributed representation of belief allows the believed state of the world to distribute across sets of equivalent states. Distributed exponential d...

متن کامل

Reinforcement Learning for DynamicChannel Allocation in Cellular

In cellular telephone systems, an important problem is to dynamically allocate the communication resource (channels) so as to maximize service in a stochastic caller environment. This problem is naturally formulated as a dynamic programming problem and we use a reinforcement learning (RL) method to nd dynamic channel allocation policies that are better than previous heuristic solutions. The pol...

متن کامل

A Framework for workload allocation in distributed transaction processing systems

Ever-increasing demands for high transaction rates, limitations of high-end processors, high availability, and modular growth considerations are all driving forces toward distributed architectures for transaction processing. However, a prerequisite to taking advantage of the capacity of a distributed transaction processing system is an effective strategy for workload allocation. The distributio...

متن کامل

Online Resource Allocation Using Decompositional Reinforcement Learning

This paper considers a novel application domain for reinforcement learning: that of “autonomic computing,” i.e. selfmanaging computing systems. RL is applied to an online resource allocation task in a distributed multi-application computing environment with independent time-varying load in each application. The task is to allocate servers in real time so as to maximize the sum of performance-ba...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Parallel and Distributed Systems

سال: 2023

ISSN: ['1045-9219', '1558-2183', '2161-9883']

DOI: https://doi.org/10.1109/tpds.2022.3231981