Dynamic Markov Decision Policies for Delay Constrained Wireless Scheduling
Related resources
Constrained Markov Decision Process and Optimal Policies
In the course lectures, we discussed the unconstrained Markov Decision Process (MDP) at length, including its dynamic programming decomposition and optimal policies. In this report, we discuss a different model, the constrained MDP. There are many practical reasons to study constrained MDPs. For instance, in wireless sensor networks, each...
Non-randomized policies for constrained Markov decision processes
This paper addresses constrained Markov decision processes, with expected discounted total cost criteria, which are controlled by nonrandomized policies. A dynamic programming approach is used to construct optimal policies. The convergence of the series of finite horizon value functions to the infinite horizon value function is also shown. A simple example illustrating an application is presented.
Dynamic programming in constrained Markov decision processes
We consider a discounted Markov Decision Process (MDP) supplemented with the requirement that another discounted loss must not exceed a specified value, almost surely. We show that the problem can be reformulated as a standard MDP and solved using the Dynamic Programming approach. An example on a controlled queue is presented. In the last section, we briefly reinforce the connection of the Dyna...
Delay and rate constrained transmission policies over wireless channels
Abstract—In this paper, we study delay and rate-constrained transmission of bursty traffic over wireless channels. We characterize the minimum power requirements via bounds for both single user and multiuser downlink problems, using a class of randomized first-come first-serve policies. We show that larger tolerable delay leads to power reduction, even for single-user Gaussian channels; a sourc...
Constrained Markov Decision Processes
Preface: In many situations in the optimization of dynamic systems, a single utility for the optimizer might not suffice to describe the real objectives involved in the sequential decision making. A natural approach for handling such cases is that of optimization of one objective with constraints on other ones. This allows in particular to understand the tradeoff between t...
Journal
Journal title: IEEE Transactions on Automatic Control
Year: 2013
ISSN: 0018-9286,1558-2523
DOI: 10.1109/tac.2013.2256682