average processes

نتایج جستجو برای: average processes

تعداد نتایج: 887356 فیلتر نتایج به سال:

Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes

Journal: :Discrete Event Dynamic Systems 2007

Mohammed Shahid Abdulla Shalabh Bhatnagar

This article proposes several two-timescale simulation-based actor-critic algorithms for solution of infinite horizon Markov Decision Processes with finite state-space under the average cost criterion. Two of the algorithms are for the compact (non-discrete) action setting while the rest are for finite-action spaces. On the slower timescale, all the algorithms perform a gradient search over cor...

متن کامل

Learning Algorithms for Markov Decision Processes with Average Cost

Journal: :SIAM J. Control and Optimization 2001

Jinane Abounadi Dimitri P. Bertsekas Vivek S. Borkar

This paper gives the first rigorous convergence analysis of analogs of Watkins’ Q-learning algorithm, applied to average cost control of finite-state Markov chains. We discuss two algorithms which may be viewed as stochastic approximation counterparts of two existing algorithms for recursively computing the value function of average cost problem the traditional relative value iteration algorith...

متن کامل

Risk-Sensitive and Average Optimality in Markov Decision Processes

2012

Karel Sladký

Abstract. This contribution is devoted to the risk-sensitive optimality criteria in finite state Markov Decision Processes. At first, we rederive necessary and sufficient conditions for average optimality of (classical) risk-neutral unichain models. This approach is then extended to the risk-sensitive case, i.e., when expectation of the stream of one-stage costs (or rewards) generated by a Mark...

متن کامل

On covariance coefficients estimates of finite order moving average processes

Journal: :Kybernetika 1981

Emil Pelikán Miloslav Vosvrda

In the present paper the necessary and sufficient conditions for the estimates of covariance coefficients of moving average processes are presented. Further the modification for estimates of Wilson's method covariance coefficients is introduced.

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید

Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes

Learning Algorithms for Markov Decision Processes with Average Cost

Risk-Sensitive and Average Optimality in Markov Decision Processes

On covariance coefficients estimates of finite order moving average processes

Average optimality for continuous-time Markov decision processes with a policy iteration approach

The effect of memory on functional large deviations of infinite moving average processes

Damage Detection in Plate Structures Based on Space-time Autoregressive Moving Average Processes

Bootstrapping Autoregressive and Moving Average Parameter Estimates of Infinite Order Vector Autoregressive Processes

Estimation and control in finite Markov decision processes with the average reward criterion

Average Optimal Stationary Policies and Linear Programming in Countable Space Markov Decision Processes