stochastic shortest path

نتایج جستجو برای: stochastic shortest path

تعداد نتایج: 272650 فیلتر نتایج به سال:

The Stochastic Shortest Path Problem : A polyhedral combinatorics perspective

Journal: :CoRR 2017

Matthieu Guillot Gautier Stauffer

In this paper, we give a new framework for the stochastic shortest path problem in finite state and action spaces. Our framework generalizes both the frameworks proposed by Bertsekas and Tsitsiklis [7] and by Bertsekas and Yu [8]. We prove that the problem is well-defined and (weakly) polynomial when (i) there is a way to reach the target state from any initial state and (ii) there is no transi...

متن کامل

Proper Policies in Infinite-State Stochastic Shortest Path Problems

2017

Dimitri P. Bertsekas

We consider stochastic shortest path problems with infinite state and control spaces, a nonnegative cost per stage, and a termination state. We extend the notion of a proper policy, a policy that terminates within a finite expected number of steps, from the context of finite state space to the context of infinite state space. We consider the optimal cost function J*, and the optimal cost functi...

متن کامل

A Framework to Recover Single Link Failure on Shortest Path in Shortest Path Tree

Journal: :Journal of Applied Sciences 2011

متن کامل

In Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence ( UAI - 95 ) , Montreal , Quebec , Canada , August 18 - 20 , 1995 Path Planning under Time - Dependent

1995

Michael P. Wellman Matthew Ford Kenneth Larson

Standard algorithms for nding the shortest path in a graph require that the cost of a path be additive in edge costs, and typically assume that costs are determinis-tic. We consider the problem of uncertain edge costs, with potential probabilistic dependencies among the costs. Although these dependencies violate the standard dynamic-programming decomposition, we identify a weaker stochastic con...

متن کامل

An Online Convergent Q-learning Algorithm with Linear Function Approximation

2014

Shalabh Bhatnagar

We present in this article a variant of Q-learning with linear function approximation that is based on two-timescale stochastic approximation. Whereas it is difficult to prove convergence of regular Q-learning with linear function approximation because of the off-policy problem, we prove that our algorithm is convergent. Numerical results on a multi-stage stochastic shortest path problem show t...

متن کامل

Time-dependent Stochastic Shortest Path(s) Algorithms for a Scheduled Transportation Network

2005

QIUJIN WU JOANNA HARTLEY DAVID AL-DABASS

Following on from our work concerning travellers’ preferences in public transportation networks (Wu and Hartley, 2004), we introduce the concept of stochasticity to our algorithms. Stochasticity greatly increases the complexity of the route finding problem, so greater algorithmic efficiency becomes imperative. Public transportation networks (buses, trains) have two important features: edges can...

متن کامل

Reinforcement Learning for Average Reward Zero-Sum Games

2004

Shie Mannor

We consider Reinforcement Learning for average reward zerosum stochastic games. We present and analyze two algorithms. The first is based on relative Q-learning and the second on Q-learning for stochastic shortest path games. Convergence is proved using the ODE (Ordinary Differential Equation) method. We further discuss the case where not all the actions are played by the opponent with comparab...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید

The Stochastic Shortest Path Problem : A polyhedral combinatorics perspective

Proper Policies in Infinite-State Stochastic Shortest Path Problems

A Framework to Recover Single Link Failure on Shortest Path in Shortest Path Tree

In Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence ( UAI - 95 ) , Montreal , Quebec , Canada , August 18 - 20 , 1995 Path Planning under Time - Dependent

An Online Convergent Q-learning Algorithm with Linear Function Approximation

Time-dependent Stochastic Shortest Path(s) Algorithms for a Scheduled Transportation Network

Reinforcement Learning for Average Reward Zero-Sum Games

Towards Online Shortest Path Computation

Parallel shortest path auction algorithms

ON ASSOCIATIVE SHORTEST PATH PROBLEMS