bellman

Finite-sample Analysis of Bellman Residual Minimization

2010

Odalric-Ambrym Maillard Rémi Munos Alessandro Lazaric Mohammad Ghavamzadeh

We consider the Bellman residual minimization approach for solving discounted Markov decision problems, where we assume that a generative model of the dynamics and rewards is available. At each policy iteration step, an approximation of the value function for the current policy is obtained by minimizing an empirical Bellman residual defined on a set of n states drawn i.i.d. from a distribution ...

متن کامل

Numerical Approximations of Stochastic Optimal Stopping and Control Problems

2007

David Šiška

We study numerical approximations for the payoff function of the stochastic optimal stopping and control problem. It is known that the payoff function of the optimal stopping and control problem corresponds to the solution of a normalized Bellman PDE. The principal aim of this thesis is to study the rate at which finite difference approximations, derived from the normalized Bellman PDE, converg...

متن کامل

The Bellman-ford Algorithm and “distributed Bellman-ford”

2006

DAVID WALDEN

In the spring of 2003, I began to wonder about the history of the Bellman-Ford algorithm [CLRS01] for finding shortest paths in a graph. In particular, I was interested in understanding when Bellman’s name and Ford’s name became jointly associated with the algorithm. To research these questions, I made trips to the BBN Technologies and MIT libraries, did Web searches, and sent emails to univers...

متن کامل

Was Darwin the Bellman

Journal: :Nature 1977

متن کامل

Further Results on the Bellman Equation for Optimal Control Problems with Exit Times and Nonnegative Instantaneous Costs

2000

Michael Malisoff Héctor J. Sussmann

We study the Bellman equation for undiscounted exit time optimal control problems for nonlinear systems and nonnegative instantaneous costs using the dynamic programming approach. Using viscosity solution theory, we prove a uniqueness theorem that characterizes the value functions for these problems as the unique viscosity solutions of the corresponding Bellman equations that satisfy appropriat...

متن کامل

Computing Maximum Reachability Probabilities in Markovian Timed Automata

2010

Taolue Chen Tingting Han Alexandru Mereacre

We propose a novel stochastic extension of timed automata, i.e. Markovian Timed Automata (MTA). We study the problem of optimizing the reachability probabilities in this model. Two variants are considered, namely, the timebounded and unbounded reachability. In each case, we propose Bellman equations to characterize the probability. For the former, we provide two approaches to solve the Bellman ...

متن کامل

Miroslav Pǐstěk Approximate Dynamic Programming based on High Dimensional Model Representation 2310 October 2011

2011

Miroslav Pǐstěk

In this article, an efficient algorithm for an optimal decision strategy approximation is introduced. The proposed approximation of the Bellman equation is based on HDMR technique. This non-parametric function approximation is used not only to reduce memory demands necessary to store Bellman function, but also to allow its fast approximate minimization. On that account, a clear connection betwe...

متن کامل

A Fine Tuned Hybrid Implementation for Solving Shortest Path Problems using Bellman Ford

2014

Gaurav Hajela Manish Pandey

In this paper a hybrid implementation for Bellman-Ford to solve shortest path problems is proposed using OpenCL. Here first parallel implementation for Bellman-Ford for single source shortest path (SSSP) problem and all pair shortest path (APSP) are analyzed on CPU and GPU and based on this analysis work is divided among CPU and GPU and hybrid implementation is done. As proper resource utilizat...

متن کامل

The Geodesics in Godel's Universe.

Journal: :Proceedings of the National Academy of Sciences of the United States of America 1961

S Chandrasekhar J P Wright

1 Wald, A., Statistical Decision Functions (New York: John Wiley & Sons, 1950). 2 Wald, A., Sequential Analysis (New York: John Wiley & Sons, 1947). Bussgang, J. J., and D. Middleton, "Optimum sequential detection of signals in noise," IRE Trans. on Inform. Theory, IT-1, 5 (December) 1955. 4Middleton, D., An Introduction to Statistical Communication Theory (New York: McGrawHill, 1960), Chapter ...

متن کامل

Hybrid Bellman-Ford-Dijkstra algorithm

Journal: :J. Discrete Algorithms 2017

Yefim Dinitz Rotem Itzhak

Consider the single-source cheapest paths problem in a digraph with negative edge costs allowed. A hybrid of Bellman-Ford and Dijkstra algorithms is suggested, improving the running time bound upon Bellman-Ford for graphs with a sparse distribution of negative cost edges. The algorithm iterates Dijkstra several times without reinitializing values d(v) at vertices. At most k+2 executions of Dijk...

متن کامل