نتایج جستجو برای: bellman

تعداد نتایج: 4956  

2010
Odalric-Ambrym Maillard Rémi Munos Alessandro Lazaric Mohammad Ghavamzadeh

We consider the Bellman residual minimization approach for solving discounted Markov decision problems, where we assume that a generative model of the dynamics and rewards is available. At each policy iteration step, an approximation of the value function for the current policy is obtained by minimizing an empirical Bellman residual defined on a set of n states drawn i.i.d. from a distribution ...

2007
David Šiška

We study numerical approximations for the payoff function of the stochastic optimal stopping and control problem. It is known that the payoff function of the optimal stopping and control problem corresponds to the solution of a normalized Bellman PDE. The principal aim of this thesis is to study the rate at which finite difference approximations, derived from the normalized Bellman PDE, converg...

2006
DAVID WALDEN

In the spring of 2003, I began to wonder about the history of the Bellman-Ford algorithm [CLRS01] for finding shortest paths in a graph. In particular, I was interested in understanding when Bellman’s name and Ford’s name became jointly associated with the algorithm. To research these questions, I made trips to the BBN Technologies and MIT libraries, did Web searches, and sent emails to univers...

2000
Michael Malisoff Héctor J. Sussmann

We study the Bellman equation for undiscounted exit time optimal control problems for nonlinear systems and nonnegative instantaneous costs using the dynamic programming approach. Using viscosity solution theory, we prove a uniqueness theorem that characterizes the value functions for these problems as the unique viscosity solutions of the corresponding Bellman equations that satisfy appropriat...

2010
Taolue Chen Tingting Han Alexandru Mereacre

We propose a novel stochastic extension of timed automata, i.e. Markovian Timed Automata (MTA). We study the problem of optimizing the reachability probabilities in this model. Two variants are considered, namely, the timebounded and unbounded reachability. In each case, we propose Bellman equations to characterize the probability. For the former, we provide two approaches to solve the Bellman ...

2011
Miroslav Pǐstěk

In this article, an efficient algorithm for an optimal decision strategy approximation is introduced. The proposed approximation of the Bellman equation is based on HDMR technique. This non-parametric function approximation is used not only to reduce memory demands necessary to store Bellman function, but also to allow its fast approximate minimization. On that account, a clear connection betwe...

2014
Gaurav Hajela Manish Pandey

In this paper a hybrid implementation for Bellman-Ford to solve shortest path problems is proposed using OpenCL. Here first parallel implementation for Bellman-Ford for single source shortest path (SSSP) problem and all pair shortest path (APSP) are analyzed on CPU and GPU and based on this analysis work is divided among CPU and GPU and hybrid implementation is done. As proper resource utilizat...

Journal: :Proceedings of the National Academy of Sciences of the United States of America 1961
S Chandrasekhar J P Wright

1 Wald, A., Statistical Decision Functions (New York: John Wiley & Sons, 1950). 2 Wald, A., Sequential Analysis (New York: John Wiley & Sons, 1947). Bussgang, J. J., and D. Middleton, "Optimum sequential detection of signals in noise," IRE Trans. on Inform. Theory, IT-1, 5 (December) 1955. 4Middleton, D., An Introduction to Statistical Communication Theory (New York: McGrawHill, 1960), Chapter ...

Journal: :J. Discrete Algorithms 2017
Yefim Dinitz Rotem Itzhak

Consider the single-source cheapest paths problem in a digraph with negative edge costs allowed. A hybrid of Bellman-Ford and Dijkstra algorithms is suggested, improving the running time bound upon Bellman-Ford for graphs with a sparse distribution of negative cost edges. The algorithm iterates Dijkstra several times without reinitializing values d(v) at vertices. At most k+2 executions of Dijk...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید