A fast point-based algorithm for POMDPs
نویسندگان
چکیده
We describe a point-based approximate value iteration algorithm for partially observable Markov decision processes. The algorithm performs value function updates ensuring that in each iteration the new value function is an upper bound to the previous value function, as estimated on a sampled set of belief points. A randomized belief-point selection scheme allows for fast update steps. Results indicate that the proposed algorithm achieves competitive performance, both in terms of solution quality
منابع مشابه
Applying Metric-Trees to Belief-Point POMDPs
Recent developments in grid-based and point-based approximation algorithms for POMDPs have greatly improved the tractability of POMDP planning. These approaches operate on sets of belief points by individually learning a value function for each point. In reality, belief points exist in a highly-structured metric simplex, but current POMDP algorithms do not exploit this property. This paper pres...
متن کاملA New Maximum Power Point Tracking Method for PEM Fuel Cells Based On Water Cycle Algorithm
Maximum Power Point (MPP) tracker has an important role in the performance of fuel cell (FC) systems improvement. Tow parameters which have effect on the Fuel cell output power are temperature and membrane water. So contents make the MPP change by with variations in each parameter. In this paper, a new maximum power point tracking (MPPT) method for Proton Exchange Membrane (PEM) fuel cell is pr...
متن کاملSolving POMDPs: RTDP-Bel vs. Point-based Algorithms
Point-based algorithms and RTDP-Bel are approximate methods for solving POMDPs that replace the full updates of parallel value iteration by faster and more effective updates at selected beliefs. An important difference between the two methods is that the former adopt Sondik’s representation of the value function, while the latter uses a tabular representation and a discretization function. The ...
متن کاملAnytime Point Based Approximations for Interactive POMDPs
Partially observable Markov decision processes (POMDPs) have been largely accepted as a rich-framework for planning and control problems. In settings where multiple agents interact POMDPs prove to be inadequate. The interactive partially observable Markov decision process (I-POMDP) is a new paradigm that extends POMDPs to multiagent settings. The added complexity of this model due to the modeli...
متن کاملPlanning under Uncertainty
In a fast point-based algorithm for POMDPs (see [Vlassis, Spaan]) the proposed algorithm is tested using various experiments. One of these experiments concerns the Tag domain as proposed in [Pineau, Gordon, Thrun]. The goal in this domain is to search for a moving robot and tag it. All the predator robot observes is its own square. We propose a method to increase this observation model, thereby...
متن کامل