partially observable markov decision process

Dynamic Decision Making in Stochastic

1997

Milos Hauskrecht

The focus of this paper is the framework of partially observable Markov decision processes (POMDPs) and its role in modeling and solving complex dynamic decision problems in stochastic and partially observable medical domains. The paper summarizes some of the basic features of the POMDP framework and explores its potential in solving the problem of the management of the patient with chronic isc...

متن کامل

Learning policies for partially observable environments : Scaling upMichael

1995

Michael L. Littman Anthony R. Cassandra Leslie Pack Kaelbling

Partially observable Markov decision processes (pomdp's) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor feedback. While the study of pomdp's is motivated by a need to address realistic problems , existing techniques for nding optimal behavior do not appear to scale well and have been unable to nd satisfactory policies for proble...

متن کامل

Speeding up Online POMDP Planning - Unification of Observation Branches by Belief-state Compression Via Expected Feature Values

2015

Gavin Rens

A novel algorithm to speed up online planning in partially observable Markov decision processes (POMDPs) is introduced. I propose a method for compressing nodes in beliefdecision-trees while planning occurs. Whereas belief-decision-trees branch on actions and observations, with my method, they branch only on actions. This is achieved by unifying the branches required due to the nondeterminism o...

متن کامل

Dynamic Decision Making in Stochastic Partially Observable Medical Domains: Ischemic Heart Disease Example

2004

Milos Hauskrecht

The focus of this paper is the framework of partially observable Markov decision processes (POMDPs) and its role in modeling and solving complex dynamic decision problems in stochastic and partially observable medical domains. The paper summarizes some of the basic features of the POMDP framework and explores its potential in solving the problem of the management of the patient with chronic isc...

متن کامل

Partially Observable Sequential Decision Making for Problem Selection in an Intelligent Tutoring System

2011

Emma Brunskill Stuart J. Russell

A key part of effective teaching is adaptively selecting pedagogical activities to maximize long term student learning. In this poster we report on ongoing work to both develop a tutoring strategy that leverages insights from the partially observable Markov decision process (POMDP) framework to improve problem selection relative to state-of-the-art intelligent tutoring systems, and evaluate the...

متن کامل

Hybrid POMDP Algorithms

2006

Sébastien Paquet Brahim Chaib-draa Stéphane Ross

When an agent evolves in a partially observable environment, it has to deal with uncertainties when choosing its actions. An efficient model for such environments is to use partially observable Markov decision processes (POMDPs). Many algorithms have been developed for POMDPs. Some use an offline approach, learning a complete policy before the execution. Others use an online approach, construct...

متن کامل

Learning Policies for Partially Observable Environments: Scaling Up

1995

Michael L. Littman Anthony R. Cassandra Leslie Pack Kaelbling

Partially observable Markov decision processes (pomdp's) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor feedback. While the study of pomdp's is motivated by a need to address realistic problems, existing techniques for nding optimal behavior do not appear to scale well and have been unable to nd satisfactory policies for problem...

متن کامل

A Decision-Theoretic Approach to Task Assistance for Persons with Dementia

2005

Jennifer Boger Pascal Poupart Jesse Hoey Craig Boutilier Geoff Fernie Alex Mihailidis

Cognitive assistive technologies that aid people with dementia (such as Alzheimer’s disease) hold the promise to provide such people with an increased level of independence. However, to realize this promise, such systems must account for the specific needs and preferences of individuals. We argue that this form of customization requires a sequential, decision-theoretic model of interaction. We ...

متن کامل

Learning Mazes with Aliasing States: An LCS Algorithm with Associative Perception

Journal: :Adaptive Behaviour 2009

Zhanna V. Zatuchna Anthony J. Bagnall

Maze problems represent a simplified virtual model of real environments that can be used for developing core algorithms of many real-world application related to the problem of navigation. However, the best achievements of Learning Classifier Systems (LCS) in maze problems are still mostly bounded to non-aliasing environments, while LCS complexity seems to obstruct a proper analysis of the reas...

متن کامل

Optimization of Prostate Biopsy Referral Decisions

Journal: :Manufacturing & Service Operations Management 2012

Jingyu Zhang Brian T. Denton Hari Balasubramanian Nilay D. Shah Brant A. Inman

Prostate cancer is the most common solid tumor in American men and is screened for using prostate-specific antigen (PSA) tests. We report on a non-stationary partially observable Markov decision process (POMDP) for prostate biopsy referral decisions. The core states are the patients’ prostate cancer related health states, and PSA test results are the observations. Transition probabilities and r...

متن کامل