partially observable markov decision process

نتایج جستجو برای: partially observable markov decision process

تعداد نتایج: 1776231 فیلتر نتایج به سال:

Stratified breast cancer follow-up using a continuous state partially observable Markov decision process

Journal: :European Journal of Operational Research 2020

متن کامل

A Large-Scale Agent-Based Model of Taxpayer Reporting Compliance

Journal: :J. Artificial Societies and Social Simulation 2015

Kim M. Bloomquist Matt Koehler

This paper describes the development of the Individual Reporting Compliance Model (IRCM), an agent-based model for simulating tax reporting compliance in a community of 85,000 U.S. taxpayers. Design features include detailed tax return characteristics, taxpayer learning, social networks, and tax agency enforcement measures. The taxpayer's compliance reporting decision is modeled as a partially ...

متن کامل

Policy Search via Density Estimation

1999

Andrew Y. Ng Ronald Parr Daphne Koller

We propose a new approach to the problem of searching a space of stochastic controllers for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP). Following several other authors, our approach is based on searching in parameterized families of policies (for example, via gradient descent) to optimize solution quality. However, rather than trying to estimate th...

متن کامل

Learning Stationary Temporal Probabilistic Networks

1998

Daniel Nikovski

The paper describes a method for learning representations of partially observable Markov decision processes in the form of temporal probabilistic networks, which can subsequently be used by robotic agents for action planning and policy determination. A solution is provided to the problem of enforcing stationarity of the learned Markov model. Several preliminary experiments are described that co...

متن کامل

Interactively Learning Nonverbal Behavior for Inference and Production: A Machine Learning Approach

2014

Jin Joo Lee Cynthia Breazeal

By designing socially intelligent robots that can more effectively communicate and interact with us, we can increase their capacity to function as collaborative partners. Our research goal is to develop robots capable of engaging in nonverbal communication, which has been argued to be at the core of social intelligence. We take a human-centric approach that closely aligns with how people are th...

متن کامل

Optimal Control for Partially Observable Markov Decision Processes over an Infinite Horizon

2009

Katsushige Sawaki Akira Ichikawa A. Ichikawa

In this paper we consider an optimal control problem for partially observable Markov decision processes with finite states, signals and actions OVE,r an infinite horizon. It is shown that there are €optimal piecewise·linear value functions and piecl~wise-constant policies which are simple. Simple means that there are only finitely many pieces, each of which is defined on a convex polyhedral set...

متن کامل

Convergence of Probability Measures and Markov Decision Models with Incomplete Information

2015

Eugene A. Feinberg Pavlo O. Kasyanov Michael Z. Zgurovsky

This paper deals with three major types of convergence of probability measures on metric spaces: weak convergence, setwise convergence, and convergence in total variation. First, it describes and compares necessary and sufficient conditions for these types of convergence, some of which are well-known, in terms of convergence of probabilities of open and closed sets and, for the probabilities on...

متن کامل

Opportunistic Spectrum Access in Imperfect Spectrum Sensing Cognitive Networks

Journal: :JCM 2015

Yonghong Chen Huijian Wang Shibing Zhang

—Spectrum sensing strategy is key to realize cognitive radio. However, spectrum sensing error would affect the access strategy of secondary users in cognitive networks. This paper addresses the spectrum sensing strategy under imperfect spectrum sensing, and proposes opportunistic spectrum access strategies for the imperfect spectrum sensing and fading channels respectively. By setting the opti...

متن کامل

Active Feature Acquisition with POMDP Models

2006

Qi An Hui Li Xuejun Liao Lawrence Carin

We consider the problem of active feature acquisition (AFA), where the selection of a new feature is conditional on the instantiations of previously selected features. The problem is formulated as a partially observable Markov decision process (POMDP). We present a method to construct an approximate POMDP for the AFA problem and discuss its accuracy. We propose a non-stationary policy to improv...

متن کامل

A multi-objective constrained partially observable Markov decision process model for breast cancer screening

Journal: :Operational Research 2023

Breast cancer is a common and deadly disease, but it often curable when diagnosed early. While most countries have large-scale screening programs, there no consensus on single globally accepted guideline for breast screening. The complex nature of the disease; limited availability methods such as mammography, magnetic resonance imaging (MRI), ultrasound; public health policies all factor into d...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید