sequential decision making

Distributionally Robust Optimization for Sequential Decision Making

Journal: :CoRR 2018

Zhi Chen Pengqian Yu William B. Haskell

The distributionally robust Markov Decision Process approach has been proposed in the literature, where the goal is to seek a distributionally robust policy that achieves the maximal expected total reward under the most adversarial joint distribution of uncertain parameters. In this paper, we study distributionally robust MDP where ambiguity sets for uncertain parameters are of a format that ca...

متن کامل

Scalable Algorithms for Multiagent Sequential Decision Making

2015

Ekhlas Sonu

Introduction In artificial intelligence, decision theory deals with computing a sequence of actions (policy) that an autonomous agent must take in order to optimize its rewards (obtain its goals in the most efficient manner). In many real world situation , an autonomous agent must deal with various sources of uncertainty while computing its optimal policy. In single agent settings, such decisio...

متن کامل

Data Generation as Sequential Decision Making

2015

Philip Bachman Doina Precup

We connect a broad class of generative models through their shared reliance on sequential decision making. Motivated by this view, we develop extensions to an existing model, and then explore the idea further in the context of data imputation – perhaps the simplest setting in which to investigate the relation between unconditional and conditional generative modelling. We formulate data imputati...

متن کامل

Sequential decision making with untrustworthy service providers

2008

W. T. Luke Teacy Georgios Chalkiadakis Alex Rogers Nicholas R. Jennings

In this paper, we deal with the sequential decision making problem of agents operating in computational economies, where there is uncertainty regarding the trustworthiness of service providers populating the environment. Specifically, we propose a generic Bayesian trust model, and formulate the optimal Bayesian solution to the exploration-exploitation problem facing the agents when repeatedly i...

متن کامل

Focus of Attention in Sequential Decision Making

2004

Lihong Li Vadim Bulitko Russell Greiner

We investigate the problem of using function approximation in reinforcement learning (RL) where the agent’s control policy is represented as a classifier mapping states to actions. The innovation of this paper lies with introducing a measure of state’s decision-making importance. We then use an efficient approximation to this measure as misclassification costs in learning the agent’s policy. As...

متن کامل

Sequential Decision Making with Adaptive Utility

2006

Brett Houlding

Decision making with adaptive utility provides a generalisation to classical Bayesian decision theory, allowing the creation of a normative theory for decision selection when preferences are initially uncertain. The theory of adaptive utility was introduced by Cyert & DeGroot [27], but had since received little attention or development. In particular, foundational issues had not been explored a...

متن کامل

Fairness in Multi-Agent Sequential Decision-Making

2014

Chongjie Zhang Julie A. Shah

We define a fairness solution criterion for multi-agent decision-making problems, where agents have local interests. This new criterion aims to maximize the worst performance of agents with a consideration on the overall performance. We develop a simple linear programming approach and a more scalable game-theoretic approach for computing an optimal fairness policy. This game-theoretic approach ...

متن کامل

Group Decision-Making Models for Sequential Tasks

Journal: :SIAM Review 2012

Margot Kimura Jeff Moehlis

The sequential probability ratio test (SPRT) and related drift-diffusion model (DDM) are optimal for choosing between two hypotheses using the minimal (average) number of samples and relevant for modeling the decision-making process in human observers. This work extends these models to group decision making. Previous works have focused almost exclusively on group accuracy; here, we explicitly a...

متن کامل

Losing a dime with a satisfied mind: positive affect predicts less search in sequential decision making.

Journal: :Psychology and aging 2012

Bettina von Helversen Rui Mata

We investigated the contribution of cognitive ability and affect to age differences in sequential decision making by asking younger and older adults to shop for items in a computerized sequential decision-making task. Older adults performed poorly compared to younger adults partly due to searching too few options. An analysis of the decision process with a formal model suggested that older adul...

متن کامل

sequential and dynamic decisions on sales price and accepting the customer’s demand by markov decision process

Journal: :مهندسی صنایع 0

مهدی احمدی دانشجوی دکتری دانشکدة مهندسی صنایع، دانشگاه صنعتی شریف حسن شوندی دانشیار دانشکدة مهندسی صنایع، دانشگاه صنعتی شریف

in this article, decisions about price and stock allocation for a seller with multiple customer classes are analyzed. with each customer arrival, the seller needs to decide about accepting or rejecting the customer’s demand by considering the stock on hand. in the case of acceptance, one needs to decide about the selling price. after any change in the inventory level, decision about continuing ...

متن کامل