markov decision process

نتایج جستجو برای: markov decision process

تعداد نتایج: 1627273 فیلتر نتایج به سال:

Solving Generalized Semi-Markov Decision Processes Using Continuous Phase-Type Distributions

2004

Håkan L. S. Younes Reid G. Simmons

We introduce the generalized semi-Markov decision process (GSMDP) as an extension of continuous-time MDPs and semi-Markov decision processes (SMDPs) for modeling stochastic decision processes with asynchronous events and actions. Using phase-type distributions and uniformization, we show how an arbitrary GSMDP can be approximated by a discrete-time MDP, which can then be solved using existing M...

متن کامل

optimum process adjustment under inspection errors with considering the cycle time of production and two markets for the sale of goods

Journal: :shiraz journal of system management 0

this paper is devoted to the study of determining optimal process mean in system production with the two markets for the sale of goods. in this paper, we developed an absorbing markov chain model in production systems where all items are inspected %100 for conformance with their specification limits. when the value of the quality characteristic of an item falls below a lower limit, the item is ...

متن کامل

An Exact Dynamic Programming Solution for a Decentralized Two-Player Markov Decision Process

2010

Jeff Wu Sanjay Lall

We present an exact dynamic programming solution for a finite-horizon decentralized two-player Markov decision process, where player 1 only has access to its own states, while player 2 has access to both player’s states but cannot affect player 1’s states. The solution is obtained by solving several centralized partially-observable Markov decision processes. We then conclude with several comput...

متن کامل

Abstractive Meeting Summarization as a Markov Decision Process

2015

Gabriel Murray

ive Meeting Summarization as a Markov Decision Process

متن کامل

Deciding the Value 1 Problem for $\sharp$ -acyclic Partially Observable Markov Decision Processes

2014

Hugo Gimbert Youssouf Oualhadj

The value 1 problem is a natural decision problem in algorithmic game theory. For partially observable Markov decision processes with reachability objective, this problem is defined as follows: are there observational strategies that achieve the reachability objective with probability arbitrarily close to 1? This problem was shown undecidable recently. Our contribution is to introduce a class o...

متن کامل

MDPs Semi - Markov decision processes Hidden Markov models Partially observable SMDPs Hierarchical HMMs

2007

Sridhar Mahadevan

متن کامل

Opponent Modeling in Scrabble

2007

Mark Richards Eyal Amir

Computers have already eclipsed the level of human play in competitive Scrabble, but there remains room for improvement. In particular, there is much to be gained by incorporating information about the opponent’s tiles into the decision-making process. In this work, we quantify the value of knowing what letters the opponent has. We use observations from previous plays to predict what tiles our ...

متن کامل

Towards a Meta Motion Planner B: Algorithm and Applications

2001

Amit Adam Ehud Rivlin Ilan Shimshoni

In ci componioiz paper [l] we have developed a frame,work fo,r rating 01‘ comparing navigation packages. For a giuen enuir.onment a navigation package consists of (I motion planner and a sensor to be used during nauigation. The cibility to rate or measure a navigation package is aniportant in order to address issues like sensor custom%zution for an environment and choice of a m.otion plarumr in...

متن کامل

Toward Optimal Pedagogical Action Patterns by Means of Partially Observable Markov Decision Process

2016

Manuel Mejía-Lavalle Hermilo Victorio Meza Alicia Martínez Rebollar Grigori Sidorov Luis Enrique Sucar Obdulia Pichardo-Lagunas

Good pedagogical actions are key components in all learning-teaching schemes. Automate that is an important Intelligent Tutoring Systems objective. We propose apply Partially Observable Markov Decision Process (POMDP) in order to obtain automatic and optimal pedagogical recommended action patterns in benefit of human students, in the context of Intelligent Tutoring System. To achieve that goal,...

متن کامل

Planning with Delayed State Information

2007

Sascha E. Engelbrecht Konstantinos V. Katsikopoulos

We consider a special case of partially observable Markov decision processes that arises when state information is perfect but arrives with a delay. We rst formulate the decision process in its standard form and derive the Bellman equation that corresponds to it. We then introduce a second decision process that has a much simpler Bellman equation than the rst, and is therefore, in general, much...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید