Markov Decision Process

In this paper, we consider the problem of optimizing the quantiles of the cumulative rewards of Markov Decision Processes (MDP), to which we refers as Quantile Markov Decision Processes (QMDP). Traditionally, the goal of a Markov Decision Process (MDP) is to maximize expected cumulative reward over a defined horizon (possibly to be infinite). In many applications, however, a decision maker may ...

متن کامل

افزایش رضایتمندی مشتریان از محصولات سازمان به کمک ترکیب سه تکنیک anp ، qfd، bgp در محیط فازی

پایان نامه :وزارت علوم، تحقیقات و فناوری - دانشگاه یزد 1388

مریم ظریفیان, محمد باقر فخرزاد, حسن خادمی زارع,

this study considers the level of increase in customer satisfaction by supplying the variant customer requirements with respect to organizational restrictions. in this regard, anp, qfd and bgp techniques are used in a fuzzy set and a model is proposed in order to help the organization optimize the multi-objective decision-making process. the prioritization of technical attributes is the result ...

15 صفحه اول

Optimizing Red Blood Cells Consumption Using Markov Decision Process

Journal: Journal of Quality Engineering and Production Optimization 2019

Ali Ebadi Torkayesh, ensiyeh neishabouri jami, mahdi yousefi nejad attari, mehran khayat rasoli,

In healthcare systems, one of the important actions is related to perishable products such as red blood cells (RBCs) units that its consumption management in different periods can contribute greatly to the optimality of the system. In this paper, main goal is to enhance the ability of medical community to organize the RBCs units’ consumption in way to deliver the unit order timely with a focus ...

متن کامل

A MARKOV DECISION PROCESS WITH DISCOUNTED REWARDS

Journal: :Memoirs of the Faculty of Science, Kyushu University. Series A, Mathematics 1967

متن کامل

Controllable Summarization with Constrained Markov Decision Process

Journal: :Transactions of the Association for Computational Linguistics 2021

Abstract We study controllable text summarization, which allows users to gain control on a particular attribute (e.g., length limit) of the generated summaries. In this work, we propose novel training framework based Constrained Markov Decision Process (CMDP), conveniently includes reward function along with set constraints, facilitate better summarization control. The encourages generation res...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید