Search results for: Markov Decision Process

Number of results: 1627273

Journal: RAIRO - Operations Research, 1980

Journal: Operations Research, 1992

Journal: JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES, 2020

Journal: Journal of the Operations Research Society of Japan, 1987

Journal: Journal of Korean Institute of Industrial Engineers, 2016

Journal: CoRR, 2017
Xiaocheng Li, Huaiyang Zhong, Margaret L. Brandeau

In this paper, we consider the problem of optimizing the quantiles of the cumulative rewards of Markov Decision Processes (MDP), which we refer to as Quantile Markov Decision Processes (QMDP). Traditionally, the goal of a Markov Decision Process (MDP) is to maximize the expected cumulative reward over a defined horizon (possibly infinite). In many applications, however, a decision maker may ...

Thesis: Ministry of Science, Research and Technology - Yazd University, 1388

This study considers the level of increase in customer satisfaction achieved by meeting varied customer requirements subject to organizational restrictions. In this regard, ANP, QFD, and BGP techniques are used in a fuzzy setting, and a model is proposed to help the organization optimize the multi-objective decision-making process. The prioritization of technical attributes is the result ...

In healthcare systems, one important activity concerns perishable products such as red blood cell (RBC) units, whose consumption management across periods can contribute greatly to the optimality of the system. In this paper, the main goal is to enhance the ability of the medical community to organize the consumption of RBC units so as to deliver unit orders on time, with a focus ...

Journal: Memoirs of the Faculty of Science, Kyushu University. Series A, Mathematics, 1967

Journal: Transactions of the Association for Computational Linguistics, 2021

We study controllable text summarization, which allows users to gain control over a particular attribute (e.g., length limit) of the generated summaries. In this work, we propose a novel training framework based on the Constrained Markov Decision Process (CMDP), which conveniently includes a reward function along with a set of constraints, to facilitate better summarization control. The encourages generation res...
