نتایج جستجو برای: markov decision process

تعداد نتایج: 1627273  

2017
GARY J. KOEHLER

— In this paper we present a generalized Markov décision process that subsumes the traditional discounted, infinité horizon, finite state and action Markov décision process, VeinotCs discountéd décision processes, and Koehler's generalization of these two problem classes. Résumé. — Nous présentons dans cet article un processus de Markov généralisé qui englobe le processus de décision markovien ...

Journal: :Journal of the Brazilian Computer Society 2015

Journal: :Emerging Information Science and Technology 2023

Class learning is a teaching and activity involving both teachers students. Students in the class have different levels of intelligence, divided into three categories: lowest, average, most intelligent. Teachers usually pay less attention to intelligent Consider student who not enough remain school. However, if smart required repeat content he already comprehends, will become bored. Therefore, ...

Journal: :Mathematics 2023

We propose a Markov Decision Process Model that blends ideas from Psychological research and Economics to study decision-making in individuals with self-control problems. have borrowed dual-process of self-awareness research, we introduce present bias inter-temporal preferences, phenomenon widely explored Economics. allow for both an exogenous endogenous, state-dependent, explore, by means nume...

Journal: :IISE transactions 2023

In this paper, we present a Distributionally Robust Markov Decision Process (DRMDP) approach for addressing the dynamic epidemic control problem. The Susceptible-Exposed-Infectious-Recovered (SEIR) model is widely used to represent stochastic spread of infectious diseases, such as COVID-19. While Processes (MDP) offers mathematical framework identifying optimal actions, vaccination and transmis...

Fallahnezhad, Niaki,

  A novel optimal single machine replacement policy using a single as well as a two-stage decision making process is proposed based on the quality of items produced. In a stage of this policy, if the number of defective items in a sample of produced items is more than an upper threshold, the machine is replaced. However, the machine is not replaced if the number of defective items is less than ...

Journal: :Journal of Quantitative Analysis in Sports 2016

Journal: :Siam Journal on Control and Optimization 2021

Related DatabasesWeb of Science You must be logged in with an active subscription to view this.Article DataHistorySubmitted: 20 April 2020Accepted: 03 February 2021Published online: 29 2021KeywordsMarkov decision process, partial observation, long-run average payoffAMS Subject Headings90C39, 90C40, 37A50, 60J20Publication DataISSN (print): 0363-0129ISSN (online): 1095-7138Publisher: Society for...

Journal: :Bulletin of informatics and cybernetics 1997

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید