نتایج جستجو برای: action value function
تعداد نتایج: 2342819 فیلتر نتایج به سال:
Mass Action in Cerebral Function: PROFESsoR K. S. Scientific Apparatus and Laboratory Methods: LASHLEY ..................................... 245 A New Singing Tube: DR. F. L. ROBESON. A Simple Microscope Eyepiece Pointer: JAMES A. Obituary: LOUNSBURY .......L..265 Ignatius Urban, Erik L. Ekman: PAUL C. STANDLEY. Recent Deaths ................ .................... 254 Special Articles: Observati...
The task consists of three states (first stage: sA; second stage: sB and sC), each with two actions (aA and aB). The goal of both the model-based and model-free subcomponents of the algorithm is to learn a state-action value function Q(s,a) mapping each state-action pair to its expected future value. On trial t, we denote the first-stage state (always sA) by s1,t, the second-stage state by s2,t...
Bacground and Objective: Foot orthoses are a common intervention for patients with patellofemoral pain syndrome but, limited information is available in the effects of foot orthoses on knee pain and function of athletes with patellofemoral pain syndrome. The aim of our study was to determinate the effects of foot orthoses on reducing pain and increasing function of athletes with patellofemoral ...
As an important approach to solving complex sequential decision problems, reinforcement learning (RL) has been widely studied in the community of artificial intelligence and machine learning. However, the generalization ability of RL is still an open problem and it is difficult for existing RL algorithms to solve Markov decision problems (MDPs) with both continuous state and action spaces. In t...
Value Engineering (VE) techniques based on function have been the means to improved products and processes for several decades. It is a social design methodology that is usually episodic in application and often confused with narrow interests, such as cost cutting. This paper addresses the role, or function, of VE in a larger model of design practice to give insight into its use, non-use and mi...
LVAD is a mechanical pump supporting a weak heart function and blood flow. Sometimes, the heart may not recover fast enough to take over the pumping action immediately after surgery, in such patients a temporary support device has been employed to maintain the pumping action until the patient’s own heart recovers. This device can be considered as a temporary alternative before the process of ar...
Classical work on eliciting and representing preferences over multi-attribute alternatives has attempted to recognize conditions under which value functions take on particularly simple and compact form, making their elicitation much easier. In this paper we consider preferences over discrete domains, and show that for a certain class of simple and intuitive qualitative preference statements, on...
eduction model. We postulate that this implementation can potentially ooer superior performance scalability when compared to the single-master implementations even when the granularity of parallelism is not very coarse. Immediate future work will focus on evaluating the performance scalability of the new implementationus-ing GLU applications with modest granularity of par-allelism. In the long ...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید