reward packages

نتایج جستجو برای: reward packages

تعداد نتایج: 46029 فیلتر نتایج به سال:

Learning to Drive a Bicycle Using Reinforcement Learning and Shaping

1998

Jette Randløv Preben Alstrøm

We present and solve a real-world problem of learning to drive a bicycle. We solve the problem by online reinforcement learning using the Sarsa( )-algorithm. Then we solve the composite problem of learning to balance a bicycle and then drive to a goal. In our approach the reinforcement function is independent of the task the agent tries to learn to solve.

متن کامل

Self-Improving Factory Simulation using Continuous-time Average-Reward Reinforcement Learning

1997

Sridhar Mahadevan Tapas K. Das Abhijit Gosavi

Many factory optimization problems, from inventory control to scheduling and reliability , can be formulated as continuous-time Markov decision processes. A primary goal in such problems is to nd a gain-optimal policy that minimizes the long-run average cost. This paper describes a new average-reward algorithm called SMART for nd-ing gain-optimal policies in continuous time semi-Markov decision...

متن کامل

Using the probabilistic evaluation tool for the analytical solution of large Markov models

1995

Boudewijn R. Haverkort Aad P. A. van Moorsel

Introduction Stochastic Petri net based Markov modeling is a potentially very powerful and generic approach for evaluating the performance and depend ability of many di erent systems such as computer systems communication networks manufacturing sys tems etc As a consequence of their general appli cability SPN based Markov models form the basic solution approach for several software packages tha...

متن کامل

Procedural justice in children: Preschoolers accept unequal resource distributions if the procedure provides equal opportunities.

Journal: :Journal of experimental child psychology 2015

Patricia Grocke Federico Rossano Michael Tomasello

When it is not possible to distribute resources equitably to everyone, people look for an equitable or just procedure. In the current study, we investigated young children's sense of procedural justice. We tested 32 triads of 5-year-olds in a new resource allocation game. Triads were confronted with three unequal reward packages and then agreed on a procedure to allocate them among themselves. ...

متن کامل

Effective Strategies for Optimal Implementation of Evolution and Innovation Packages in Medical Education

ژورنال: دوفصلنامه آموزش پزشکی مرکز مطالعات و توسعه آموزش علوم پزشکی بابل 2020

, ,

ABSTRACT BACKGROUND AND OBJECTIVE: Evolution and innovation packages in medical science education are the main program of medical education and it is necessary to pay attention to the provision of infrastructure of their implementation. This study was conducted to identify effective strategies for optimal implementation of evolution and innovation packages in medical education. METHODS: The met...

متن کامل

Computer aided teaching packages.

Journal: :BMJ 1991

متن کامل

Low-serotonin levels increase delayed reward discounting in humans.

Journal: :The Journal of neuroscience : the official journal of the Society for Neuroscience 2008

Nicolas Schweighofer Mathieu Bertin Kazuhiro Shishida Yasumasa Okamoto Saori C Tanaka Shigeto Yamawaki Kenji Doya

Previous animal experiments have shown that serotonin is involved in the control of impulsive choice, as characterized by high preference for small immediate rewards over larger delayed rewards. Previous human studies under serotonin manipulation, however, have been either inconclusive on the effect on impulsivity or have shown an effect in the speed of action-reward learning or the optimality ...

متن کامل

Fuzzy decision making in testing hypotheses: An introduction to the packages ``FPV" and ``Fuzzy.p.value" with practical examples

Journal: Iranian Journal of Fuzzy Systems 2020

A. Parchami

This paper reviews and compares two R packages ``FPV" and ``Fuzzy.p.value".These packages are designed for testing hypotheses in a fuzzy environment using a fuzzy $p$-value based approach.In fact, the packages ``FPV" and ``Fuzzy.p.value" propose some useful functions for testing hypotheses when the data / hypotheses are fuzzy rather than crisp.The proposed methods and function...

متن کامل

The Pathology of Transformational Innovation Packages in Medical Education "A Qualitative Study"

ژورنال: پژوهش در آموزش علوم پزشکی 2019

Abbasian , H, Arasteh , HR, Ghorbandoost , R, Zeinabadi , HR ,

Introduction: The education field is one of the infrastructural fields of the health system and in order to evolving this field training of the human resources should be evolved. The evolution and innovation document is a special opportunity for education practitioners and universities' authorities to take a step towards the promotion of medical education in the country. Proper and timely patho...

متن کامل

CYPROS - Cybernetic Program Packages

Journal: :Modeling, Identification and Control: A Norwegian Research Bulletin 1980

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید