نتایج جستجو برای: internal rewards
تعداد نتایج: 248018 فیلتر نتایج به سال:
Abstract Policy gradient methods have become one of the most popular classes algorithms for multi-agent reinforcement learning. A key challenge, however, that is not addressed by many these credit assignment: assessing an agent’s contribution to overall performance, which crucial learning good policies. We propose a novel algorithm called Dr.Reinforce explicitly tackles this combining differenc...
Introduction: The existence of standard tools is one of the basic needs of scientists of healthy behavior for predicting health-related behaviors. The aim of the present study was to design a psychometrically sound instrument to measure the protection motivation theory constructs regarding self-medication for elderly Iranians. Methods: The study was conducted in spring 2016. The sample con...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید