Search results for: s policy

Number of results: 960898

Journal: :Russia and America in the 21st Century 2019

Journal: :CoRR 2012
Thomas Degris Martha White Richard S. Sutton

This paper presents the first actor-critic algorithm for off-policy reinforcement learning. Our algorithm is online and incremental, and its per-time-step complexity scales linearly with the number of learned weights. Previous work on actor-critic algorithms is limited to the on-policy setting and does not take advantage of the recent advances in off-policy gradient temporal-difference learning....

Journal: :سیاست 0
جهانگیر کرمی, Assistant Professor, Department of Russian Studies, Faculty of World Studies, University of Tehran

From 2006 to 2008, the Russian government adopted an aggressive foreign policy toward the West, particularly the United States. In this article, the author claims that one should search for the main factor in the ‘feeling of danger on the European borders of Russia’ and, more clearly, in the ‘serious endangering of the European balance of power’ from the perspective of Russian authorities after 60 years. I have ...

Journal: :مطالعات اوراسیای مرکزی 0
حیبب اله ابوالحسن شیرازی, Associate Professor, Faculty of Political Science, Islamic Azad University, Central Tehran Branch; قدرت اله بهبودی نژاد, M.A. student in International Relations, Islamic Azad University, Central Tehran Branch

As Putin regained power in Russia, the situation gave the country a second chance to avoid both the West and the benefits of convergence with its foreign policy and regional and trans-regional equations. This can bring advantages from the other side. Russia's approach of divergence from, and independence of, the West will increase the level of Russian hegemony in the international arena. The main questi...

1999
Robert A.J. Dur

This paper offers an explanation for why policy makers stick to inefficient policy decisions. I argue that repealing a policy is a bad signal to voters about the policy maker’s competence if voters do not have complete knowledge about the effects of implemented policies. I derive the optimal policy maker’s decision on continuation of a policy, assuming that voters’ beliefs about the policy make...

2008
Aditya Jain Harry Groenevelt Nils Rudi

We study a stochastic inventory model of a firm that sources the product from a make-to-order manufacturer and can ship orders by a combination of two freight modes. The two freight modes differ in lead times, and each has a fixed and a quantity-proportional cost for each use. The ordering decisions are made periodically; however, the inventory holding and back-order penalty costs are incurre...

Journal: :سیاست 0
سعیده لطفیان حمید رهنورد

Six years after the terrorist attacks on the World Trade Center and the Pentagon on September 11, 2001, and the onset of the US-led war in Iraq and Afghanistan, in line with Bush's policy of regime change aimed at establishing pro-Western governments in Baghdad and Kabul, we are still witnessing widespread instability and a reign of terror in the region. Hasty executive branch anti-terrorism polic...
