نتایج جستجو برای: action value function

تعداد نتایج: 2342819  

Journal: :Games and Economic Behavior 2004
John Morgan Felix Várdy

We report on experiments examining the value of commitment in Stackelberg games where the follower chooses whether to pay some cost to perfectly observe the leader’s action. Várdy [Games Econ. Behav. (2004)] shows that in the unique pure-strategy subgame perfect equilibrium of this game, the value of commitment is lost completely; however, there exists a mixed-strategy subgame perfect equilibri...

2003
Antoni W. Mazurkiewicz

Multi-agent systems considered in the paper consist of a finite number of agents, positions of which can be changed by system actions, and of an evaluation function which assigns to each agent a value of its current position (as e.g. the distance from the intended target). The set of all possible values is ordered; the intention of each agent is to reach a position with the minimum value. Any s...

2002
Helen E. Longino

In Science, Truth, and Democracy, Philip Kitcher develops the notion of well-ordered science: scientific inquiry whose research agenda and applications (but not methods) are subject to public control guided by democratic deliberation. Kitcher’s primary departure from his earlier views involves rejecting the idea that there is any single standard of scientific significance. The context-dependenc...

2009
David H. Bailey Jonathan M. Borwein

Abstract A previous study by one of the present authors, together with D. Borwein and I. Leonard [8], studied the asymptotic behavior of the p-norm of the sinc function: sinc(x) = (sin x)/x and along the way looked at closed forms for integer values of p. In this study we address these integrals with the tools of experimental mathematics, namely by computing their numerical values to high preci...

2012
Eric Mandelbaum

The idea that people can entertain propositions without believing them is widespread, intuitive, and most probably false. The main goal of this essay is to argue against the claim that people can entertain a proposition without believing it. Evidence is presented demonstrating that we cannot withhold assent from any proposition we happen to consider. A model of belief fixation is then sketched ...

پایان نامه :دانشگاه تربیت معلم - تهران - دانشکده علوم 1393

in this thesis, structural, electronical, and optical properties of inverse pervskite(ca3pbo) in cubic phase have been investigated.the calculation have been done based on density functional theory and according to generalized gradiant approximate (gga) as correlating potential. in order to calculate the configurations, implementing in the wien2k code have been used from 2013 version. first of ...

Journal: :The Japanese Journal of Educational Psychology 1993

Journal: :Metodički obzori/Methodological Horizons 2013

2018
Ofir Nachum Mohammad Norouzi George Tucker Dale Schuurmans

State-action value functions (i.e., Q-values) are ubiquitous in reinforcement learning (RL), giving rise to popular algorithms such as SARSA and Qlearning. We propose a new notion of action value defined by a Gaussian smoothed version of the expected Q-value. We show that such smoothed Q-values still satisfy a Bellman equation, making them learnable from experience sampled from an environment. ...

ژورنال: Hormozgan Medical Journal 2010
Ghofranipour, F, hajizadeh, E, Hidarnia, A.R, Montazeri, A, Taremian, F, Tavousi, M,

Introduction: Objective of present study was assessing the competence of self efficacy to development of theory of Reasoned Action (TRA) and comparison with original version by path analysis for substance abuse prevention among adolescents. Methods: In this analytic study, 433 randomly selected adolescents (range of age 15–19) from Tehran participated in study. The study design was based ...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید