action value function

Search results for: action value function

Number of results: 2342819

An experimental study of commitment in Stackelberg games with observation costs

Journal: Games and Economic Behavior 2004

John Morgan Felix Várdy

We report on experiments examining the value of commitment in Stackelberg games where the follower chooses whether to pay some cost to perfectly observe the leader’s action. Várdy [Games Econ. Behav. (2004)] shows that in the unique pure-strategy subgame perfect equilibrium of this game, the value of commitment is lost completely; however, there exists a mixed-strategy subgame perfect equilibri...


Competition, Cooperation, and Authorization

2003

Antoni W. Mazurkiewicz

Multi-agent systems considered in the paper consist of a finite number of agents, positions of which can be changed by system actions, and of an evaluation function which assigns to each agent a value of its current position (as e.g. the distance from the intended target). The set of all possible values is ordered; the intention of each agent is to reach a position with the minimum value. Any s...


Science and the Common Good: Thoughts on Philip Kitcher’s Science, Truth, and Democracy*

2002

Helen E. Longino

In Science, Truth, and Democracy, Philip Kitcher develops the notion of well-ordered science: scientific inquiry whose research agenda and applications (but not methods) are subject to public control guided by democratic deliberation. Kitcher’s primary departure from his earlier views involves rejecting the idea that there is any single standard of scientific significance. The context-dependenc...


Experimental computation with oscillatory integrals

2009

David H. Bailey Jonathan M. Borwein

A previous study by one of the present authors, together with D. Borwein and I. Leonard [8], studied the asymptotic behavior of the p-norm of the sinc function, sinc(x) = (sin x)/x, and along the way looked at closed forms for integer values of p. In this study we address these integrals with the tools of experimental mathematics, namely by computing their numerical values to high preci...


Thinking Is Believing

2012

Eric Mandelbaum

The idea that people can entertain propositions without believing them is widespread, intuitive, and most probably false. The main goal of this essay is to argue against the claim that people can entertain a proposition without believing it. Evidence is presented demonstrating that we cannot withhold assent from any proposition we happen to consider. A model of belief fixation is then sketched ...


Effect of spin-orbit coupling on the optical properties of the inverse perovskite structure Ca3PbO in the cubic phase

Thesis: Tarbiat Moallem University, Tehran, Faculty of Science, 1393

Mohsen Mohammadi, Faramarz Kanjouri

In this thesis, the structural, electronic, and optical properties of the inverse perovskite Ca3PbO in the cubic phase are investigated. The calculations are based on density functional theory, using the generalized gradient approximation (GGA) for the exchange-correlation potential. The calculations were carried out with the 2013 version of the WIEN2k code. First of ...

FUNCTION OF UTTERANCE IN EXPLAINING ACTION-SEQUENCES

Journal: The Japanese Journal of Educational Psychology 1993


ACTION RESEARCH IN FUNCTION OF PEER VIOLENCE

Journal: Metodički obzori/Methodological Horizons 2013


Smoothed Action Value Functions for Learning Gaussian Policies

2018

Ofir Nachum Mohammad Norouzi George Tucker Dale Schuurmans

State-action value functions (i.e., Q-values) are ubiquitous in reinforcement learning (RL), giving rise to popular algorithms such as SARSA and Q-learning. We propose a new notion of action value defined by a Gaussian smoothed version of the expected Q-value. We show that such smoothed Q-values still satisfy a Bellman equation, making them learnable from experience sampled from an environment. ...


Modification of Reasoned Action Theory and comparison with the original version by path analysis for substance abuse prevention among adolescents

Journal: Hormozgan Medical Journal 2010

Ghofranipour, F., Hajizadeh, E., Hidarnia, A.R., Montazeri, A., Taremian, F., Tavousi, M.

Introduction: The objective of the present study was to assess whether self-efficacy can extend the Theory of Reasoned Action (TRA), and to compare the modified version with the original by path analysis, for substance abuse prevention among adolescents. Methods: In this analytic study, 433 randomly selected adolescents (aged 15–19) from Tehran participated. The study design was based ...

