Distributed Policy Evaluation Under Multiple Behavior Strategies

Similar resources

Data-Efficient Policy Evaluation Through Behavior Policy Search

We consider the task of evaluating a policy for a Markov decision process (MDP). The standard unbiased technique for evaluating a policy is to deploy the policy and observe its performance. We show that the data collected from deploying a different policy, commonly called the behavior policy, can be used to produce unbiased estimates with lower mean squared error than this standard technique. W...

Evaluation of Join Strategies for Distributed Mediation

Three join algorithms are evaluated in an environment with distributed main-memory based mediators and data sources. A streamed ship-out join ships bulks of tuples to a mediator near a data source, followed by post-processing in the client. An extended streamed semi-join in addition builds a main-memory hash index in the client mediator. A ship-in algorithm materializes and joins the data in th...

Hedging Strategies: Electricity Investment Decisions under Policy Uncertainty

Given uncertainty in long-term carbon reduction goals, how much non-carbon generation should be developed in the near-term? This research investigates the optimal balance between the risk of overinvesting in non-carbon sources that are ultimately not needed and the risk of underinvesting in non-carbon sources and subsequently needing to reduce carbon emissions dramatically. We employ a novel fr...

Risk Hedging Strategies under Energy System and Climate Policy Uncertainties

The future development of the energy sector is rife with uncertainties. They concern virtually the entire energy chain, from resource extraction to conversion technologies, energy demand, and the stringency of future environmental policies. Investment decisions today need thus not only to be cost-effective from the present perspective, but have to take into account also the imputed future risks...

Provider Behavior Under Global Budgeting and Policy Responses

Third-party payer systems are consistently associated with health care cost escalation. Taiwan's single-payer, universal coverage National Health Insurance (NHI) adopted global budgeting (GB) to achieve cost control. This study captures ophthalmologists' response to GB, specifically service volume changes and service substitution between low-revenue and high-revenue services following GB implem...

Journal

Journal title: IEEE Transactions on Automatic Control

Year: 2015

ISSN: 0018-9286, 1558-2523

DOI: 10.1109/tac.2014.2368731