Reinforcement Learning of Sequential Price Mechanisms

نویسندگان

چکیده

We introduce the use of reinforcement learning for indirect mechanisms, working with existing class sequential price which generalizes both serial dictatorship and posted mechanisms essentially characterizes all strongly obviously strategyproof mechanisms. Learning an optimal mechanism within this forms a partially-observable Markov decision process. provide rigorous conditions when is more powerful than simpler static sufficiency or insufficiency observation statistics learning, necessity complex (deep) policies. show that our approach can learn near-optimal in several experimental settings.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Human Reinforcement Learning of Sequential Action

Learning sequential actions is an essential human ability, for most daily activities are sequential. We modify the serial reaction time (SRT) task, originally used to teach people a consistent sequence of button presses by cueing them with the next target response, to record mouse movements, collecting continuous response trajectories. Further, we introduce a reinforcement learning version of t...

متن کامل

Stock Price Prediction Using Reinforcement Learning

Recently, numerous investigations for stock price prediction and portfolio management using machine learning have been trying to develop efficient mechanical trading systems. But these systems have a limitation in that they are mainly based on the supervised leaming which is not so adequate for leaming problems with long-term goals and delayed rewards. This paper proposes a method of applying r...

متن کامل

Reinforcement Learning-Based Load Shared Sequential Routing

We consider event dependent routing algorithms for on-line explicit source routing in MPLS networks. The proposed methods are based on load shared sequential routing in which load sharing factors are updated using learning algorithms. The learning algorithms we employ are either based on learning automata or on online learning algorithms that were originally devised for solving the adversarial ...

متن کامل

Sequential Constant Size Compressors for Reinforcement Learning

Traditional Reinforcement Learning methods are insufficient for AGIs who must be able to learn to deal with Partially Observable Markov Decision Processes. We investigate a novel method for dealing with this problem: standard RL techniques using as input the hidden layer output of a Sequential Constant-Size Compressor (SCSC). The SCSC takes the form of a sequential Recurrent Auto-Associative Me...

متن کامل

Reinforcement learning: Computational theory and biological mechanisms

Reinforcement learning is a computational framework for an active agent to learn behaviors on the basis of a scalar reward signal. The agent can be an animal, a human, or an artificial system such as a robot or a computer program. The reward can be food, water, money, or whatever measure of the performance of the agent. The theory of reinforcement learning, which was developed in an artificial ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i6.16659