Bandit Problems with Lévy Processes

Authors

Asaf Cohen

Eilon Solan

Related papers

arXiv:0906.0835v1 [math.PR] 4 Jun 2009. BANDIT PROBLEMS WITH LÉVY PAYOFF PROCESSES

Abstract. We study two-armed Lévy bandits in continuous time, which have one safe arm that yields a constant payoff s, and one risky arm that can be either of type High or Low; both types yield stochastic payoffs generated by a Lévy process. The expectation of the Lévy process when the arm is High is greater than s, and lower than s if the arm is Low. The decision maker (DM) has to choose, at a...

Bandit Problems with Lévy Payoff Processes

We study one-arm Lévy bandits in continuous time, which have one safe arm that yields a constant payoff s, and one risky arm that can be either of type High or Low; both types yield stochastic payoffs generated by a Lévy process. The expectation of the Lévy process when the arm is High is greater than s, and lower than s if the arm is Low. The decision maker (DM) has to choose, at any given tim...
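To make the setup above concrete, here is a minimal sketch of how a decision maker's belief about the risky arm's type could be updated from observed payoffs. It is not the paper's method: it assumes the simplest Lévy process, a Brownian motion with drift and a known volatility, discretized in time. The function names and the parameters `mu_high`, `mu_low`, `sigma`, and `dt` are hypothetical.

```python
import math
import random


def update_belief(p, dx, dt, mu_high, mu_low, sigma):
    """Posterior probability that the risky arm is High.

    Assumption (not from the source): over a step of length dt the payoff
    increment dx is Gaussian with mean mu*dt and variance sigma^2*dt,
    where mu is mu_high or mu_low depending on the arm's type.
    """
    if p <= 0.0 or p >= 1.0:
        return p  # a degenerate belief never moves under Bayes' rule
    # Log-likelihood ratio of High versus Low for the observed increment.
    llr = ((mu_high - mu_low) * dx
           - 0.5 * (mu_high ** 2 - mu_low ** 2) * dt) / sigma ** 2
    odds = p / (1.0 - p) * math.exp(llr)
    return odds / (1.0 + odds)


def simulate_belief(p0, is_high, mu_high, mu_low, sigma, dt, steps, seed=0):
    """Simulate payoffs from the risky arm and return the final belief."""
    rng = random.Random(seed)
    mu = mu_high if is_high else mu_low
    p = p0
    for _ in range(steps):
        # One discretized increment of a Brownian motion with drift mu.
        dx = mu * dt + sigma * math.sqrt(dt) * rng.gauss(0.0, 1.0)
        p = update_belief(p, dx, dt, mu_high, mu_low, sigma)
    return p
```

An increment exactly halfway between the two expected increments leaves the belief unchanged, larger increments push it toward High, and smaller ones push it toward Low. A general Lévy process would also need the jump component in the likelihood ratio, which this sketch omits.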

Additive Lévy Processes: Capacity and Hausdorff Dimension

This is a survey of the recently developed potential theory of additive Lévy processes and its applications to the fractal geometry of Lévy processes. Additive Lévy processes arise naturally in the study of the Brownian sheet, intersections of Lévy processes, and so on. We first summarize some recent results on the novel connections between an additive Lévy process X in R^d, and a natural class of ener...

Optimal Policies for a Class of Restless Multiarmed Bandit Scheduling Problems with Applications to Sensor Management

Consider the Markov decision problems (MDPs) arising in the areas of intelligence, surveillance, and reconnaissance in which one selects among different targets for observation so as to track their position and classify them from noisy data [9], [10]; medicine in which one selects among different regimens to treat a patient [1]; and computer network security in which one selects different compu...

Local Bandit Approximation for Optimal Learning Problems

In general, procedures for determining Bayes-optimal adaptive controls for Markov decision processes (MDPs) require a prohibitive amount of computation: the optimal learning problem is intractable. This paper proposes an approximate approach in which bandit processes are used to model, in a certain "local" sense, a given MDP. Bandit processes constitute an important subclass of MDPs, and have ...

Journal:

Math. Oper. Res.

Volume 38

Published 2013
