Learning the Switching Rate by Discretising Bernoulli Sources Online

نویسندگان

  • Steven de Rooij
  • Tim van Erven
چکیده

The expert tracking algorithm Fixed-Share depends on a parameter α, called the switching rate. The switching rate can be learned online with regret 1 2 log T + O(1) bits. The current fastest method to achieve this is based on optimal discretisation of the Bernoulli distributions into O( √ T ) bins and runs in O(T √ T ) time. However, the exact locations of these bins have to be determined algorithmically, and the final number of outcomes T must be known in advance. This paper introduces a new discretisation scheme with the same regret bound for known T , that specifies the number and positions of the discretisation points explicitly. The scheme is especially useful, however, when T is not known in advance: a new fully online algorithm is presented, which runs in O(T √ T log T ) time and achieves a regret of 1 2 log 3 log T +O(log log T ) bits.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Designing collaborative learning model in online learning environments

Introduction: Most online learning environments are challenging for the design of collaborative learning activities to achieve high-level learning skills. Therefore, the purpose of this study was to design and validate a model for collaborative learning in online learning environments. Methods: The research method used in this study was a mixed method, including qualitative content analysis and...

متن کامل

Learning Styles and the Writing Process in a Digitally Blended Environment: Revising, Switching, and Pausing Behaviors in Focus

The present investigation sought to explore the relationship between learning styles and writing behaviors of EFL learners in a blended environment. It also aimed to identify the learning style types best predicting writing behaviors. Initially, the participants' preferred learning styles were identified through the Kolb’s learning style inventory (Kolb, 1984). Secondly, data were obtained thro...

متن کامل

Designinga Neuro-Sliding Mode Controller for Networked Control Systems with Packet Dropout

This paper addresses control design in networked control system by considering stochastic packet dropouts in the forward path of the control loop. The packet dropouts are modelled by mutually independent stochastic variables satisfying Bernoulli binary distribution. A sliding mode controller is utilized to overcome the adverse influences of stochastic packet dropouts in networked control system...

متن کامل

Thompson Sampling in Switching Environments with Bayesian Online Change Point Detection

Thompson Sampling has recently been shown to achieve the lower bound on regret in the Bernoulli Multi-Armed Bandit setting. This bandit problem assumes stationary distributions for the rewards. It is often unrealistic to model the real world as a stationary distribution. In this paper we derive and evaluate algorithms using Thompson Sampling for a Switching Multi-Armed Bandit Problem. We propos...

متن کامل

Systematic review of learning changes as technology grows

Introduction: With the advent of information and communication technology, in recent decades, a new gate opened to human beings and all its biological dimensions, and created many changes in the field of education and learning. Accordingly, the purpose of this study is to investigate how changes have been made in how learners learn from the growth and advancement of technologies. Methods:...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009