online learning algorithm

Online multiclass learning with ”bandit” feedback under a Passive-Aggressive approach

2015

Hongliang Zhong Emmanuel Daucé Liva Ralaivola

Abstract. This paper presents a new approach to online multi-class learning with bandit feedback. This algorithm, named PAB (Passive Aggressive in Bandit) is a variant of Online Passive-Aggressive Algorithm proposed by [2], the latter being an e↵ective framework for performing max-margin online learning. We analyze some of its operating principles, and show it to provide a good and scalable sol...

متن کامل

SpectralFPL: Online Spectral Learning for Single Topic Models

Journal: :CoRR 2017

Tong Yu Branislav Kveton Zheng Wen Hung Hai Bui Ole J. Mengshoel

We study the problem of learning a latent variable model from a stream of data. Latent variable models are popular in practice because they can explain observed data in terms of unobserved concepts. These models have been traditionally studied in the offline setting. The online EM is arguably the most popular algorithm for learning latent variable models online. Although it is computationally e...

متن کامل

Online Relative Entropy Policy Search using Reproducing Kernel Hilbert Space Embeddings

2016

Zhitang Chen Pascal Poupart Yanhui Geng

Kernel methods have been successfully applied to reinforcement learning problems to address some challenges such as high dimensional and continuous states, value function approximation and state transition probability modeling. In this paper, we develop an online policy search algorithm based on a recent state-of-the-art algorithm REPS-RKHS that uses conditional kernel embeddings. Our online al...

متن کامل

An Online Adaptive Bandwidth Allocation Optimization Algorithm for Wireless Multimedia Communication Networks

2007

JIANG Qi XI Hong-Sheng YIN Bao-Qun

The issue of QoS (quality of service) provisioning for adaptive multimedia in wireless communication networks is considered. A reinforcement learning based online adaptive bandwidth allocation optimization algorithm is proposed. First, an event-driven stochastic switching model is introduced to formulate the adaptive bandwidth allocation problem as a constrained continuous-time Markov decision ...

متن کامل

Adaptive Subgradient Methods for Online AUC Maximization

Journal: :CoRR 2016

Yi Ding Peilin Zhao Steven C. H. Hoi Yew-Soon Ong

Learning for maximizing AUC performance is an important research problem in Machine Learning and Artificial Intelligence. Unlike traditional batch learning methods for maximizing AUC which often suffer from poor scalability, recent years have witnessed some emerging studies that attempt to maximize AUC by single-pass online learning approaches. Despite their encouraging results reported, the ex...

متن کامل

Online-to-Confidence-Set Conversions and Application to Sparse Stochastic Bandits

2012

Yasin Abbasi-Yadkori Dávid Pál Csaba Szepesvári

We introduce a novel technique, which we call online-to-confidence-set conversion. The technique allows us to construct highprobability confidence sets for linear prediction with correlated inputs given the predictions of any algorithm (e.g., online LASSO, exponentiated gradient algorithm, online least-squares, p-norm algorithm) targeting online learning with linear predictors and the quadratic...

متن کامل

Notes on Single-Pass Online Learning Algorithms

2006

Vitor R. Carvalho William W. Cohen

Online learning methods are typically faster and have a much smaller memory footprint than batch learning methods. However, in practice online learners frequently require multiple passes over the same training data in order to achieve accuracy comparable to batch learners. We investigate the problem of single-pass online learning, i.e., training only on a single pass over the data. We compare t...

متن کامل

Online Learning with Pairwise Loss Functions

Journal: :CoRR 2013

Yuyang Wang Roni Khardon Dmitry Pechyony Rosie Jones

Efficient online learning with pairwise loss functions is a crucial component in building largescale learning system that maximizes the area under the Receiver Operator Characteristic (ROC) curve. In this paper we investigate the generalization performance of online learning algorithms with pairwise loss functions. We show that the existing proof techniques for generalization bounds of online a...

متن کامل

Adult learners' intention to adopt mobile learning: A motivational perspective

Journal: :BJET 2015

Kamarul Faizal Hashim Felix B. Tan Ammar Rashid

Mobile learning (m-learning) is gaining popularity as the „anytime, anywhere‟ online learning channel. Academics and practitioners alike are showing interest in examining its ability to support online learning. However, prior studies have highlighted the challenges in promoting m-learning adoption. The extantm-learning literature has mainly focused on technology related factors to examine m-lea...

متن کامل

Lossless Online Bayesian Bagging

Journal: :Journal of Machine Learning Research 2004

Herbert K. H. Lee Merlise A. Clyde

Bagging frequently improves the predictive performance of a model. An online version has recently been introduced, which attempts to gain the benefits of an online algorithm while approximating regular bagging. However, regular online bagging is an approximation to its batch counterpart and so is not lossless with respect to the bagging operation. By operating under the Bayesian paradigm, we in...

متن کامل