Hannan Consistency in On-Line Learning in Case of Unbounded Losses Under Partial Monitoring
نویسندگان
چکیده
In this paper the sequential prediction problem with expert advice is considered when the loss is unbounded under partial monitoring scenarios. We deal with a wide class of the partial monitoring problems: the combination of the label efficient and multi-armed bandit problem, that is, where the algorithm is only informed about the performance of the chosen expert with probability ε ≤ 1. For bounded losses an algorithm is given whose expected regret scales with the square root of the loss of the best expert. For unbounded losses we prove that Hannan consistency can be achieved, depending on the growth rate of the average squared losses of the experts.
منابع مشابه
Solving high-order partial differential equations in unbounded domains by means of double exponential second kind Chebyshev approximation
In this paper, a collocation method for solving high-order linear partial differential equations (PDEs) with variable coefficients under more general form of conditions is presented. This method is based on the approximation of the truncated double exponential second kind Chebyshev (ESC) series. The definition of the partial derivative is presented and derived as new operational matrices of der...
متن کاملComparison of two methods of education (lecture and self learning) on knowledge and practice of mothers with under 3 year old children about growth monitoring and nutritional development stages
Introduction: Assessment of national children growth has shown children‘s growth failure in a large percentage of them in Iran. Growth failure is easily diagnosed by growth monitoring card .On the other hand, mothers’ Knowledge of Nutritional development stages can help them to modify their practice in this field .In this case, conducting educational and interventional programs play a key role ...
متن کاملSeismic Risk Assessment of Optimally Designed Highway Bridge Isolated by Ordinary Unbounded Elastomeric Bearings
Recent experimental research has shown that ordinary unbounded steel reinforced elastomeric bearings (SREBs) can be considered as an attractive cost-effective option for the seismic isolation of highway bridges. To further investigate its benefits, the current study is focused on the seismic risk assessment of an optimally designed highway bridge isolated by SREB system. A typical three-span hi...
متن کاملConsistency of structured output learning with missing labels
In this paper we study statistical consistency of partial losses suitable for learning structured output predictors from examples containing missing labels. We provide sufficient conditions on data generating distribution which admit to prove that the expected risk of the structured predictor learned by minimizing the partial loss converges to the optimal Bayes risk defined by an associated com...
متن کاملImproving Inventory Control in Production Process using Value Stream Mapping (VSM) and Production Line Simulation using Software Arena in urban economic centers (Case Study: Iran Bushing and Bearing Company)
Value stream mapping because of being able to understand process bottlenecks, as one of the most common tools for analyzing, identifying and eliminating various losses in operational and support processes are used. On the other hand, inventory management, precise control entry and exit of goods, accurate and timely information about the inventories status and planning, reduce product maintenanc...
متن کامل