Learning Minimum Volume Sets
نویسندگان
چکیده
Given a probability measure P and a reference measure μ, one is often interested in the minimum μ-measure set with P -measure at least α. Minimum volume sets of this type summarize the regions of greatest probability mass of P , and are useful for detecting anomalies and constructing confidence regions. This paper addresses the problem of estimating minimum volume sets based on independent samples distributed according to P . Other than these samples, no other information is available regarding P , but the reference measure μ is assumed to be known. We introduce rules for estimating minimum volume sets that parallel the empirical risk minimization and structural risk minimization principles in classification. As in classification, we show that the performances of our estimators are controlled by the rate of uniform convergence of empirical to true probabilities over the class from which the estimator is drawn. Thus we obtain finite sample size performance bounds in terms of VC dimension and related quantities. We also demonstrate strong universal consistency, an oracle inequality, and rates of convergence. The proposed estimators are illustrated with histogram and decision tree set estimation rules.
منابع مشابه
Automated Intruder Detection from Image Sequences using Minimum Volume Sets
We propose a new algorithm based on machine learning techniques for automatic intruder detection in visual surveillance networks. The proposed algorithm is theoretically founded on the concept of Minimum Volume Sets. Through application to image sequences from two different scenarios and comparison with existing algorithms, we show that it is possible for our proposed algorithm to easily obtain...
متن کاملEfficient anomaly detection using bipartite k-NN graphs
Learning minimum volume sets of an underlying nominal distribution is a very effective approach to anomaly detection. Several approaches to learning minimum volume sets have been proposed in the literature, including the K-point nearest neighbor graph (K-kNNG) algorithm based on the geometric entropy minimization (GEM) principle [4]. The K-kNNG detector, while possessing several desirable chara...
متن کاملLevel sets and minimum volume sets of probability density functions
Summarizing the whole support of a random variable into minimum volume sets of its probability density function is studied in the paper. We prove that the level sets of a probability density function correspond to minimum volume sets and also determine the conditions for which the inverse proposition is verified. The distribution function of the level cuts of a density function is also introduc...
متن کاملIntruder Detection in Camera Networks using the One-Class Neighbor Machine
We propose a new algorithm based on machine learning techniques for automatic intruder detection in surveillance networks. The algorithm is theoretically founded on the concept of minimum volume sets. Through application to real images from an example, simple closed-circuit television system and comparison with some existing algorithms, we show that it is possible to easily obtain high detectio...
متن کاملThermal conductivity of Water-based nanofluids: Prediction and comparison of models using machine learning
Statistical methods, and especially machine learning, have been increasingly used in nanofluid modeling. This paper presents some of the interesting and applicable methods for thermal conductivity prediction and compares them with each other according to results and errors that are defined. The thermal conductivity of nanofluids increases with the volume fraction and temperature. Machine learni...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of Machine Learning Research
دوره 7 شماره
صفحات -
تاریخ انتشار 2005