Sequential Change-Point Detection for High-Dimensional and Non-Euclidean Data

نویسندگان

چکیده

In many applications, it is often of practical and scientific interest to detect anomaly events in a streaming sequence high-dimensional or non-Euclidean observations. We study non-parametric framework that utilizes nearest neighbor information among the observations changes an online setting. It can be applied data arbitrary dimension as long similarity measure on sample space defined. consider new test statistics under this more effectively than existing while keeping false discovery rate controlled at fixed level. Analytic formulas approximating average run lengths approaches are derived make them fast applicable modern datasets. Simulation studies provided support theoretical results. The proposed approach illustrated with analysis NYC taxi dataset.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

on the bayesian sequential change-point detection

the problems of sequential change-point have several important applications in quality control, signal processing, and failure detection in industry and finance. we discuss a bayesian approach in the context of statistical process control: at an unknown time $tau$, the process behavior changes and the distribution of the data changes from p0 to p1. two cases are considered: (i) p0 and p1 are fu...

متن کامل

Change-point models and performance measures for sequential change detection

For the problem of sequential change detection we propose a novel modelling of the change-point mechanism. In particular we regard the time of change as a stopping time controlled by Nature. Nature, in order to decide when to impose the change, accesses sequentially information which can be different from the information provided to the Statistician to detect the change. Using as performance me...

متن کامل

Dynamic Frailty and Change Point Models for Recurrent Events Data

Abstract. We present a Bayesian analysis for recurrent events data using a nonhomogeneous mixed Poisson point process with a dynamic subject-specific frailty function and a dynamic baseline intensity func- tion. The dynamic subject-specific frailty employs a dynamic piecewise constant function with a known pre-specified grid and the baseline in- tensity uses an unknown grid for the piecewise ...

متن کامل

Outlier Detection in High Dimensional, Spatial and Sequential Data Sets

Of all the data mining techniques, outlier detection seems closest to the definition of “discovering nuggets of information” in large databases. When an outlier is detected, and determined to be genuine, it can provide insights, which can radically change our understanding of the underlying process. The purpose of the research underlying this thesis was to investigate and devise methods to mine...

متن کامل

Statistics for Change Detection in High–dimensional Data Streams

The method of change (or anomaly) detection in high-dimensional discrete-time processes using a multivariate Hotelling chart is presented. We use normal random projections as a method of dimensionality reduction. We indicate diagnostic properties of the Hotelling control chart applied to data projected onto a random subspace of R. We examine the random projection method using artificial noisy i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Signal Processing

سال: 2022

ISSN: ['1053-587X', '1941-0476']

DOI: https://doi.org/10.1109/tsp.2022.3205763