Investigating the multimodality of multivariate data with principal curves

نویسندگان

  • Murat O. Ahmed
  • Guenther Walther
چکیده

We propose a simple method to assess the number of subpopulations in multivariate data by projecting the data on its principal curve and then applying Silverman’s bandwidth test to the resulting univariate sample. Our results indicate that this method works well even in high-dimensional settings with relatively small sample sizes, provided that the number of subpopulations is not large compared to the number of dimensions. © 2012 Elsevier B.V. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Investigating the accuracy of multivariate regression and ARIMA models in predicting water demand (Case Study: Mashhad city)

Awareness of water demand is of particular importance for its policy in urban management. Predicting water demand in the future will allow managers to take the necessary measures regarding sustainable water supply, given the constraints and crises ahead. The purpose of this study is to compare multivariate regression and ARIMA models to predict water demand in Mashhad. In this study, first, the...

متن کامل

Wavelet Functional ANOVA, Bayesian False Discovery Rate, and Longitudinal Measurements of Oxygen Pressure in Rats

In conventional statistical practice, an observation is usually a number or a vector. But in many situations, observed values are curves or vectors of curves. Prototypical examples are growth curves (e.g., measurements of height and weight in children at particular age times), brain potentials, and a variety of responses in biological, chemical, and geophysical measurements. A vibrant research ...

متن کامل

Bayesian Analysis of Multivariate Smoothing Splines

A general version of multivariate smoothing splines with correlated errors and correlated curves is proposed. A suitable symmetric smoothing parameter matrix is introduced, and practical priors are developed for the unknown covariance matrix of the errors and the smoothing parameter matrix. An efficient algorithm for computing the multivariate smoothing spline is derived, which leads to an effi...

متن کامل

Analysis of physiochemical and microbial quality of waters of the Karkheh River in southwestern Iran using multivariate statistical methods

Rapid population growth as well as agricultural and industrial development have increased the contamination of Iranian rivers. This study utilized principal components analysis (PCA) to determine the degree of significance of qualitative parameters of water resources in the Karkheh River in southwestern Iran. Cluster analysis (CA) grouped the monitoring stations based on the water quality data ...

متن کامل

Another Look at Principal Curves and Surfaces

Principal curves have been defined as smooth curves passing through the ``middle'' of a multidimensional data set. They are nonlinear generalizations of the first principal component, a characterization of which is the basis of the definition of principal curves. We establish a new characterization of the first principal component and base our new definition of a principal curve on this propert...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computational Statistics & Data Analysis

دوره 56  شماره 

صفحات  -

تاریخ انتشار 2012