Investigating the multimodality of multivariate data with principal curves
نویسندگان
چکیده
We propose a simple method to assess the number of subpopulations in multivariate data by projecting the data on its principal curve and then applying Silverman’s bandwidth test to the resulting univariate sample. Our results indicate that this method works well even in high-dimensional settings with relatively small sample sizes, provided that the number of subpopulations is not large compared to the number of dimensions. © 2012 Elsevier B.V. All rights reserved.
منابع مشابه
Investigating the accuracy of multivariate regression and ARIMA models in predicting water demand (Case Study: Mashhad city)
Awareness of water demand is of particular importance for its policy in urban management. Predicting water demand in the future will allow managers to take the necessary measures regarding sustainable water supply, given the constraints and crises ahead. The purpose of this study is to compare multivariate regression and ARIMA models to predict water demand in Mashhad. In this study, first, the...
متن کاملWavelet Functional ANOVA, Bayesian False Discovery Rate, and Longitudinal Measurements of Oxygen Pressure in Rats
In conventional statistical practice, an observation is usually a number or a vector. But in many situations, observed values are curves or vectors of curves. Prototypical examples are growth curves (e.g., measurements of height and weight in children at particular age times), brain potentials, and a variety of responses in biological, chemical, and geophysical measurements. A vibrant research ...
متن کاملBayesian Analysis of Multivariate Smoothing Splines
A general version of multivariate smoothing splines with correlated errors and correlated curves is proposed. A suitable symmetric smoothing parameter matrix is introduced, and practical priors are developed for the unknown covariance matrix of the errors and the smoothing parameter matrix. An efficient algorithm for computing the multivariate smoothing spline is derived, which leads to an effi...
متن کاملAnalysis of physiochemical and microbial quality of waters of the Karkheh River in southwestern Iran using multivariate statistical methods
Rapid population growth as well as agricultural and industrial development have increased the contamination of Iranian rivers. This study utilized principal components analysis (PCA) to determine the degree of significance of qualitative parameters of water resources in the Karkheh River in southwestern Iran. Cluster analysis (CA) grouped the monitoring stations based on the water quality data ...
متن کاملAnother Look at Principal Curves and Surfaces
Principal curves have been defined as smooth curves passing through the ``middle'' of a multidimensional data set. They are nonlinear generalizations of the first principal component, a characterization of which is the basis of the definition of principal curves. We establish a new characterization of the first principal component and base our new definition of a principal curve on this propert...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Computational Statistics & Data Analysis
دوره 56 شماره
صفحات -
تاریخ انتشار 2012