Investigating Outliers Detection Methods for the Iranian Manufacturing Establishment Survey Data

نویسندگان

  • Zahra Rezaei Ghahroodi
  • Taban Baghfalaki
  • Mojtaba Ganjali
  • Zahra Rezaei
چکیده

The role and importance of the industrial sector in the economic development specify the necessity of having accurate and timely data for exact planning. As outliers data in establishment surveys are common due to the structure of the economy, the evaluation of survey data by identifying and investigating outliers prior to the release of data is necessary. In this paper the practical application of different robust multivariate outlier detection methods based on the Mahalanobis distance with BACON algorithm, minimum volume ellipsoid (MVE) estimator, minimum covariance determinant (MCD) estimator, Stahel-Donoho estimator is presented. Also some univariate outlier detection methods such as Hadi and Simonoff (1993) method, using some regression models, are presented. These methods are illustrated using a real data set on Iranian Manufacturing Establishment Survey (IMES). These data are collected each year by the Statistical Center of Iran using sampling weights. In this paper it is demonstrated that the use of different robust outlier detection methods (multivariate and univariate), in a number of manufacturing industries, leads to the same results.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Outliers and influential observations can have an important effect on work with estimation and inference from establishment survey data. Practical development and implementation of methods to identify and account for outliers and influential observations in complex survey data

Outliers and influential observations can have an important effect on work with estimation and inference from establishment survey data. Practical development and implementation of methods to identify and account for outliers and influential observations in complex survey data require an agency to balance several factors, including: (i) the mathematical statistics properties of detection method...

متن کامل

Identification of outliers types in multivariate time series using genetic algorithm

Multivariate time series data, often, modeled using vector autoregressive moving average (VARMA) model. But presence of outliers can violates the stationary assumption and may lead to wrong modeling, biased estimation of parameters and inaccurate prediction. Thus, detection of these points and how to deal properly with them, especially in relation to modeling and parameter estimation of VARMA m...

متن کامل

Introduction Package CircOutlier For Detection of Outliers in Circular-Circular Regression

One of the most important problem in any statistical analysis is the existence of unexpected observations. Some observations are not a part of the study and are known as outliers. Studies have shown that the outliers affect to the performance of statistical standard methods in models and predictions. The point of this work is to provide a couple of statistical package in R software to identi...

متن کامل

A statistical test for outlier identification in data envelopment analysis

In the use of peer group data to assess individual, typical or best practice performance, the effective detection of outliers is critical for achieving useful results. In these ‘‘deterministic’’ frontier models, statistical theory is now mostly available. This paper deals with the statistical pared sample method and its capability of detecting outliers in data envelopment analysis. In the prese...

متن کامل

A robust wavelet based profile monitoring and change point detection using S-estimator and clustering

Some quality characteristics are well defined when treated as response variables and are related to some independent variables. This relationship is called a profile. Parametric models, such as linear models, may be used to model profiles. However, in practical applications due to the complexity of many processes it is not usually possible to model a process using parametric models.In these cas...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003