Statistical evaluation of rough set dependency analysis

Authors

  • Ivo Düntsch
  • Günther Gediga
Abstract

Rough set data analysis (RSDA) has recently become a frequently studied symbolic method in data mining. Among other things, it is used to extract rules from databases; it is, however, not clear from within the methods of rough set analysis whether the extracted rules are valid. In this paper, we propose enhancing RSDA with two simple statistical procedures, both based on randomization techniques, to evaluate the validity of prediction based on the approximation quality of attributes in rough set dependency analysis. The first procedure tests the casualness of a prediction to ensure that the prediction is not based on only a few (casual) observations. The second procedure tests the conditional casualness of an attribute within a prediction rule. The procedures are applied to three data sets originally published in the context of rough set analysis. We argue that several claims of these analyses need to be modified because they lack validity, and that other possibly significant results were overlooked.
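
The first procedure can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the usual rough set definition of approximation quality (the share of objects whose condition class maps to a single decision value) and a plain permutation test over the decision column. The function names, the dictionary-based table layout, and the `(hits + 1) / (n_perm + 1)` estimate are all assumptions made for illustration. The conditional test for a single attribute is not covered.

```python
import random
from collections import defaultdict


def approximation_quality(rows, cond, dec):
    """Share of objects lying in condition classes with one decision value."""
    classes = defaultdict(list)
    for row in rows:
        # Objects with equal values on all condition attributes share a class.
        classes[tuple(row[a] for a in cond)].append(row[dec])
    consistent = sum(len(v) for v in classes.values() if len(set(v)) == 1)
    return consistent / len(rows)


def randomization_p_value(rows, cond, dec, n_perm=1000, seed=0):
    """Estimate how often shuffled decisions reach the observed quality."""
    rng = random.Random(seed)
    observed = approximation_quality(rows, cond, dec)
    decisions = [row[dec] for row in rows]
    hits = 0
    for _ in range(n_perm):
        shuffled = decisions[:]
        rng.shuffle(shuffled)
        # Reassign decision values at random, keeping condition values fixed.
        permuted = [{**row, dec: d} for row, d in zip(rows, shuffled)]
        if approximation_quality(permuted, cond, dec) >= observed:
            hits += 1
    # Assumed convention: count the observed arrangement as one permutation.
    return observed, (hits + 1) / (n_perm + 1)


# Hypothetical toy table: every object has a unique condition value.
toy = [{"id": i, "d": i % 2} for i in range(8)]
quality, p_value = randomization_p_value(toy, ["id"], "d", n_perm=200)
```

In the toy table every condition class holds a single object, so the approximation quality is perfect under every shuffle and the estimated p-value is 1. This is the situation the abstract warns about: a prediction that looks exact but rests on only a few (casual) observations.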

Similar resources

Rough Set Dependency Analysis in Evaluation Studies – An Application in the Study of Repeated Heart Attacks

One method for modelling uncertain or inaccurate information is rough set analysis, which was introduced and studied by Pawlak (1982) and his co-workers. Unlike other methods such as fuzzy set theory, Dempster-Shafer theory or statistical methods, rough set analysis requires no external parameters and uses only the information presented in the given data. In the present study we apply rough se...

Rough Set Theory – Fundamental Concepts, Principals, Data Extraction, and Applications

Rough Set Theory, proposed in 1982 by Zdzislaw Pawlak, is in a state of constant development. Its methodology is concerned with the classification and analysis of imprecise, uncertain or incomplete information and knowledge, and is considered one of the first non-statistical approaches in data analysis (Pawlak, 1982). The fundamental concept behind Rough Set Theory is the approximation of lo...

Using Rough Data Envelopment Analysis to Evaluate Suppliers, Case Study: Iran Transfo Industrial Group

In this paper, the performance of suppliers is evaluated based on their efficiencies. The evaluation environment is not always precise, and the values of the evaluation indexes may be imprecise. In this situation, traditional deterministic models cannot be employed. To overcome the uncertainty problem, there are different models such as stochastic, statistical, rough, fuzzy, etc. for solving uncertainty e...

Slack-Based Measurement with Rough Data

Rough data envelopment analysis (RDEA) evaluates the performance of decision making units (DMUs) under the assumption of rough uncertainty. In this paper, the discussion of RDEA is extended. The RSBM model is proposed by integrating the SBM model with rough set theory. The solution process is presented, and the model is applied to the efficiency evaluation of DMUs with uncertain ...

A Non-radial Rough DEA Model

For the efficiency evaluation of Decision Making Units that have uncertain information, the Rough Data Envelopment Analysis technique is used, which is derived from rough set theory and Data Envelopment Analysis (DEA). In some situations rough data change non-radially. To this end, this paper proposes an additive rough-DEA model and illustrates the proposed model with a numerical example.

Journal title:
  • Int. J. Hum.-Comput. Stud.

Volume 46  Issue 

Pages  -

Publication date 1997