Weighted Distance Weighted Discrimination and Its Asymptotic Properties.
نویسندگان
چکیده
While Distance Weighted Discrimination (DWD) is an appealing approach to classification in high dimensions, it was designed for balanced datasets. In the case of unequal costs, biased sampling, or unbalanced data, there are major improvements available, using appropriately weighted versions of DWD (wDWD). A major contribution of this paper is the development of optimal weighting schemes for various nonstandard classification problems. In addition, we discuss several alternative criteria and propose an adaptive weighting scheme (awDWD) and demonstrate its advantages over nonadaptive weighting schemes under some situations. The second major contribution is a theoretical study of weighted DWD. Both high-dimensional low sample-size asymptotics and Fisher consistency of DWD are studied. The performance of weighted DWD is evaluated using simulated examples and two real data examples. The theoretical results are also confirmed by simulations.
منابع مشابه
Asymptotic Properties of Distance-Weighted Discrimination
While Distance-Weighted Discrimination (DWD) is an appealing approach to classification in high dimensions, it was designed for balanced data sets. In the case of unequal costs, biased sampling or unbalanced data, there are major improvements available, using appropriately weighted versions of DWD. A major contribution of this paper is the development of optimal weighting schemes for various no...
متن کاملAsymptotic Behavior of Weighted Sums of Weakly Negative Dependent Random Variables
Let be a sequence of weakly negative dependent (denoted by, WND) random variables with common distribution function F and let be other sequence of positive random variables independent of and for some and for all . In this paper, we study the asymptotic behavior of the tail probabilities of the maximum, weighted sums, randomly weighted sums and randomly indexed weighted sums of heavy...
متن کاملInverse Maximum Dynamic Flow Problem under the Sum-Type Weighted Hamming Distance
Inverse maximum flow (IMDF), is among the most important problems in the field ofdynamic network flow, which has been considered the Euclidean norms measure in previousresearches. However, recent studies have mainly focused on the inverse problems under theHamming distance measure due to their practical and important applications. In this paper,we studies a general approach for handling the inv...
متن کاملDiscrimination of Quaternary iron placer deposits by integrating remote sensing band ratio, magnetometry and geology data by weighted overlay index method compared to SAM and FCC methods in 1:100000 sheet of Hamedan
Abstract Quaternary placer deposits are becoming increasingly important. Remote sensing is a very powerful tool in discriminating altered areas related to intrusion deposits, which has significantly reduced the cost and time of exploration. In this study, to identify iron-bearing alluvial zones within the 1:100000 sheet of Hamedan, satellite image processing techniques such as band ratio (BR),...
متن کاملDecision Making with Distance Measures, Weighted Averages and Induced Owa Operators
We develop a new decision making model by using distance measures, weighted averages and OWA operators. We introduce the induced ordered weighted averaging – weighted averaging distance (IOWAWAD) operator. We study some of its main properties and particular cases such as the weighted Hamming distance, the induced OWA distance (IOWAD), the arithmetic weighted distance and the arithmetic IOWAD op...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of the American Statistical Association
دوره 105 489 شماره
صفحات -
تاریخ انتشار 2010