On the Well-Behavedness of Important Attribute Evaluation Functions

نویسندگان

  • Tapio Elomaa
  • Juho Rousu
چکیده

The class of well-behaved evaluation functions simplifies and makes efficient the handling of numerical attributes; for them it suffices to concentrate on the boundary points in searching for the optimal partition. This holds always for binary partitions and also for multisplits if only the function is cumulative in addition to being well-behaved. The class of well-behaved evaluation functions is a proper superclass of convex evaluation functions. Thus, a large proportion of the most important attribute evaluation functions are wellbehaved. This paper explores the extent and boundaries of well-behaved functions. In particular, we examine C4.5’s default attribute evaluation function gain ratio, which has been known to have problems with numerical attributes. We show that gain ratio is not convex, but is still well-behaved with respect to binary partitioning. However, it cannot handle higher arity partitioning well. Our empirical experiments show that a very simple cumulative rectification to the poor bias of information gain significantly outperforms gain ratio.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

General and Eecient Multisplitting of Numerical Attributes

Often in supervised learning numerical attributes require special treatment and do not t the learning scheme as well as one could hope. Nevertheless, they are common in practical tasks and, therefore, need to be taken into account. We characterize the well-behavedness of an evaluation function, a property that guarantees the optimal multi-partition of an arbitrary numerical domain to be deened ...

متن کامل

A simple approach to multiple attribute decision making using loss functions

Multiple attribute decision making (MADM) methods are very much essential in all fields of engineering, management and other areas where limited alternatives exist and the decision maker has to select the best alternative. Different methods are available in the literature to tackle the MADM problems. The MADM problems are classified as scoring methods, compromising methods and concordance metho...

متن کامل

Introducing a New Approach for Prioritizing Combating Desertification Strategies Based on Multi- Attribute Decision Making

Addressing desertification, due to its multi-criteria nature, increasing development, extensive and long-term impacts on natural resources and human populations, is necessary to achieve sustainable development. Therefore, for optimal utilization of facilities and limited funds allocated to this issue, evaluation of current strategies, based on different criteria is essential to avoid wasting na...

متن کامل

Well-Behaved Evaluation Functions for Numerical Attributes

The class of well-behaved evaluation functions simpliies and makes eecient the handling of numerical attributes; for them it suuces to concentrate on the boundary points in searching for the optimal partition. This holds always for binary partitions and also for multisplits if only the function is cumulative in addition to being well-behaved. A large portion of the most important attribute eval...

متن کامل

The Application of Multi Attribute Decision Methods (MADM) on prioritizing Iranian fisheries research projects

The ultimate goal of an agriculture research system is on-time, correct and clear response to the problems and expectations of agriculture household and stakeholders. In this respect, though, due to variation and frequency of the problems and expectations and as well as many limitations such as financial deficit, short time and shortage in work force and equipments etc, the system cannot be tho...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997