Optimal Decision Trees for Nonlinear Metrics

نویسندگان

چکیده

Nonlinear metrics, such as the F1-score, Matthews correlation coefficient, and Fowlkes–Mallows index, are often used to evaluate performance of machine learning models, in particular, when facing imbalanced datasets that contain more samples one class than other. Recent optimal decision tree algorithms have shown remarkable progress producing trees with respect linear criteria, accuracy, but unfortunately nonlinear metrics remain a challenge. To address this gap, we propose novel algorithm based on bi-objective optimisation, which treats misclassifications each binary separate objective. We show that, for large lies Pareto frontier. Consequently, obtain by using our method generate set all nondominated trees. best knowledge, is first compute provably metrics. Our approach leads trade-off compared optimising metrics: resulting may be desirable according given metric at expense higher runtimes. Nevertheless, experiments illustrate runtimes reasonable majority tested datasets.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimal Decision Trees

We propose an Extreme Point Tabu Search (EPTS) algorithm that constructs globally optimal decision trees for classiication problems. Typically, decision tree algorithms are greedy. They optimize the misclassiication error of each decision sequentially. Our non-greedy approach minimizes the misclassiication error of all the decisions in the tree concurrently. Using Global Tree Optimization (GTO)...

متن کامل

Optimal Cascaded Hierarchies of Linguistic Decision Trees for Decision Making

For multiple attribute decision making, the underlying relationship between attributes and classification, decision or utility variable is often highly uncertain and imprecise. This requires an integrated treatment of uncertainty and fuzziness when modeling the propagation of information from low-level attributes to high-level decision variables. One of the main drawbacks to fuzzy modeling of s...

متن کامل

Evolutionary Decision Trees and Software Metrics for Module Defects Identification

Software metric is a measure of some property of a piece of software or its specification. The aim of this paper is to present an application of evolutionary decision trees in software engineering in order to classify the software modules that have or have not one or more reported defects. For this some metrics are used for detecting the class of modules with defects or without defects. Keyword...

متن کامل

Evolutionary Decision Trees and Software Metrics for Module Defects Identification

metric is a measure of some property of a piece of software or its specification. The aim of this paper is to present an application of evolutionary decision trees in software engineering in order to classify the software modules that have or have not one or more reported defects. For this some metrics are used for detecting the class of modules with defects or without defects.

متن کامل

Oblique Multicategory Decision Trees Using Nonlinear Programming

I of decision trees is a popular and effective method for solving classification problems in data-mining applications. This paper presents a new algorithm for multi-category decision tree induction based on nonlinear programming. This algorithm, termed OC-SEP (Oblique Category SEParation), combines the advantages of several other methods and shows improved generalization performance on a collec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i5.16490