Supervised and unsupervised data mining approaches in loan default prediction
نویسندگان
چکیده
Given the paramount importance of data mining in organizations and possible contribution a data-driven customer classification recommender systems for loan-extending financial institutions, study applied supervised approaches to derive best classifier loan default. A total 900 instances with determined attributes class labels were used training cross-validation processes while prediction 100 new without labels. In phase, J48 confidence factor 50% attained highest accuracy (76.85%), <em>k</em>-nearest neighbors (<em>k</em>-NN) 3 (78.38%) IBk variants, naïve Bayes has 76.65%, logistic 77.31% accuracy. <em>k</em>-NN have accuracy, F-measures, kappa statistics. Implementation these algorithms test set yielded 48 non-defaulters 52 defaulters <em>k</em> -NN 44 56 under logistic. Implications discussed paper.
منابع مشابه
Loan Default Prediction on Large Imbalanced Data Using Random Forests
In this paper, we propose an improved random forest algorithm which allocates weights to decision trees in the forest during tree aggregation for prediction and their weights are easily calculated based on out-of-bag errors in training. Experiments results show that our proposed algorithm beats the original random forest and other popular classification algorithms such as SVM, KNN and C4.5 in t...
متن کاملCombined Supervised and Unsupervised Learning in Genomic Data Mining
...................................................................................................................................................................IX
متن کاملCardiovascular Disease Analysis Using Supervised and Unsupervised Data Mining Techniques
Cardiovascular diseases are the main cause of death around the world. Every year, more people die from these diseases than from any other cause. According to World Health Organization data, in 2012 more than 17,5 million people died from this cause, and that represents 31% of all deaths registered worldwide. Data mining techniques are widely used for the analysis of diseases, including cardiova...
متن کاملSupervised and Unsupervised Data Mining with an Evolutionary Algorithm
This paper describes our current research with RAGA (Rule Acquisition with a Genetic Algorithm). RAGA is a genetic algorithm and genetic programming hybrid that is designed for the tasks of supervised and certain types of unsupervised data mining. Since its initial release we have improved its predictive accuracy and data coverage, as well as its ability to generate more scalable rule hierarchi...
متن کاملthe clustering and classification data mining techniques in insurance fraud detection:the case of iranian car insurance
با توجه به گسترش روز افزون تقلب در حوزه بیمه به خصوص در بخش بیمه اتومبیل و تبعات منفی آن برای شرکت های بیمه، به کارگیری روش های مناسب و کارآمد به منظور شناسایی و کشف تقلب در این حوزه امری ضروری است. درک الگوی موجود در داده های مربوط به مطالبات گزارش شده گذشته می تواند در کشف واقعی یا غیرواقعی بودن ادعای خسارت، مفید باشد. یکی از متداول ترین و پرکاربردترین راه های کشف الگوی داده ها استفاده از ر...
ذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Electrical and Computer Engineering
سال: 2023
ISSN: ['2088-8708']
DOI: https://doi.org/10.11591/ijece.v13i2.pp1837-1847