Initial Seed Value Efficiency on Data Mining Tools Performances: A Credit Approval Classification Example

نویسندگان

چکیده

After 2000s, Computer capacities and features are increased access to data made easy. However, the produced recorded should be meaningful. Transformation of unprocessed into meaningful information can done with help mining. In this study, classification methods from mining applications studied. First, parameters that make results same set different were investigated on 4 tools (Weka, Rapid Miner, Knime, Orange), It has been tested 3 algorithms (K nearest neighborhood, Naive Bayes, Random Forest). order evaluate performance while creating models, was divided training test as 80% -20%, 70% -30% 60-40%. The accuracy, roc precision values used classifying data. While classifying, effect algorithm is observed. most important these initial seed value. a value using especially in determines placement directly affects result. respect, it very determine correctly. between 0 100 evaluated shown could change accuracy approximately by 5%.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting the Credit Risk of Loans Using Data Mining Tools

 One of the most common causes or credit phenomenon that is taken into account for credit risk is the customer’s noncompliance with the commitments. Thus, by predicting the behavior of loan applicants, the growth rate of debts can be decreased. Hence, this study is conducted on corporate applicants for loans in one of the public banks in Iran. In this paper, the main elements comprising the cus...

متن کامل

Credit Scoring Based on Hybrid Data Mining Classification

The credit scoring has been regarded as a critical topic. This study proposed four approaches combining with the NN (Neural Network) classifier for features selection that retains sufficient information for classification purpose. Two UCI data sets and different approaches combined with NN classifier were constructed by selecting features. NN classifier combines with conventional statistical LD...

متن کامل

Automatic Credit Approval using Classification Method

This research paper aims to evaluate the performance and accuracy of classification models based on decision trees(C5.0 & CART), Support Vector Machine(SVM) and Logistic Regression with a dataset. Three methods to detect fraud are presented. Automatic credit approval is the most significant process in the banking sector and financial institutions. It prevents the fraud which is going to happen....

متن کامل

A Proposed Classification of Data Mining Techniques in Credit Scoring

Credit scoring has become very important issue due to the recent growth of the credit industry, so the credit department of the bank faces a large amount of credit data. Clearly it is impossible analyzing this huge amount of data both in economic and manpower terms, so data mining techniques were employed for this purpose. So far many data mining methods are proposed to handle credit scoring pr...

متن کامل

On Mining Fuzzy Classification Rules for Imbalanced Data

Fuzzy rule-based classification system (FRBCS) is a popular machine learning technique for classification purposes. One of the major issues when applying it on imbalanced data sets is its biased to the majority class, such that, it performs poorly in respect to the minority class. However many cases the minority classes are more important than the majority ones. In this paper, we have extended ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Düzce Üniversitesi bilim ve teknoloji dergisi

سال: 2021

ISSN: ['2148-2446']

DOI: https://doi.org/10.29130/dubited.813101