Comparative Study of ID3/C4.5 Decision tree and Multilayer Perceptron Algorithms for the Prediction of Typhoid Fever

Author

  • O. O. Adeyemo

Abstract

Data mining is an essential phase of knowledge discovery in databases, used to extract hidden patterns from large databases. Data mining concepts and methods can be applied in fields such as marketing, medicine, real estate, customer relationship management, engineering, and web mining. The main objective of this paper is to compare the accuracy of the Multilayer Perceptron (MLP) Artificial Neural Network with that of the ID3 (Iterative Dichotomiser 3) and C4.5 (known as J48 in Weka) decision tree algorithms in predicting typhoid fever. The data used is a patient dataset collected from a well-known Nigerian hospital. The ID3, C4.5, and MLP implementations in the WEKA data mining software were used. The collected data were transformed into a form acceptable to the data mining software and split into two sets, a training dataset and a testing dataset, so that they could be imported into the system. The training set was used to enable the system to observe relationships between the input data and the resulting outcomes in order to perform the prediction. The testing dataset contains data used to test the performance of the model. This model can be used by medical experts in both private and public hospitals to make more timely and consistent diagnoses of typhoid fever cases, which will reduce the death rate in our country. The MLP ANN model exhibits good overall performance in the prediction of typhoid fever, as shown by the low values of the Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and Relative Absolute Error (RAE) performance measures.

Keywords: ID3, C4.5, MLP, Decision Tree, Artificial Neural Network, Typhoid fever

African Journal of Computing & ICT Reference Format: O. O. Adeyemo, T. O. Adeyeye & D. Ogunbiyi (2015). Comparative Study of ID3/C4.5 Decision tree and Multilayer Perceptron Algorithms for the Prediction of Typhoid Fever. Afr J. of Comp & ICTs. Vol 8, No. 1. Pp 103-112.
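
The three error measures named in the abstract can be illustrated with a minimal Python sketch. It uses the usual textbook definitions and is not the authors' code. The function names and sample values are hypothetical, and Weka's own computation for nominal classes works on class probability estimates, so its reported figures may differ in detail.

```python
import math

def mean_absolute_error(actual, predicted):
    # MAE: average magnitude of the prediction errors
    return sum(abs(p - a) for a, p in zip(actual, predicted)) / len(actual)

def root_mean_squared_error(actual, predicted):
    # RMSE: square root of the average squared error
    squared = sum((p - a) ** 2 for a, p in zip(actual, predicted))
    return math.sqrt(squared / len(actual))

def relative_absolute_error(actual, predicted):
    # RAE: total absolute error relative to a baseline that always
    # predicts the mean of the actual values (assumed baseline)
    mean_actual = sum(actual) / len(actual)
    baseline = sum(abs(a - mean_actual) for a in actual)
    total = sum(abs(p - a) for a, p in zip(actual, predicted))
    return total / baseline

# Hypothetical test-set outcomes (1 = typhoid, 0 = no typhoid)
# and hypothetical model scores; not data from the paper.
actual = [1, 0, 1, 1, 0]
predicted = [0.9, 0.2, 0.7, 0.6, 0.1]

print("MAE :", mean_absolute_error(actual, predicted))
print("RMSE:", root_mean_squared_error(actual, predicted))
print("RAE :", relative_absolute_error(actual, predicted))
```

Lower values on all three measures indicate predictions closer to the observed outcomes, which is the sense in which the abstract describes the MLP model as performing well.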

Similar resources

Comparison of Three Classification Techniques: CART, C4.5 and Multi-Layer Perceptrons

In this paper, after some introductory remarks into the classification problem as considered in various research communities, and some discussions concerning some of the reasons for ascertaining the performances of the three chosen algorithms, viz., CART (Classification and Regression Tree), C4.5 (one of the more recent versions of a popular induction tree technique known as ID3), and a multi-l...

A comparative study of decision tree ID3 and C4.5

Data mining is a useful tool for discovering knowledge from large data sets. Different methods and algorithms are available in data mining. Classification is the most common method used for mining rules from a large database. The decision tree method is generally used for classification because its simple hierarchical structure aids user understanding and decision making. Various data...

Application of ensemble learning techniques to model the atmospheric concentration of SO2

In view of pollution prediction modeling, the study adopts homogenous (random forest, bagging, and additive regression) and heterogeneous (voting) ensemble classifiers to predict the atmospheric concentration of Sulphur dioxide. For model validation, results were compared against widely known single base classifiers such as support vector machine, multilayer perceptron, linear regression and re...

A hybrid model based on machine learning and genetic algorithm for detecting fraud in financial statements

Financial statement fraud has increasingly become a serious problem for business, government, and investors. In fact, this threatens the reliability of capital markets, corporate heads, and even the audit profession. Auditors in particular face their apparent inability to detect large-scale fraud, and there are various ways to identify this problem. In order to identify this problem, the majori...

Data Mining: A Prediction for Performance Improvement of Engineering Students using Classification

Nowadays the amount of data stored in educational databases is increasing rapidly. These databases contain hidden information for improving students’ performance. Educational data mining is used to study the data available in the educational field and bring out the hidden knowledge in it. Classification methods such as decision trees, Bayesian networks, etc. can be applied on the educational dat...

Journal title: African Journal of Computing & ICT

Volume 8, Issue 1

Pages 103-112

Publication date: 2015