Entropy based C4.5-SHO algorithm with information gain optimization in data mining
نویسندگان
چکیده
منابع مشابه
On Entropy-Based Data Mining
In the real world, we are confronted not only with complex and high-dimensional data sets, but usually with noisy, incomplete and uncertain data, where the application of traditional methods of knowledge discovery and data mining always entail the danger of modeling artifacts. Originally, information entropy was introduced by Shannon (1949), as a measure of uncertainty in the data. But up to th...
متن کاملA New Algorithm for Optimization of Fuzzy Decision Tree in Data Mining
Decision-tree algorithms provide one of the most popular methodologies for symbolic knowledge acquisition. The resulting knowledge, a symbolic decision tree along with a simple inference mechanism, has been praised for comprehensibility. The most comprehensible decision trees have been designed for perfect symbolic data. Classical crisp decision trees (DT) are widely applied to classification t...
متن کاملAn Information Entropy-Based Animal Migration Optimization Algorithm for Data Clustering
Data clustering is useful in a wide range of application areas. The Animal Migration Optimization (AMO) algorithm is one of the recently introduced swarm-based algorithms, which has demonstrated good performances for solving numeric optimization problems. In this paper, we presented a modified AMO algorithm with an entropy-based heuristic strategy for data clustering. The main contribution is t...
متن کاملOn Multiplicative Entropy and Information gain in Large Data Sets
Information theory is one of the widely used branches of applied probability theory. When probability is used to describe the state of a system implies that the state has some uncertainty. Some probability distributions indicate more uncertainty than others as they are not created equal. We can come up with some mathematical entity which returns a measure of uncertainty after taking a probabili...
متن کاملOptimization-based Data Mining Techniques with Applications
Uncontrolled epilepsy poses a significant burden to society due to associated healthcare cost to treat and control the unpredictable and spontaneous occurrence of seizures. The main objective of this paper is to develop and apply novel optimization-based data mining approaches to the study of brain physiology, which might be able to revolutionize current diagnosis and treatment of epilepsy. Thr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: PeerJ Computer Science
سال: 2021
ISSN: 2376-5992
DOI: 10.7717/peerj-cs.424