Induction of Selective Bayesian Networks from Data
Abstract
Bayesian networks (Pearl 1988), which provide a compact graphical way to express complex probabilistic relationships among several random variables, are rapidly becoming the tool of choice for dealing with uncertainty in knowledge-based systems. Among the many advantages Bayesian networks offer over other representations such as decision trees and neural networks are their comprehensibility to humans, their effectiveness as complex decision-making models, and the elicitability of informative prior distributions. However, approaches based on Bayesian networks have often been dismissed as unfit for many real-world applications because they are difficult to construct and because probabilistic inference is intractable for most problems of realistic size. Given the increasing availability of large amounts of data in most domains, learning Bayesian networks from data can circumvent the first problem. This research deals primarily with the second problem. We address it by learning selective Bayesian networks, a variant of the Bayesian network that uses only a subset of the given attributes to model a domain. Our aim is to learn networks that are smaller, and hence computationally simpler to evaluate, but display accuracy comparable to that of networks induced using all attributes. We have developed two methods for inducing selective Bayesian networks from data. The first method, K2-AS (Singh & Provan 1995), selects a subset of attributes that maximizes predictive accuracy prior to the network learning phase. The idea behind this approach is that attributes which have little or no influence on the accuracy of learned networks can be discarded without significantly affecting their performance. The second method, Info-AS (Singh & Provan 1996), uses information-theoretic metrics to efficiently select a subset of attributes from which to learn the classifier. The aim is to discard those attributes which can give us little or no information about the class variable, given the other attributes in the network. We have shown that, relative to networks learned using all attributes, networks learned by both K2-AS and Info-AS are significantly smaller and computationally simpler to evaluate but display comparable predictive accuracy. More-
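The idea of discarding attributes that carry little information about the class variable, given the other attributes, can be illustrated with a small sketch. This is not the authors' algorithm: it is a generic greedy filter based on conditional mutual information, and the function names, the `threshold` parameter, and the toy data are all assumptions made for illustration.

```python
from collections import Counter
from math import log2


def conditional_mutual_information(xs, cs, zs):
    """Estimate I(X; C | Z) in bits from three parallel sequences of discrete values."""
    n = len(xs)
    n_xcz = Counter(zip(xs, cs, zs))
    n_xz = Counter(zip(xs, zs))
    n_cz = Counter(zip(cs, zs))
    n_z = Counter(zs)
    total = 0.0
    for (x, c, z), count in n_xcz.items():
        # p(x,c,z) * log2[ p(x,c|z) / (p(x|z) * p(c|z)) ], expressed with raw counts
        ratio = count * n_z[z] / (n_xz[(x, z)] * n_cz[(c, z)])
        total += (count / n) * log2(ratio)
    return total


def select_attributes(rows, class_values, threshold=0.01):
    """Greedily keep attributes that add information about the class
    beyond what the already-selected attributes provide (hypothetical helper)."""
    selected = []
    remaining = list(range(len(rows[0])))
    while remaining:
        # The joint value of the selected attributes is the conditioning variable.
        context = [tuple(row[j] for j in selected) for row in rows]
        gains = {
            j: conditional_mutual_information(
                [row[j] for row in rows], class_values, context
            )
            for j in remaining
        }
        best = max(gains, key=gains.get)
        if gains[best] <= threshold:
            break  # no remaining attribute is informative enough
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy data (made up): attribute 0 determines the class, attribute 1 duplicates
# attribute 0, and attribute 2 is independent of the class.
rows = [(0, 0, 0), (0, 0, 1), (1, 1, 0), (1, 1, 1)]
classes = [0, 0, 1, 1]
print(select_attributes(rows, classes))  # [0]
```

In this toy example the duplicate attribute is dropped because it adds nothing once attribute 0 is known, and the independent attribute is dropped because it never informs the class. Conditioning on the joint value of all selected attributes becomes unreliable as the selected set grows relative to the sample size, so a practical method would need a more careful estimate.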
Similar resources
The modeling of body's immune system using Bayesian Networks
In this paper, urinary infection, a common symptom of the decline of the immune system, is discussed based on well-known machine learning algorithms, such as Bayesian networks in both Markov and tree structures. Large-scale sampling has been carried out to evaluate the performance of the Bayesian network algorithm. A total of 4052 samples were obtained from the database of the Tak...
A Comparison of Induction Algorithms for Selective and non-Selective Bayesian Classi
In this paper we present a novel induction algorithm for Bayesian networks. This selective Bayesian network classifier selects a subset of attributes that maximizes predictive accuracy prior to the network learning phase, thereby learning Bayesian networks with a bias for small, high-predictive-accuracy networks. We compare the performance of this classifier with selective and non-selective naive...
An Introduction to Inference and Learning in Bayesian Networks
Bayesian networks (BNs) are modern tools for modeling phenomena in dynamic and static systems and are used in different subjects such as disease diagnosis, weather forecasting, decision making and clustering. A BN is a graphical-probabilistic model which represents causal relations among random variables and consists of a directed acyclic graph and a set of conditional probabilities. Structure...
Induction of Selective Bayesian Network Classifiers
We present an algorithm for inducing Bayesian networks using feature selection. The algorithm selects a subset of attributes that maximizes predictive accuracy prior to the network learning phase, thereby incorporating a bias for small networks that retain high predictive accuracy. We compare the behavior of this selective Bayesian network classifier with that of (a) Bayesian network classifiers ...
Estimation of Products Final Price Using Bayesian Analysis Generalized Poisson Model and Artificial Neural Networks
Estimating the final price of products is of great importance. For manufacturing companies, proposing a final price is only possible after the design process is over. These companies propose an approximate initial price of the required products to the customers, which requires some time and money. Here using the existing data of already designed transformers and utilizing the Bayesian anal...