Rule extraction from recurrent neural networks
نویسنده
چکیده
This thesis investigates rule extraction from recurrent neural networks, which takes the form of automated construction of models of an underlying network. Typically the models are expressed as finite state machines and they should mimic the network while being more intelligible. It is argued that rule extraction allows a deeper and more general form of analysis than other, more or less ad hoc, methods which are typically applied after the training of the recurrent networks. The first part of this thesis reviews and analyses the development of related techniques. The second part presents a novel algorithm, the Crystallizing Substochastic Sequential Machine Extractor (CrySSMEx), which efficiently generates a sequence of increasingly refined stochastic finite state models of an underlying system. Novel features of CrySSMEx include, for example, freedom from parameters, deterministic extraction, a hierarchical vector quantizer, and a stochastic finite state model which can be constructed also when some data is missing. Experiments show that CrySSMEx is, compared to other methods, applicable to a wider range of problems (such as high-dimensional or chaotic dynamic systems). Finally, the field is discussed from a more theoretical perspective in terms of scientific methodology targeted at simulated systems. It is suggested that a rule extractor (or Empirical Machine) can actively select data from the system it is set to model by continuously targeting the weakest point of its currently strongest model. These automated experimenters can, in turn, be made part of a framework (or Popperian Machine) in which theories about populations of systems are generated and tested in order to establish falsifiable statements. These statements should have a high empirical content and thus concisely describe emergent, and previously unknown, properties of the systems.
منابع مشابه
Rethinking Rule Extraction from Recurrent Neural Networks
We will in this paper identify some of the central problems of current techniques for rule extraction from recurrent neural networks (RNN-RE). Then we will raise the expectations of future RNN-RE techniques considerably and through this, hopefully guide the research towards a common goal. Some preliminary results based on work in line with these goals, will also be presented.
متن کاملAn Empirical Evaluation of Rule Extraction from Recurrent Neural Networks
Rule extraction from black-box models is critical in domains that require model validation before implementation, as can be the case in credit scoring and medical diagnosis. Though already a challenging problem in statistical learning in general, the difficulty is even greater when highly non-linear, recursive models, like recurrent neural networks (RNNs), are fit to data. Here, we study the ex...
متن کاملKnowledge Extraction from the Neural ‘Black Box’ in Ecological Monitoring
Phytoplankton biomass within the Saginaw Bay ecosystem (Lake Huron, Michigan, USA) was characterized as a function of select physical/chemical indicators. The complexity and variability of ecological systems typically make it difficult to model the influences of anthropogenic stressors and/or natural disturbances. Here, Artificial Neural Networks (ANNs) were developed to model chlorophyll a con...
متن کاملRule Extraction from Recurrent Neural Networks: A Taxonomy and Review
Rule extraction (RE) from recurrent neural networks (RNNs) refers to finding models of the underlying RNN, typically in the form of finite state machines, that mimic the network to a satisfactory degree while having the advantage of being more transparent. RE from RNNs can be argued to allow a deeper and more profound form of analysis of RNNs than other, more or less ad hoc methods. RE may give...
متن کاملNeuro-Optimizer: A New Artificial Intelligent Optimization Tool and Its Application for Robot Optimal Controller Design
The main objective of this paper is to introduce a new intelligent optimization technique that uses a predictioncorrectionstrategy supported by a recurrent neural network for finding a near optimal solution of a givenobjective function. Recently there have been attempts for using artificial neural networks (ANNs) in optimizationproblems and some types of ANNs such as Hopfield network and Boltzm...
متن کاملRule Extraction from Recurrent Neural Networks using a Symbolic Machine Learning Algorithmy
This paper addresses the extraction of knowledge from recurrent neural networks trained to behave like deterministic nite-state automata (DFAs). To date, methods used to extract knowledge from such networks have relied on the hypothesis that networks states tend to cluster and that clusters of network states correspond to DFA states. The computational complexity of such a cluster analysis has l...
متن کامل