Exploration of combining ESN learning with gradient-descent RNN learning techniques

نویسندگان

  • Dumitru Erhan
  • Herbert Jaeger
چکیده

Modeling dynamical systems via neural networks has become a well-established field of research in Computer Science. However, for more complex dynamical systems the current algorithms are often impractical or not accurate enough. Recurrent Neural Networks (RNNs), when coupled with the Echo-State Network (ESN) approach, offer an efficient way of solving such kind of problems – by concentrating on the behavior that can be observed from the outside and modeling it, while ignoring the underlying processes. This biologically inspired method produces very good results for a variety of problems and, in many cases, out-beats other algorithms' results by orders of magnitude. However, for certain scenarios, the ESN algorithm requires large networks in order to operate at its full potential. This is not desirable in those situations where the physical or computational requirements limit the size of network to be used (such as the case of micro-controllers and embedded devices). The aim of this project was to provide a remedy to this problem by employing a " post-processing " technique on the solutions generated by the ESN algorithm. This technique takes the form of a traditional algorithm for learning with RNNs. While these algorithms are normally slow and tend to produce worse results, they require small networks in order to operate properly. This way we found a compromise solution that helped us show that it is possible to employ the ESN algorithm even on small networks and still obtain good results by means of " post-processing " them with other, more traditional, algorithms.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning Input and Recurrent Weight Matrices in Echo State Networks

The traditional echo state network (ESN) is a special type of a temporally deep model, the recurrent network (RNN), which carefully designs the recurrent matrix and fixes both the recurrent and input matrices in the RNN. The ESN also adopts the linear output (or readout) units to simplify the leanring of the only output matrix in the RNN. In this paper, we devise a special technique that takes ...

متن کامل

On Optimization Algorithms for Recurrent Networks with Long Short-Term Memory

Motivation and contribution Recurrent neural networks (RNN) with long short-term memory (LSTM) are recently proposed to model sequences without prior domain knowledge [3, 6]. In these work, the authors empirically observed that RNN-LSTMs trained with vanilla optimization algorithms, such as stochastic gradient descent (SGD) with a simple learning rate annealing schedule, achieves good performan...

متن کامل

Spiral Recurrent Neural Network for Online Learning

Autonomous, self* sensor networks require sensor nodes with a certain degree of “intelligence”. An elementary component of such an “intelligence” is the ability to learn online predicting sensor values. We consider recurrent neural network (RNN) models trained with an extended Kalman filter algorithm based on real time recurrent learning (RTRL) with teacher forcing. We compared the performance ...

متن کامل

Nonnegative Least Squares Learning for the Random Neural Network

In this paper, a novel supervised batch learning algorithm for the Random Neural Network (RNN) is proposed. The RNN equations associated with training are purposively approximated to obtain a linear Nonnegative Least Squares (NNLS) problem that is strictly convex and can be solved to optimality. Following a review of selected algorithms, a simple and efficient approach is employed after being i...

متن کامل

Forecasting GDP Growth Using ANN Model with Genetic Algorithm

Applying nonlinear models to estimation and forecasting economic models are now becoming more common, thanks to advances in computing technology. Artificial Neural Networks (ANN) models, which are nonlinear local optimizer models, have proven successful in forecasting economic variables. Most ANN models applied in Economics use the gradient descent method as their learning algorithm. However, t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004