First Layer : Expanded Input Representation Final Output

نویسندگان

  • Richard S. Sutton
  • Steven D. Whitehead
چکیده

We consider the requirements of online learning|learning which must be done in-crementally and in realtime, with the results of learning available soon after each new example is acquired. Despite the abundance of methods for learning from examples, there are few that can be used eeectively for on-line learning, e.g., as components of reinforcement learning systems. Most of these few, including radial basis functions, CMACs, Ko-honen's self-organizing maps, and those developed in this paper, share the same structure. All expand the original input representation into a higher dimensional representation in an unsupervised way, and then map that representation to the nal answer using a relatively simple supervised learner, such as a perceptron or LMS rule. Such structures learn very rapidly and reliably, but have been thought either to scale poorly or to require extensive domain knowledge. have argued that the expanded representation can be chosen largely at random with good results. The main contribution of this paper is to develop and test this hypothesis. We show that simple random-representation methods can perform as well as nearest-neighbor methods (while being more suited to online learning), and signiicantly better than backprop-agation. We nd that the size of the random representation does increase with the dimensionality of the problem, but not unreasonably so, and that the required size can be reduced substantially using unsupervised-learning techniques. Our results suggest that randomness has a useful role to play in on-line supervised learning and constructive induction. 1 Online Learning Applications of supervised learning can be divided into two types: online and ooine. By ooine supervised learning we mean the classical task in machine learning and statistics: a set of examples is obtained and used to learn a good approximating function before the function is used in the application. In online learning, on the other hand, data gathered during the normal operation of the system is used to continually adapt the learned function. Online learning has several advantages over ooine learning. First, online learning is potentially more robust because errors or omissions in the training set can be corrected during operation. Second, training data can often be generated easily and in great quantities when a system is in operation, whereas it is usually scarce and precious before. Being able to use this data sometimes puts online learning at a great advantage ; for example, this was probably the most important reason for the success of Tesauro's (1992) champion …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prediction of breeding values for the milk production trait in Iranian Holstein cows applying artificial neural networks

The artificial neural networks, the learning algorithms and mathematical models mimicking the information processing ability of human brain can be used non-linear and complex data. The aim of this study was to predict the breeding values for milk production trait in Iranian Holstein cows applying artificial neural networks. Data on 35167 Iranian Holstein cows recorded between 1998 to 2009 were ...

متن کامل

Exponential Capacity in an Autoencoder Neural Network with a Hidden Layer

A fundamental aspect of limitations in learning any computation in neural architectures is characterizing their optimal capacities. An important, widely-used neural architecture is known as autoencoders where the network reconstructs the input at the output layer via a representation at a hidden layer. Even though capacities of several neural architectures have been addressed using statistical ...

متن کامل

The effect of structural changes in higher education sector on regional output (Case study: Sistan and Baluchestan Province)

Abstract The aim of this study is of the effect of structural changes in higher education on changes of output in Sistan and Baluchestan Province using structural decomposition analysis (SDA). The input-output tables of this region for the period 2006-2011 have been employed as the database of the model. The structural changes were decomposed into two factors: changes in share of specific sect...

متن کامل

Size-depth-alternation tradeoffs for circuits

A Boolean circuit is a directed acyclic graph with some designated input gates of fan-in zero and one designated output gate of fan-out zero in which all non-input nodes are labeled with or, and, or not. All or and and gates have fan-in two, and all not gates fan-in one. We assume that the gates of a Boolean circuit are arranged in layers; each layer consists of gates whose inputs come only fro...

متن کامل

خوشه‌بندی اسناد مبتنی بر آنتولوژی و رویکرد فازی

Data mining, also known as knowledge discovery in database, is the process to discover unknown knowledge from a large amount of data. Text mining is to apply data mining techniques to extract knowledge from unstructured text. Text clustering is one of important techniques of text mining, which is the unsupervised classification of similar documents into different groups. The most important step...

متن کامل

Early detection of MS in fMRI images using deep learning techniques

Introduction & Objective:MS is a disease of the central nervous system in which the body makes a defensive attack on its tissues. The disease can affect the brain and spinal cord, causing a wide range of potential symptoms, including balance, movement and vision problems. MRI and fMRI images are a very important tool in the diagnosis and treatment of MS. The aim of this study was to provide...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1993