Similar resources
16-899C ACRL Tetris Reinforcement Learner
Our approach to this problem was to use reinforcement learning with a function approximator to approximate the state value function [RSS98]. In our case, a +1 reward was given for every completed line, so that the value function would encode the long-term number of lines that the algorithm is going to complete. In order to achieve this, we extract features from the game state, and use gr...
National Congenital Rubella Surveillance Programme 1 July 1971-30 June 1984.
This is the fourth report of a series on the surveillance of congenital rubella by the National Congenital Rubella Surveillance Programme using methods previously described in the BMJ.1-3 We present data on 763 children (including nine co-twins, that is, nine pairs) classified by the Northern and Southern Registries of the National Congenital Rubella Surveillance Programme as cases of confirmed...
June 1984 LIDS-P-1387 DISCRETE-TIME PRIORITY QUEUES WITH PARTIAL INTERFERENCE
A class of discrete-time priority queueing systems with partial interference is considered. Packet-radio communication networks that use a certain mode of operation fall into this class. In these systems, N nodes share a common channel to transmit their packets. One node uses a random access scheme while the other nodes use the channel according to prescribed priorities. Packet arrivals are modele...
Journal
Journal title: College & Research Libraries News
Year: 1984
ISSN: 2150-6698,0099-0086
DOI: 10.5860/crln.45.8.393