Adaptive Look-Ahead Planning
نویسندگان
چکیده
We present a new adaptive connectionist planning method. By interaction with an environment a world model is progressively constructed using the backpropagation learning algorithm. The planner constructs a look-ahead plan by iteratively using this model to predict future reinforcements. Future reinforcement is maximized to derive suboptimal plans, thus determining good actions directly from the knowledge of the model network (strategic level). This is done by gradient descent in action space. The problem of nding good initial plans is solved by the use of an \experience" network (intuition level). The appropriateness of this planning method for nding suboptimal actions in unknown environments is demonstrated with a target tracking problem.
منابع مشابه
Diagnosis and Repair Iteration Planning versus N-Step Look Ahead Planning
In this paper we compare the performance of planning algorithms distinguishing and iterating between observation (diagnosis) and repair (action) phases with algorithms extending conventional planning methods with observations using n-step look ahead. Diagnosis and repair iteration planning algorithms are an extension of earlier work of Friedrich and Nejdl, n-step look ahead planning including d...
متن کاملPipelined adaptive IIR filter architectures using scattered and relaxed look-ahead transformations
Fine-grain pipelined architectures for adaptive infinite impulse response (AIIR) filters are presented in this paper. The AIIR filters are equation error based. The proposed architectures are developed by employing a combination of scattered look-ahead and relaxed look-ahead pipelining techniques. First, a pipelined system identification scenario is developed. Then, the scattered look-ahead tec...
متن کاملEffect of driving experience on anticipatory look-ahead fixations in real curve driving.
Anticipatory skills are a potential factor for novice drivers' curve accidents. Behavioural data show that steering and speed regulation are affected by forward planning of the trajectory. When approaching a curve, the relevant visual information for online steering control and for planning is located at different eccentricities, creating a need to disengage the gaze from the guidance of steeri...
متن کاملAnnihilation-reordering look-ahead pipelined CORDIC-based RLS adaptive filters and their application to adaptive beamforming
The novel annihilation-reordering look-ahead technique is proposed as an attractive technique for pipelining of Givens rotation (or CORDIC) based adaptive lters. Unlike the existing relaxed look-ahead, the annihilation-reordering look-ahead does not depend on the statistical properties of the input samples. It is an exact look-ahead and based on CORDIC arithmetic, which is known to be numerical...
متن کاملMovement-Based Look-Ahead Traffic-Adaptive Intersection Control ⋆
There exist several control approaches for traffic signal control such as fixed-time, vehicle-actuated, or look-ahead traffic-adaptive control. We argue that in order to flexibly deal with varying demand levels movement-based control (which is already common in vehicleactuated intersection control) is required instead of stage-based control (which is still employed in the state-of-the-art in lo...
متن کامل