Induction and Exploitation of Subgoal Automata for Reinforcement Learning

نویسندگان

چکیده

In this paper we present ISA, an approach for learning and exploiting subgoals in episodic reinforcement (RL) tasks. ISA interleaves with the induction of a subgoal automaton, automaton whose edges are labeled by task’s expressed as propositional logic formulas over set high-level events. A also consists two special states: state indicating successful completion task, that task has finished without succeeding. state-of-the-art inductive programming system is used to learn covers traces events observed RL agent. When currently exploited does not correctly recognize trace, learner induces new trace. The interleaving process guarantees automata minimum number states, applies symmetry breaking mechanism shrink search space whilst remaining complete. We evaluate several gridworld continuous problems using different algorithms leverage structures. provide in-depth empirical analysis performance terms traces, specific restrictions imposed on final learnable automaton. For each class problem, show learned can be successfully policies reach goal, achieving average reward comparable case where but handcrafted given beforehand.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Subgoal Discovery for Hierarchical Reinforcement Learning Using Learned Policies

Reinforcement learning addresses the problem of learning to select actions in order to maximize an agent’s performance in unknown environments. To scale reinforcement learning to complex real-world tasks, agent must be able to discover hierarchical structures within their learning and control systems. This paper presents a method by which a reinforcement learning agent can discover subgoals wit...

متن کامل

Hierarchical Reinforcement Learning Based on Subgoal Discovery and Subpolicy Specialization

A method is described for hierarchical reinforcement learning. High-level policies automatically discover subgoals; low-level policies learn to specialize for different subgoals. Subgoals are represented as desired abstract observations which cluster raw input data. Experiments shows that this method outperforms several flat methods. The full paper can be found in [1]. Introduction. The promise...

متن کامل

development and implementation of an optimized control strategy for induction machine in an electric vehicle

in the area of automotive engineering there is a tendency to more electrification of power train. in this work control of an induction machine for the application of electric vehicle is investigated. through the changing operating point of the machine, adapting the rotor magnetization current seems to be useful to increase the machines efficiency. in the literature there are many approaches wh...

15 صفحه اول

Accelerating Action Dependent Hierarchical Reinforcement Learning Through Autonomous Subgoal Discovery

This paper presents a new method for the autonomous construction of hierarchical action and state representations in reinforcement learning, aimed at accelerating learning and extending the scope of such systems. In this approach, the agent uses information acquired while learning one task to discover subgoals for similar tasks by analyzing the learned policy using Monte Carlo sampling. The age...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Artificial Intelligence Research

سال: 2021

ISSN: ['1076-9757', '1943-5037']

DOI: https://doi.org/10.1613/jair.1.12372