Constructing Skill Trees for Reinforcement Learning Agents from Demonstration Trajectories
Abstract
We introduce CST, an algorithm for constructing skill trees from demonstration trajectories in continuous reinforcement learning domains. CST uses a changepoint detection method to segment each trajectory into a skill chain, placing a boundary wherever it detects either a change in the appropriate abstraction or a segment that is too complex to model as a single skill. The skill chains from each trajectory are then merged to form a skill tree. We demonstrate that CST constructs an appropriate skill tree that can be further refined through learning in a challenging continuous domain, and that it can be used to segment demonstration trajectories on a mobile manipulator into chains of skills where each skill is assigned an appropriate abstraction.
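The segmentation idea in the abstract can be illustrated with a much simpler stand-in. The sketch below is not CST's changepoint detection method: it is an offline dynamic program that splits a one-dimensional signal into pieces that are each well described by a straight line, charging a fixed penalty per segment so that a piece is split only when a single model fits it poorly. The function names, the linear segment model, and the `min_len` and `penalty` parameters are all assumptions made for illustration.

```python
import numpy as np


def linear_fit_cost(t, y):
    """Sum of squared residuals of a least-squares line fitted to (t, y)."""
    A = np.column_stack([t, np.ones_like(t)])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    return float(resid @ resid)


def segment_trajectory(y, min_len=3, penalty=1.0):
    """Split y into segments, each modelled by one line (hypothetical helper).

    Minimises total squared fit error plus `penalty` per segment, searching
    over all partitions whose segments have at least `min_len` samples.
    Returns a list of half-open (start, end) index pairs.
    """
    y = np.asarray(y, dtype=float)
    n = len(y)
    t = np.arange(n, dtype=float)
    best = np.full(n + 1, np.inf)  # best[e]: minimal cost of segmenting y[:e]
    best[0] = 0.0
    prev = np.zeros(n + 1, dtype=int)  # prev[e]: start of the last segment
    for end in range(min_len, n + 1):
        for start in range(0, end - min_len + 1):
            if not np.isfinite(best[start]):
                continue
            cost = best[start] + linear_fit_cost(t[start:end], y[start:end]) + penalty
            if cost < best[end]:
                best[end] = cost
                prev[end] = start
    # Walk back from the end of the signal to recover the segment boundaries.
    bounds = []
    end = n
    while end > 0:
        start = int(prev[end])
        bounds.append((start, end))
        end = start
    return bounds[::-1]


# A rising ramp followed by a falling ramp, with a jump between them.
demo = [float(i) for i in range(10)] + [17.0 - i for i in range(10, 20)]
chain = segment_trajectory(demo)
```

Each returned pair plays the role of one link in a chain; in this toy setting `penalty` controls how readily a segment is judged too complex for a single model. Merging chains from several trajectories into a tree is not covered by this sketch.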
Similar resources
Robot learning from demonstration by constructing skill trees
We describe CST, an online algorithm for constructing skill trees from demonstration trajectories. CST segments a demonstration trajectory into a chain of component skills, where each skill has a goal and is assigned a suitable abstraction from an abstraction library. These properties permit skills to be improved efficiently using a policy learning algorithm. Chains from multiple demonstration ...
Implementing Cst in Learning Layer of Csia for Higher Level of Intelligence
Development of a cognitive architecture in which agents at different levels exhibit different levels of thinking. The paper primarily focuses on building the skill tree at the learning layer of the architecture. These include the discovery of one’s own body, including its structure and dynamics, as well as the acquisition of associated cognitive skills such as self and non-self distinction. This can be ...
CST: Constructing Skill Trees by Demonstration
We describe recent work on CST, an online algorithm for constructing skill trees from demonstration trajectories. CST segments a demonstration trajectory into a chain of component skills, where each skill has a goal and is assigned a suitable abstraction from an abstraction library. These properties permit skills to be improved efficiently using a policy learning algorithm. Chains from multiple...
Learning from a Single Demonstration: Motion Planning with Skill Segmentation
We propose an approach to control learning from demonstration that first segments demonstration trajectories to identify subgoals, then uses model-based control methods to sequentially reach these subgoals to solve the overall task. Using this approach, we show that a mobile robot is able to solve a combined navigation and manipulation task robustly after observing only a single successful traj...
Towards Robust Skill Generalization: Unifying Learning from Demonstration and Motion Planning
In this paper, we present Combined Learning from demonstration And Motion Planning (CLAMP) as an efficient approach to skill learning and generalizable skill reproduction. CLAMP combines the strengths of Learning from Demonstration (LfD) and motion planning into a unifying framework. We carry out probabilistic inference to find trajectories which are optimal with respect to a given skill and al...