Model-Free Quantum Control with Reinforcement Learning
نویسندگان
چکیده
Model bias is an inherent limitation of the current dominant approach to optimal quantum control, which relies on a system simulation for optimization control policies. To overcome this limitation, we propose circuit-based training reinforcement learning agent tasks in model-free way. Given continuously parameterized circuit, learns its parameters through trial-and-error interaction with system, using measurement outcomes as only source information about state. Focusing harmonic oscillator coupled ancilla qubit, show how reward measurements experimentally available observables. We train prepare various non-classical states both unitary and adaptive measurement-based feedback, execute logical gates encoded qubits. This significantly outperforms widely used methods terms sample efficiency. Our numerical work immediate relevance superconducting circuits trapped ions platforms where such can be implemented experiment, allowing complete elimination model adaptation policies specific they are deployed.
منابع مشابه
Reinforcement Learning: Model-free
Simply put, reinforcement learning (RL) is a term used to indicate a large family of dierent algorithms RL that all share two key properties. First, the objective of RL is to learn appropriate behavior through trialand-error experience in a task. Second, in RL, the feedback available to the learning agent is restricted to a reward signal that indicates how well the agent is behaving, but does ...
متن کاملMultitask model-free reinforcement learning
Conventional model-free reinforcement learning algorithms are limited to performing only one task, such as navigating to a single goal location in a maze, or reaching one goal state in the Tower of Hanoi block manipulation problem. It has been thought that only model-based algorithms could perform goal-directed actions, optimally adapting to new reward structures in the environment. In this wor...
متن کاملDepth Control of Model-Free AUVs via Reinforcement Learning
In this paper, we consider depth control problems of an autonomous underwater vehicle (AUV) for tracking the desired depth trajectories. Due to the unknown dynamical model of the AUV, the problems cannot be solved by most of modelbased controllers. To this purpose, we formulate the depth control problems of the AUV as continuous-state, continuous-action Markov decision processes (MDPs) under un...
متن کاملShaping Model-Free Reinforcement Learning with Model- Based Pseudorewards
Model-free and model-based reinforcement learning have provided a successful framework for understanding both human behavior and neural data. These two systems are usually thought to compete for control of behavior. However, it has also been proposed that they can be integrated in a cooperative manner. For example, the Dyna algorithm uses model-based replay of past experience to train the model...
متن کاملLearning Epistemic Actions in Model-Free Memory-Free Reinforcement Learning: Experiments with a Neuro-robotic Model
Passive sensory processing is often insufficient to guide biological organisms in complex environments. Rather, behaviourally relevant information can be accessed by performing so-called epistemic actions that explicitly aim at unveiling hidden information. However, it is still unclear how an autonomous agent can learn epistemic actions and how it can use them adaptively. In this work, we propo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Physical Review X
سال: 2022
ISSN: ['2160-3308']
DOI: https://doi.org/10.1103/physrevx.12.011059