Model-Free Quantum Control with Reinforcement Learning

نویسندگان

چکیده

Model bias is an inherent limitation of the current dominant approach to optimal quantum control, which relies on a system simulation for optimization control policies. To overcome this limitation, we propose circuit-based training reinforcement learning agent tasks in model-free way. Given continuously parameterized circuit, learns its parameters through trial-and-error interaction with system, using measurement outcomes as only source information about state. Focusing harmonic oscillator coupled ancilla qubit, show how reward measurements experimentally available observables. We train prepare various non-classical states both unitary and adaptive measurement-based feedback, execute logical gates encoded qubits. This significantly outperforms widely used methods terms sample efficiency. Our numerical work immediate relevance superconducting circuits trapped ions platforms where such can be implemented experiment, allowing complete elimination model adaptation policies specific they are deployed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reinforcement Learning: Model-free

Simply put, reinforcement learning (RL) is a term used to indicate a large family of dierent algorithms RL that all share two key properties. First, the objective of RL is to learn appropriate behavior through trialand-error experience in a task. Second, in RL, the feedback available to the learning agent is restricted to a reward signal that indicates how well the agent is behaving, but does ...

متن کامل

Multitask model-free reinforcement learning

Conventional model-free reinforcement learning algorithms are limited to performing only one task, such as navigating to a single goal location in a maze, or reaching one goal state in the Tower of Hanoi block manipulation problem. It has been thought that only model-based algorithms could perform goal-directed actions, optimally adapting to new reward structures in the environment. In this wor...

متن کامل

Depth Control of Model-Free AUVs via Reinforcement Learning

In this paper, we consider depth control problems of an autonomous underwater vehicle (AUV) for tracking the desired depth trajectories. Due to the unknown dynamical model of the AUV, the problems cannot be solved by most of modelbased controllers. To this purpose, we formulate the depth control problems of the AUV as continuous-state, continuous-action Markov decision processes (MDPs) under un...

متن کامل

Shaping Model-Free Reinforcement Learning with Model- Based Pseudorewards

Model-free and model-based reinforcement learning have provided a successful framework for understanding both human behavior and neural data. These two systems are usually thought to compete for control of behavior. However, it has also been proposed that they can be integrated in a cooperative manner. For example, the Dyna algorithm uses model-based replay of past experience to train the model...

متن کامل

Learning Epistemic Actions in Model-Free Memory-Free Reinforcement Learning: Experiments with a Neuro-robotic Model

Passive sensory processing is often insufficient to guide biological organisms in complex environments. Rather, behaviourally relevant information can be accessed by performing so-called epistemic actions that explicitly aim at unveiling hidden information. However, it is still unclear how an autonomous agent can learn epistemic actions and how it can use them adaptively. In this work, we propo...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Physical Review X

سال: 2022

ISSN: ['2160-3308']

DOI: https://doi.org/10.1103/physrevx.12.011059