State-of-the-art reinforcement learning algorithms mostly rely on being allowed to directly interact with their environment collect millions of observations. This makes it hard transfer success industrial control problems, where simulations are often very costly or do not exist, and exploring in the real can potentially lead catastrophic events. Recently developed, model-free, offline RL algori...