Abstract For practical reasons, reinforcement learning has proven difficult to apply outside of simulation, in physical experiments. Here we derive an optional approach to model-free reinforcement learning, achieved entirely online, through careful experimental design and algorithmic decision making. We design a scheme to implement traditionally episodic algorithms for unstable 1-dimensional mechanical env...