We use a deep neural network to generate controllers for optimal trading on high-frequency data. For the first time, learns mapping between preferences of trader, i.e. risk aversion parameters, and controls. An important challenge in learning this is that, intra-day trading, trader's actions influence price dynamics closed loop via market impact. The exploration–exploitation tradeoff generated ...