Today’s advanced Reinforcement Learning algorithms produce black-box policies, that are often difficult to interpret and trust for a person. We introduce policy distilling algorithm, building on the CN2 rule mining distills into rule-based decision system. At core of our approach is fact an RL process does not just learn policy, mapping from states actions, but also produces extra meta-informat...