Reinforcement learning (RL) can obtain the supervisory controller for discrete-event systems modeled by finite automata and temporal logic. The published methods often have two limitations. First, a large number of training data are required to learn RL controller. Second, algorithms do not consider uncontrollable events, which essential control theory (SCT). To address limitations, we first ap...