Recently, multi-hop reasoning over incomplete Knowledge Graphs (KGs) has attracted wide attention due to its desirable interpretability for downstream tasks, such as question answer and knowledge graph completion. Multi-Hop is a typical sequential decision problem, which can be formulated Markov process (MDP). Subsequently, some reinforcement learning (RL) based approaches are proposed proven e...