Model-based Volt/VAR optimization is widely used to eliminate voltage violations and reduce network losses. However, the parameters of active distribution networks (ADNs) are not identified onsite, so the network model may contain significant errors that make model-based methods infeasible. To cope with this critical issue, we propose a novel two-stage deep reinforcement learning (DRL) method to improve the voltage profile by re...