Nested Rollout Policy Adaptation (NRPA) is an approach that uses online policy learning in a nested structure. It has achieved strong results on a variety of difficult combinatorial optimization problems. In this paper, we propose Meta-NRPA, which combines optimal stopping theory with NRPA for warm-starting and significantly improves the performance of NRPA. We also present several exploratory techniques e...