Abstract We study controllable text summarization, which allows users to gain control on a particular attribute (e.g., length limit) of the generated summaries. In this work, we propose novel training framework based Constrained Markov Decision Process (CMDP), conveniently includes reward function along with set constraints, facilitate better summarization control. The encourages generation res...