Clustering categorical sequences is currently a difficult problem due to the lack of an efficient representation model for sequences. Unlike the existing models, which mainly focus on the fixed-length tuples representation, in this paper, a new representation model on the variablelength tuples is proposed. The variable-length tuples are obtained using a pruning method applied to delete the redu...