With the emergence of large pre-trained vison-language model like CLIP, transferable representations can be adapted to a wide range downstream tasks via prompt tuning. Prompt tuning tries probe beneficial information for from general knowledge stored in model. A recently proposed method named Context Optimization (CoOp) introduces set learnable vectors as text language side. However, alone only...