In this paper, we consider the contextual variant of MNL-Bandit problem. More specifically, a dynamic set optimization problem, where decision-maker offers subset (assortment) products to consumer and observes response in every round. Consumers purchase maximize their utility. We assume that attributes describe products, mean utility product is linear values these attributes. model choice behav...