In this paper, we study the well-known team orienteering problem where a fleet of robots collects rewards by visiting locations. Usually, the rewards are assumed to be known to the robots; however, in applications such as environmental monitoring or scene reconstruction, the rewards are often subjective and specifying them is challenging. We propose a framework to learn the unknown preferences of the user by presenting alternative solutions to them, ...