Learning Hierarchical Riffle Independent Groupings from Rankings
نویسندگان
چکیده
Ri ed independence is a generalized notion of probabilistic independence that has been shown to be naturally applicable to ranked data. In the ri ed independence model, one assigns rankings to two disjoint sets of items independently, then in a second stage, interleaves (or ri es) the two rankings together to form a full ranking, as if by shu ing a deck of cards. Because of this interleaving stage, it is much more di cult to detect ri ed independence than ordinary independence. In this paper, we provide the rst automated method for discovering sets of items which are ri e independent from a training set of rankings. We show that our clustering-like algorithms can be used to discover meaningful latent coalitions from real preference ranking datasets and to learn the structure of hierarchically decomposable models based on ri ed independence.
منابع مشابه
Structure Discovery from Partial Rankings
Aggregating and statistical reasoning with ranked data are tasks that arise in a number of applications from analyzing political elections to modeling user preferences over a set of items. Representing distributions over rankings, however, can be daunting due to the fact that the number of rankings of n items scales factorially. Moreover, it is crucial for probabilistic models over rankings to ...
متن کاملLearning Hierarchical Ri e Independent Groupings from Rankings: Supplemental Material
While Figures 1(a) and 1(b) (on the next page) encode distinct families of distributions, they share a common subset of independence assumptions. It turns out that any distributions consistent with either of the two hierarchies must also be consistent with what we call a 3-way decomposition. We de ne a d-way decomposition to be a distribution with a single level of hierarchy, but instead of par...
متن کاملRiffled Independence for Efficient Inference with Partial Rankings
Distributions over rankings are used to model data in a multitude of real world settings such as preference analysis and political elections. Modeling such distributions presents several computational challenges, however, due to the factorial size of the set of rankings over an item set. Some of these challenges are quite familiar to the artificial intelligence community, such as how to compact...
متن کاملUncovering the Riffled Independence Structure of Rankings
Representing distributions over permutations can be a daunting task due to the fact that the number of permutations of n objects scales factorially in n. One recent way that has been used to reduce storage complexity has been to exploit probabilistic independence, but as we argue, full independence assumptions impose strong sparsity constraints on distributions and are unsuitable for modeling r...
متن کاملUncovering the riffled independence structure of ranked data
Representing distributions over permutations can be a daunting task due to the fact that the number of permutations of n objects scales factorially in n. One recent way that has been used to reduce storage complexity has been to exploit probabilistic independence, but as we argue, full independence assumptions impose strong sparsity constraints on distributions and are unsuitable for modeling r...
متن کامل