Estimating Joint Probabilities from Marginal Ones
نویسندگان
چکیده
Estimating joint probabilities plays an important role in many data mining and machine learning tasks. In this paper we introduce two methods, minAB and prodAB, to estimate joint probabilities. Both methods are based on a light-weight structure, partition support . The core idea is to maintain the partition support of itemsets over logically disjoint partitions and then use it to estimate joint probabilities of itemsets of higher cardinalitiess. We present extensive mathematical analyses on both methods and compare their performances on synthetic datasets. We also demonstrate a case study of using the estimation methods in Apriori algorithm for fast association mining. Moreover, we explore the usefulness of the estimation methods in other mining/learning tasks [9]. Experimental results show the effectiveness of the estimation methods.
منابع مشابه
Second-order estimating equations for the analysis of clustered current status data.
With clustered event time data, interest most often lies in marginal features such as quantiles or probabilities from the marginal event time distribution or covariate effects on marginal hazard functions. Copula models offer a convenient framework for modeling. We present methods of estimating the baseline marginal distributions, covariate effects, and association parameters for clustered curr...
متن کاملIndependence and 2-Monotonicity: Nice to Have, Hard to Keep
When using lower probabilities to model uncertainty about the value assumed by a variable, 2-monotonicity is an interesting property to satisfy, as it greatly facilitates further treatments (such as the computation of lower/upper expectation bounds). In this paper, we show that multivariate joint models induced from marginal ones by strong independence, epistemic independence or epistemic irrel...
متن کاملEstimating predicted probabilities from logistic regression: different methods correspond to different target populations.
BACKGROUND We review three common methods to estimate predicted probabilities following confounder-adjusted logistic regression: marginal standardization (predicted probabilities summed to a weighted average reflecting the confounder distribution in the target population); prediction at the modes (conditional predicted probabilities calculated by setting each confounder to its modal value); and...
متن کاملEstimating Probability Distributions over Hypotheses with Variable Unification
We analyze the difficulties in applying Bayesian belief networks to language interpretation domains, which typically involve many unification hypotheses that posit variable bindings. As an alternative, we observe that the structure of the underlying hypothesis space permits an approximate encoding of the joint distribution based on marginal rather than conditional probabilities. This suggests a...
متن کاملDoF Analysis of the K-user MISO Broadcast Channel with Alternating CSIT
We consider a K-user multiple-input single-output (MISO) broadcast channel (BC) where the channel state information (CSI) of user i(i = 1, 2, . . . ,K) may be either perfect (P), delayed (D) or not known (N) at the transmitter with probabilities λP , λ i D and λ i N , respectively. In this channel, according to the three possible CSIT for each user, joint CSIT of the K users could have at most ...
متن کامل