Probabilistic Reasoning for Large Scale Databases
نویسندگان
چکیده
The complexity of probabilistic reasoning prohibits its application on a large scale of data. In order to reduce the complexity, implementations of modeling approaches restrict themselves with respect to expressive power or relax on the underlying probability theory. We present the implementation aspects of a probabilistic extension of stratified Datalog. This probabilistic deductive system is strictly based on the well-founded ground of probability theory. The prototypical implementation of the system handles the expensive computation of the probabilities separately from the reasoning process itself. Thus, we can use standard optimization strategies known from deterministic systems in order to cope with large amounts of data. By adding probabilistic reasoning to a deductive database system we gain the possibility of describing the information retrieval task as computing the probabilityP (d! q), i. e. the probability of the inference between a document d and a query q. Therefore, the logical view on databases plus a probabilistic generalization of the data model is a promising candidate for a breakthrough in integrating database and information retrieval technology on the way to multimedia information systems.
منابع مشابه
Open-World Probabilistic Databases
Large-scale probabilistic knowledge bases are becoming increasingly important in academia and industry alike. They are constantly extended with new data, powered by modern information extraction tools that associate probabilities with database tuples. In this paper, we revisit the semantics underlying such systems. In particular, the closed-world assumption of probabilistic databases, that fact...
متن کاملLPKP: location-based probabilistic key pre-distribution scheme for large-scale wireless sensor networks using graph coloring
Communication security of wireless sensor networks is achieved using cryptographic keys assigned to the nodes. Due to resource constraints in such networks, random key pre-distribution schemes are of high interest. Although in most of these schemes no location information is considered, there are scenarios that location information can be obtained by nodes after their deployment. In this paper,...
متن کاملReasoning with Probabilities , Time And
Over the years probability theory has emerged as the leading way of dealing with uncertainty in AI and Databases. However, application of probabilistic methods to diierent frameworks has always been a problem as the two main goals of such an application | eeciency and correctness | are often at odds with each other. This is so because diierent dependencies between the events considered cause di...
متن کاملGraphical Models for Uncertain Data
Graphical models are a popular and well-studied framework for compact representation of a joint probability distribution over a large number of interdependent variables, and for efficient reasoning about such a distribution. They have been proven useful in a wide range of domains from natural language processing to computer vision to bioinformatics. In this chapter, we present an approach to us...
متن کاملChapter 4 GRAPHICAL MODELS FOR UNCERTAIN DATA
Graphical models are a popular and well-studied framework for compact representation of a joint probability distribution over a large number of interdependent variables, and for efficient reasoning about such a distribution. They have been proven useful in a wide range of domains from natural language processing to computer vision to bioinformatics. In this chapter, we present an approach to us...
متن کامل