Probabilistic Skylines on Uncertain Data

نویسندگان

  • Jian Pei
  • Bin Jiang
  • Xuemin Lin
  • Yidong Yuan
چکیده

Uncertain data are inherent in some important applications. Although a considerable amount of research has been dedicated to modeling uncertain data and answering some types of queries on uncertain data, how to conduct advanced analysis on uncertain data remains an open problem at large. In this paper, we tackle the problem of skyline analysis on uncertain data. We propose a novel probabilistic skyline model where an uncertain object may take a probability to be in the skyline, and a p-skyline contains all the objects whose skyline probabilities are at least p. Computing probabilistic skylines on large uncertain data sets is challenging. We develop two efficient algorithms. The bottom-up algorithm computes the skyline probabilities of some selected instances of uncertain objects, and uses those instances to prune other instances and uncertain objects effectively. The top-down algorithm recursively partitions the instances of uncertain objects into subsets, and prunes subsets and objects aggressively. Our experimental results on both the real NBA player data set and the benchmark synthetic data sets show that probabilistic skylines are interesting and useful, and our two algorithms are efficient on large data sets, and complementary to each other in performance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Continuous Probabilistic Skyline Queries over Uncertain Data Streams

Recently, some approaches of finding probabilistic skylines on uncertain data have been proposed. In these approaches, a data object is composed of instances, each associated with a probability. The probabilistic skyline is then defined as a set of non-dominated objects with probabilities exceeding or equaling a given threshold. In many applications, data are generated as a form of continuous d...

متن کامل

Identifying Interesting Instances for Probabilistic Skylines

Uncertain data arises from various applications such as sensor networks, scientific data management, data integration, and location based applications. While significant research efforts have been dedicated to modeling, managing and querying uncertain data, advanced analysis of uncertain data is still in its early stages. In this paper, we focus on skyline analysis of uncertain data, modeled as...

متن کامل

Supporting Case-Based Retrieval by Similarity Skylines: Basic Concepts and Extensions

Conventional approaches to similarity search and case-based retrieval, such as nearest neighbor search, require the specification of a global similarity measure which is typically expressed as an aggregation of local measures pertaining to different aspects of a case. Since the proper aggregation of local measures is often quite difficult, we propose a novel concept called similarity skyline. R...

متن کامل

Probabilistic GENCOs Bidding Strategy in Restructured Two-Side Auction Power Markets

As a matter of course, power market uncertainties escalation is by product of power industry restructure on one hand and the unrivalled penetration of renewable energies on the other. Generally, the decision making process in such an uncertain environment faces with different risks. In addition, the performance of real power markets is very close to oligopoly markets, in which, some market play...

متن کامل

Probabilistic Allocation Of Parking lots In Distribution Network Considering Uncertainty.

In this paper, parking lots with bidirectional power flow capability, is used as an achievements of smart power systems. Based on operating conditions, electric vehicles can be considered  as a load  or generator. For optimal operation of power systems, allocation of these novel units is also necessary same as other distributed generation. In this paper, an optimization problem is  proposed for...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007