Electronic Companion — “ A Framework for Reconciling Attribute Values from Multiple Data Sources

نویسندگان

  • Zhengrui Jiang
  • Sumit Sarkar
  • Prabuddha De
چکیده

Proof of Proposition 1. Suppose an attribute value ai is not recorded in any of the data sources S1 through Sn for an entity instance. Then, from Assumptions 1 and 2 in the paper, we have P A= ai AS1 = ak AS2 = al ASn = atW i = k i = l i = t = P AS1 = ak A= ai P AS2 = al A= ai × · · ·×P ASn = at A= ai P AS1 = ak AS2 = al ASn = at P A= ai = 1−R A S1 / m− 1 1−RS2 / m− 1 × · · ·× 1−RSn / m− 1 P AS1 = ak AS2 = al ASn = at P A= ai Clearly, the above probability expression is proportional to the prior probability P A= ai ∀ i. Therefore, we must have P A= ai AS1 = ai AS2 = ai ASn = ai P A= aj AS1 = aj AS2 = aj ASn = aj = P A= ai P A= aj ∀ i j

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reconciling Attribute Values from Multiple Data Sources

Because of the heterogeneous nature of multiple data sources, data integration is often one of the most challenging tasks of today’s information systems. While the existing literature has focused on problems such as schema integration and entity identification, our current study attempts to answer a basic question: When an attribute value for a real-world entity is recorded differently in two d...

متن کامل

A Framework for Reconciling Attribute Values from Multiple Data Sources

B of the heterogeneous nature of different data sources, data integration is often one of the most challenging tasks in managing modern information systems. While the existing literature has focused on problems such as schema integration and entity identification, it has largely overlooked a basic question: When an attribute value for a real-world entity is recorded differently in different dat...

متن کامل

Reconciling Continuous Attribute Values from Multiple Data Sources

Because of the heterogeneous nature of different data sources, data integration is often one of the most challenging tasks in managing modern information systems. The challenges exist at three different levels: schema heterogeneity, entity heterogeneity, and data heterogeneity. The existing literature has largely focused on schema heterogeneity and entity heterogeneity; and the very limited wor...

متن کامل

Conversion Rules from Disparate Data Sources

The successful integration of data from autonomous and heterogeneous systems calls for the resolution of semantic conflicts that may be present. Such conflicts are often reflected by discrepancies in attribute values of the same data object. In this paper, we describe a recently developed prototype system, DIRECT (DIscovering and REconciling ConflicTs). The system mines data value conversion ru...

متن کامل

Attribute-based Access Control for Cloud-based Electronic Health Record (EHR) Systems

Electronic health record (EHR) system facilitates integrating patients' medical information and improves service productivity. However, user access to patient data in a privacy-preserving manner is still challenging problem. Many studies concerned with security and privacy in EHR systems. Rezaeibagha and Mu [1] have proposed a hybrid architecture for privacy-preserving accessing patient records...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007