Taking Entity Reconciliation Offline

Authors

  • Ryan Shaw
  • Patrick Golden
Abstract

Entity reconciliation—linking names or terms to identifiers in external datasets—is a popular method of adding standardized structured data to loosely structured documents. Most approaches to entity reconciliation rely on remote web services, requiring network access during the reconciliation process. For use cases that rely on a “human in the loop” (reconciling entities during the authoring process), this requirement may be a problem. To address this problem, we investigated the feasibility of offline entity reconciliation against the Virtual International Authority File. Offline entity reconciliation was implemented by taking advantage of newly standardized browser storage interfaces to store and query parts of this large dataset locally. We present the results of this investigation and our comparison of the performance, scalability, ease of implementation, and cross-browser compatibility of the various options for storing entity data locally.
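As a rough illustration of what storing and querying part of an authority file locally can look like, the sketch below loads a handful of name records into a local indexed store and answers prefix lookups without any network access. It is not the authors' implementation: the abstract refers to browser storage interfaces, whereas this sketch swaps in an in-memory SQLite database from Python's standard library as a stand-in. The function names (`normalize`, `build_local_store`, `reconcile`) and the `example-N` identifiers are hypothetical and are not real VIAF identifiers.

```python
import sqlite3
import unicodedata

# Hypothetical sample records: (identifier, preferred name).
# The identifiers are placeholders, not real VIAF identifiers.
SAMPLE_RECORDS = [
    ("example-1", "Émile Zola"),
    ("example-2", "Emily Dickinson"),
    ("example-3", "Emma Goldman"),
]


def normalize(name: str) -> str:
    """Lowercase, strip accents, and collapse whitespace so lookups are forgiving."""
    decomposed = unicodedata.normalize("NFKD", name)
    without_marks = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return " ".join(without_marks.lower().split())


def build_local_store(records):
    """Load (identifier, name) pairs into an in-memory SQLite table with an index on the lookup key."""
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE entity (id TEXT, name TEXT, key TEXT)")
    db.execute("CREATE INDEX entity_key ON entity (key)")
    db.executemany(
        "INSERT INTO entity VALUES (?, ?, ?)",
        [(ident, name, normalize(name)) for ident, name in records],
    )
    return db


def reconcile(db, typed: str, limit: int = 5):
    """Return up to `limit` (identifier, name) candidates whose key starts with the typed text."""
    prefix = normalize(typed)
    if not prefix:
        return []
    # Range query over the key column: every key in [prefix, prefix + U+10FFFF).
    rows = db.execute(
        "SELECT id, name FROM entity WHERE key >= ? AND key < ? ORDER BY key LIMIT ?",
        (prefix, prefix + "\U0010ffff", limit),
    )
    return rows.fetchall()
```

A caller would build the store once with `build_local_store(SAMPLE_RECORDS)` and then call `reconcile(db, text)` as the author types. Expressing the prefix match as a range over an indexed key is one plausible way to keep lookups fast as the local subset grows, and a key-range query of the same shape is the kind of operation an indexed browser store could also serve.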

Similar resources

A Wavelet Filtering Application for On-line Dynamic Data Reconciliation

The discrete wavelet transform (DWT) is known for its signal processing ability. In recent research, the DWT has been adopted for signal filtering before dynamic data reconciliation is executed. Where on-line dynamic data reconciliation is concerned, the computation is heavy due to the filtering at every time instant. In this article, a shift property of the DWT is indicated and is applied to reduce the ...

An Integrated Model-Centric Framework for Joint Parameter Estimation/Data Reconciliation of Process Systems

This paper focuses on current developments in the estimation/reconciliation modules of a novel framework for integrated decision support of process systems (IDSoPS). Built on the initial conceptual definition, a generic and versatile error-in-variables method (EVM) was implemented and tested for both off-line and on-line applications using a state-of-the-art modeling tool. The IDSoPS has the capabi...

L2R: A Logical Method for Reference Reconciliation

The reference reconciliation problem consists in deciding whether different identifiers refer to the same data, i.e., correspond to the same world entity. The L2R system exploits the semantics of a rich data model, which extends RDFS by a fragment of OWL-DL and SWRL rules. In L2R, the semantics of the schema is translated into a set of logical rules of reconciliation, which are then used to inf...

COBRA (Consolidated Omnibus Budget Reconciliation Act of 1985).

Applies to: faculty, staff, students, student employees, visitors, contractors. Background: CMU has adopted this policy to comply with the Consolidated Omnibus Budget Reconciliation Act (COBRA) of 1985.

Dynamic and Distributed Reconciliation in P2P-DHT Networks

Optimistic replication can provide high data availability for collaborative applications in large scale distributed systems (grid, P2P, and mobile systems). However, if data reconciliation is performed by a single node, data availability remains an important issue since the reconciler node can fail. Thus, reconciliation should also be distributed and reconciliation data should be replicated. We...

Publication date: 2013