Data Cleansing Consolidation with PatchR
Abstract
The Linking Open Data (LOD) initiative is turning large resources of publicly available structured data from various domains into interlinked RDF(S) facts that constitute the so-called “Web of Data”. However, this Web of Data is by no means a perfect world of consistent and valid facts. Linked Data has shortcomings along multiple dimensions, ranging from simple syntactic errors through logical inconsistencies to complex semantic errors and wrong facts. Multiple efforts, such as crowdsourcing-based, statistical, or heuristic approaches, target data quality assessment or aim to detect and resolve such shortcomings in Linked Data datasets. These approaches tend to address particular problems or datasets rather than generalize to any kind of error. Moreover, their results are published in various forms, which makes them hard to combine. In this paper we propose the aggregation of heterogeneous Linked Data cleansing efforts by using the Patch Request ontology [1]. This makes it possible to include less assured outcomes in order to reach a higher coverage.
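As a rough illustration of what aggregating change requests from several cleansing efforts could look like, the following Python sketch groups identical requests and combines their confidence values. Everything in it is an assumption made for illustration: the `Patch` record, its field names, and the noisy-OR combination rule are hypothetical and are not taken from the Patch Request ontology or from the paper.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


@dataclass(frozen=True)
class Patch:
    """Hypothetical change request; not the actual PatchR vocabulary."""
    action: str        # "add" or "delete"
    triple: Triple     # the RDF statement the request refers to
    source: str        # which cleansing effort proposed it
    confidence: float  # assumed to lie in [0, 1]


def aggregate(patches: List[Patch]) -> Dict[Tuple[str, Triple], float]:
    """Group identical requests and combine confidences with noisy-OR.

    Noisy-OR is an illustrative choice: each additional source that
    proposes the same change reduces the remaining doubt.
    """
    remaining_doubt: Dict[Tuple[str, Triple], float] = defaultdict(lambda: 1.0)
    for p in patches:
        remaining_doubt[(p.action, p.triple)] *= 1.0 - p.confidence
    return {key: 1.0 - doubt for key, doubt in remaining_doubt.items()}


# Two weakly assured sources agree on one deletion; a third proposes another.
t1 = ("ex:Berlin", "ex:population", "35")
t2 = ("ex:Paris", "ex:country", "ex:Germany")
combined = aggregate([
    Patch("delete", t1, "crowdsourcing", 0.5),
    Patch("delete", t1, "statistical", 0.5),
    Patch("delete", t2, "heuristic", 0.9),
])
```

Under this assumed rule, two individually weak requests that agree end up with a higher combined confidence than either alone, which is one way less assured outcomes could still contribute to coverage.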
Related resources
PatchR: A Framework for Linked Data Change Requests
Incorrect or outdated data is a common problem when working with Linked Data in real-world applications. PatchR:
An Architecture for Data Warehouse Systems Using a Heterogeneous Database Management System
In today’s highly competitive world, enterprises are using Data Warehouse (DW) systems in order to make better and faster decisions. However, the success of a DW project is related to the quality of the information. This quality depends on data cleansing, consolidation and extraction processes. In this paper, an architecture for DW systems is proposed. This architecture provides high data quali...
Cleansing and preparation of data for statistical analysis: A step necessary in oral health sciences research
In many published articles, there is still no mention of quality control processes, which might be an indication of the insufficient importance the researchers attach to undertaking or reporting such processes. However, quality control of data is one of the most important steps in research projects. Lack of sufficient attention to quality control of data might have a detrimental effect on the r...
SmartCOPI Smart Consolidation of Product Information
Maintaining the quality of detailed product data, ranging from data about required raw materials to detailed specifications of tools and spare parts, is of vital importance in many industries. Ordering or using wrong spare parts (based on wrong or incomplete information) may result in significant production loss or even impact health and safety. The web provides a wealth of information on produ...
Evaluation of the Effect of Commercial Bank Consolidation on Economic Growth (Evidence from Nigeria, 2006- 2015)
This study evaluated the effect of bank consolidation on economic growth of Nigeria between the periods of 2006-2015. Secondary data were sourced from the Central Bank of Nigeria statistical bulletin and the NDIC Annual Reports between the period of 2006 and 2015. Data was analyzed using the Ordinary Least Square (OLS) multiple regression technique with the aid of the SPSS statistical software ...
Publication date: 2014