Connecting KOSs and the LOD Cloud

نویسندگان

  • Rick Szostak
  • Andrea Scharnhorst
  • Wouter Beek
  • Richard P. Smiraglia
چکیده

This paper describes a specific project, the current situation leading to it, its project design and first results. In particular, we will examine the terminology employed in the Linked Open Data cloud and compare this to the terminology employed in both the Universal Decimal Classification and the Basic Concepts Classification. We will explore whether these classifications can encourage greater consistency in LOD terminology. We thus hope to link the largely distinct scholarly literatures that address LOD and KOSs. 1.0 Introduction and Motivation Our research1 involves comparing the terminology employed within the Linked Open Data (LOD) Cloud with terminology employed within two KOSs: The Universal Decimal Classification (UDC) and the Basic Concepts Classification (BCC). In doing so we will connect two quite distinct literatures and communities of practice: the Semantic Web (SW) community, which has tended to be centered in computer science, and the knowledge organization (KO) community. In the SW community there have been increasing efforts to curate and preserve the machine-readable knowledge items as published on the Web using linked data formats (Beek, Rietveld at al. 2014; Beek at al. 2014). Controlled vocabularies play a prominent role in these efforts. They provide a way to index the knowledge graph, and they represent a semantically enriched layer in this graph. In knowledge organization (KO), systematic studies of KOSs have been proposed already (Tennis 2012), and such studies have also been executed for a number of small samples. The promise of the web-based LOD Cloud is to free up data, metadata and information to a large extent from what often is called “data silos”—isolated information systems, which come with their own domain-specific knowledge organization systems, and are often barely interoperable. The LOD Cloud promises to deliver machinereadable KOSs and their implementation in a way that enables easy cross-linking. For example, the platform GeoNames (http://www.geonames.org) publishes about eleven billion place names in machine readable form, and has been used by many other services to relate a term like “New York” to a specific geographic reference, which in turn enables other services to link other names to this location, e.g., “City of New York,” “New York City,” or the historic term “Nieuw Amsterdam.” To be able to compare the different terminologies expressed in vocabularies, one first has to have an overview of them. Hence, our research involves the initial step of surveying the terminologies that are currently employed in linked open data. This will result in an atlas of vocabularies. 1 Digging Into the Knowledge Graph, 2016 Digging Into Data Challenge https://diggingintodata.org/awards/2016/project/digging-knowledge-graph

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Simultaneous Determination of Carbazoles in Water Samples by Cloud Point Extraction Coupled to HPLC

Cloud point extraction (CPE) as a rapid, simple and efficient method coupled with high performance liquid chromatography (HPLC) was used for sample preparation and subsequent determination of carbazole, trinitrocarbazole (TrNC) and tetra nitro carbazole (TNC) in water samples. Some effective parameters on extraction, such as volume of Triton X-100, extraction time, extraction temperature, ionic...

متن کامل

Preliminary Work towards Publishing Vocabularies for Germplasm and Soil Data as Linked Data

The agINFRA project focuses on the production of interoperable data in agriculture, starting from the vocabularies and Knowledge Organization Systems (KOSs) used to describe and classify them. In this paper we report on our first steps in the direction of publishing agricultural Linked Open Data (LOD), focusing in particular on germplasm data and soil data, which are still widely missing from t...

متن کامل

IR Scientific Data: How to Semantically Represent and Enrich Them

English. Experimental evaluation carried out in international large-scale campaigns is a fundamental pillar of the scientific and technological advancement of Information Retrieval (IR) systems. Such evaluation activities produce a large quantity of scientific and experimental data, which are the foundation for all the subsequent scientific production and development of new systems. We discuss ...

متن کامل

DRX: A LOD dataset interlinking recommendation tool

With the growth of the Linked Open Data (LOD) cloud, data publishers face a new challenge: finding related datasets to interlink with. To face this challenge, this paper describes a tool, called DRX, to assist data publishers in the process of dataset interlinking and browsing the LOD cloud. DRX is organized in five main modules responsible for: (i) collecting data from datasets on the LOD clou...

متن کامل

LOQUS: Linked Open Data SPARQL Querying System

The LOD cloud is gathering a lot of momentum, with the number of contributors growing manifold. Many prominent data providers have submitted and linked their data to other dataset with the help of manual mappings. The potential of the LOD cloud is enormous ranging from challenging AI issues such as open domain question answering to automated knowledge discovery. We believe that there is not eno...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1802.08141  شماره 

صفحات  -

تاریخ انتشار 2018