Towards automatic quality assessment of component metadata

نویسندگان

  • Thorsten Trippel
  • Daan Broeder
  • Matej Durco
  • Oddrun Pauline Ohren
چکیده

Measuring the quality of metadata is only possible by assessing the quality of the underlying schema and the metadata instance. We propose some factors that are measurable automatically for metadata according to the CMD framework, taking into account the variability of schemas that can be defined in this framework. The factors include among others the number of elements, the (re-)use of reusable components, the number of filled in elements. The resulting score can serve as an indicator of the overall quality of the CMD instance, used for feedback to metadata providers or to provide an overview of the overall quality of metadata within a repository. The score is independent of specific schemas and generalizable. An overall assessment of harvested metadata is provided in form of statistical summaries and the distribution, based on a corpus of harvested metadata. The score is implemented in XQuery and can be used in tools, editors and repositories.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards Automatic Evaluation of Learning Object Metadata Quality

Thanks to recent developments on automatic generation of metadata and interoperability between repositories, the production, management and consumption of learning object metadata is vastly surpassing the human capacity to review or process these metadata. However, we need to make sure that the presence of some low quality metadata does not compromise the performance of services that rely on th...

متن کامل

Towards Automatic Evaluation of Metadata Quality in Digital Repositories

Thanks to recent developments on automatic generation of metadata and interoperability between repositories, the production, management and consumption of metadata is vastly surpassing the human capacity to review or process this information. However, we need to assure that low quality metadata does not compromise the performance of the services that the repository provides to its users. We con...

متن کامل

Metadata Enrichment for Automatic Data Entry Based on Relational Data Models

The idea of automatic generation of data entry forms based on data relational models is a common and known idea that has been discussed day by day more than before according to the popularity of agile methods in software development accompanying development of programming tools. One of the requirements of the automation methods, whether in commercial products or the relevant research projects, ...

متن کامل

Towards linking libraries and Wikipedia: automatic subject indexing of library records with Wikipedia concepts

In this article, we first argue the importance and timely need of linking libraries and Wikipedia for improving the quality of their services to information consumers, as such linkage will enrich the quality of Wikipedia articles and at the same time increase the visibility of library resources which are currently overlooked to a large degree. We then describe the development of an automatic sy...

متن کامل

Towards Real-time Metadata for Sensor-based Networks and Geographic Databases

Nowadays, the geographic metadata are defined by ISO standards and their particular goal is to assist the data exchange between different users and to provide the data quality information. However, these metadata are defined for static data processed by traditional applications. Such metadata do not take into consideration the qualification of dynamic data resulting from mobile and agile object...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014