Semi-automated collection evaluation for large-scale aggregations
نویسندگان
چکیده
Library and museum digital collections are increasingly aggregated at various levels. Large-scale aggregations, often characterized by heterogeneous or messy metadata, pose unique and growing challenges to aggregation administrators – not only in facilitating end-user discovery and access, but in performing basic administrative and curatorial tasks in a scalable way, such as finding messy data and determining the overall topical landscape of the aggregation. This poster describes early findings on using statistical text analysis techniques to improve the scalability of an aggregation development workflow for a large-scale aggregation. These techniques hold great promise for automating historically labor-intensive evaluative aspects of aggregation development and form the basis for the development of an aggregator’s dashboard. The aggregator’s dashboard is planned as a statistical textanalysis-driven tool for supporting large-scale aggregation development and maintenance, through multifaceted, automatic visualization of an aggregation’s metadata quality and topical coverage. The administrator’s dashboard will support principled yet scalable aggregation development.
منابع مشابه
Beyond Size and Search: Building Contextual Mass in Digital Aggregations for Scholarly Use
At present there are no established collection development methods for building large-scale digital aggregations. However, to realize the potential of the collective base of digital content and advance scholarship, aggregations must do more than provide search of sizable bodies of content. Informed by empirical understanding of scholarly information practices, the IMLS Digital Collections and C...
متن کاملCost Function Modelling for Semi-automated SC, RTG and Automated and Semi-automated RMG Container Yard Operating Systems
This study analyses the concept of cost functions for semi-automated Straddle Carrier (SC), Rubber Tyred Gantry (RTG) and automated Rail Mounted Gantry (RMG) container yard operating cranes. It develops a generic cost based model for a pair-wise comparison, analysis and evaluation of economic efficiency and effectiveness of container yard equipment to be used for decision-making by terminal pla...
متن کاملSemi-quantitative segmental perfusion scoring in myocardial perfusion SPECT: visual vs. automated analysis
Introduction: It is recommended that the physician apply at least a semi-quantitative segmental scoring system in myocardial perfusion SPECT. We aimed to assess the agreement between automated semi-quantitative analysis using QPS (quantitative Perfusion SPECT) software and visual approach for calculation of summed stress score (SSS), summed rest score (SRS) and summed difference score (SDS). ...
متن کاملProject Acronym: ASSESS CT Grant Agreement number: 643818 Project Title: Assessing SNOMED CT for Large Scale eHealth Deployments in the EU
for dissemination) Deliverable 2.4 introduces the notion of user interface terminologies in contrast to reference terminologies. Three terminology settings (SNOMED CT against an alternative, UMLS-derived hybrid terminology and a local terminology collection), are analysed under user interface terminology aspects. It investigated the coverage of such interface terms in these three terminology sc...
متن کاملTowards Semi-automatic Ontology Building Supported by Large-Scale Knowledge Acquisition
Knowledge acquisition is usually the first step in building ontologies. On the one hand, knowledge is typically implicitly contained in large collections of unstructured documents. Therefore it is extremely troublesome to manually identify relevant concepts. On the other hand, users are often not fully satisfied with the results of automated stateof-the-art ontology learning techniques. In this...
متن کامل