Coping with very large digital collections using Greenstone
نویسندگان
چکیده
The Greenstone digital library software is widely used for small to medium digital library collections, but its reputation for creating very large collections is less well established. This paper describes how Greenstone is being used to produce large newspaper collections for the National Libraries of New Zealand and Singapore, respectively. It also describes current developments that integrate IBM’s DB2 database system into Greenstone as an optional search engine and metadata database, which allows the runtime server to be deployed in a federated configuration.
منابع مشابه
Towards Very Large Scale Digital Library Building in Greenstone Using Parallel Processing
As very large digital library collections become more commonplace, software tools must adapt appropriately. This paper reports on an evolution of the Greenstone Digital Library software to support parallel processing during the collection building phase. A series of experiments were conducted to first establish a basic speed-up factor, and then deconstruct the parallelisation process to underst...
متن کاملThe Greenstone Digital Library Software
Digital libraries are large, organized, focused collections of information. The Greenstone software is intended to help people design and build such collections quickly and easily. Collections may be large—some comprise Gbytes of text; others include many millions of short documents. Additionally, far larger volumes of information may be associated with a collection—typically audio, image, and ...
متن کاملA Distributed Directory Service for Greenstone
Greenstone is a software for creating and maintaining distributed digital library collections. It provides a sophisticated federation mechanism for the collections. In order to support alerting notification about changes in the distributed collections, we propose a distributed directory service for the management of the distributed Greenstone installations. The Greenstone directory service (GDS...
متن کاملCustomizing Digital Library Interfaces with Greenstone
The Greenstone digital library software is intended to help users construct simple collections of information very quickly. Indeed, only a few minutes of the user’s time are needed to set up a collection based on a standard design and initiate the building process. Collections may be large—some comprise Gbytes of text; millions of documents. Furthermore, even larger volumes of information may b...
متن کاملGreenstone: Collection management for digital works
The Greenstone Digital Library Software is a comprehensive package for creating, maintaining, presenting and disseminating collections of digital resources (http://www.greenstone.org/, [10]). Greenstone collections offer effective full-text searching and metadata-based browsing facilities that are attractive and easy to use, and a user-friendly interface called The Collector makes it easy for p...
متن کامل