Design of a Digital Library for Early 20 Century Medico-legal Documents
نویسندگان
چکیده
The research value of important government documents to historians of medicine and law is enhanced by a digital library of such a collection being designed at the U.S. National Library of Medicine. This paper presents work toward the design of a system for preservation and access of this material, focusing mainly on the automated extraction of descriptive metadata needed for future access. Since manual entry of these metadata for thousands of documents is unaffordable, automation is required. Successful metadata extraction relies on accurate classification of key textlines in the document. Methods are described for the optimal scanning alternatives leading to high OCR conversion performance, and a combination of a Support Vector Machine (SVM) and Hidden Markov Model (HMM) for the classification of textlines and metadata extraction. Experimental results from our initial research toward an optimal textline classifier and metadata extractor are given.
منابع مشابه
Design of a Digital Library for Early 20th Century Medico-legal Documents
The research value of important government documents to historians of medicine and law is enhanced by a digital library of such a collection being designed at the U.S. National Library of Medicine. This paper presents work toward the design of a system for preservation and access of this material, focusing mainly on the automated extraction of descriptive metadata needed for future access. Sinc...
متن کاملTools for the Governance of Urban Design: The Tehran Experience
This research seeks to reflect the managerial, academic and professional experience of the authors in the design and implementation process of urban design projects, aiming to use the application of the “design governance” model, in order to describe the documents and activities of the Department of Urban Planning and Architecture of Tehran Municipality in the last decade. This paper consists ...
متن کاملAn 18th century Tuscan pharmacy: analysis of the library.
The archival documents of San Luca Hospital, which has long been the most important welfarist institution of the Republic of Lucca (Tuscany), are stored in the Record Offices of Lucca. The hospital was served by a pharmacy, where the medicaments were prepared for patients and for the needs of other institutions in the city. Three different inventories, dating back to 1719, 1749 and 1784, report...
متن کاملAnalyzing registry, log files, and prefetch files in finding digital evidence in graphic design applications
The products of graphic design applications leave behind traces of digital information which can be used during a digital forensic investigation in cases where counterfeit documents have been created. This paper analyzes the digital forensics involved in the creation of counterfeit documents. This is achieved by first recognizing the digital forensic artifacts left behind from the use of graphi...
متن کاملUtilization of Digital Technology in Designing and Producing Zero Waste Clothes with Sustainability Approach
Consumerism is a new phenomenon created in the 21st century that can play a substantial role in the destruction of national resources of any country. Nowadays, concerning the ever-increasing improvement of fashion in the world, consumerism in the field of textiles and clothing is raised more than ever. Considering the wasteful consumption of products may bring about remarkable damage...
متن کامل