Hyperdocument Generation using OCR and Icon Detection
Abstract
In this contribution we consider the construction of hyperdocuments, that is, the conversion of scanned paper documents into electronic hypertext. Hyperlink creation is automated by analyzing the structure and content of the scanned document. The focus is on hyperlinks between the text and labels in a picture. A number of tools for such hyperlink detection are described, and practical results are presented.
Similar resources
Automatic HTML Generation from Formal Hypermedia Specifications
HMBS (Hypermedia Model Based on Statecharts) is a model suitable for specifying highly structured hyperdocuments. HySCharts is an environment that supports the authoring of hyperdocuments based on the HMBS model. It also supports hyperdocument navigation according to a well-defined browsing semantics. In this paper, we propose three strategies for automatically deriving an HTML implementation f...
Document Image Dewarping Based on Text Line Detection and Surface Modeling (RESEARCH NOTE)
Document images produced by a scanner or digital camera usually suffer from geometric and photometric distortions. Both deteriorate the performance of OCR systems. In this paper, we present a novel method to compensate for undesirable geometric distortions, aiming to improve OCR results. Our methodology is based on finding text lines by dynamic local connectivity map and then applying a l...
LOW SPECIFICITY OF THE THIRD GENERATION ELISA FOR HCV DETECTION IN VOLUNTARY BLOOD DONORS
Objective: Third-generation anti-HCV ELISA is currently recommended for the diagnosis of HCV infection. We determined its specificity in voluntary blood donors (VBDs) and patients with chronic liver disease (CLD) in relation to confirmatory line immunoassay (LIA) and reverse transcription polymerase chain reaction (RT-PCR). Material and Methods: 1926 serum samples of VBDs and 16 HCV-related CLD ...
Generating Guided Tours to Facilitate Learning from a Set of Indexed Resources
This presentation proposes an approach to the generation of guided tours over indexed information resources on user demand. It represents a lightweight alternative to sophisticated hyperdocument generation systems based on knowledge representation and complex resource indexing. The information space is modelled as a hypergraph of resources and resource descriptors. A spanning tree of descriptor...
TranslatAR: A Mobile Augmented Reality Translator on the Nokia N900
Researchers have long been interested in the synergy between portability and computing power but have been limited by unwieldy, uncommonly used devices. The latest generation of mobile phones, i.e. ‘smartphones’, are equipped with hardware powerful enough to develop novel, interesting applications that allow users to directly interact with the world around them. This paper describes a multimodal...