نتایج جستجو برای: ocr

تعداد نتایج: 2705  

2013
Zifei Shan Haowen Cao

Optical Character Recognition (OCR) Systems are widely used to process scanned text into text usable by computers. We observe that current OCR systems have bad performance on domain-specific papers, even generating lots of incorrect words; besides, different OCR systems make relatively independent mistakes. Based on these observations, we train an ensemble system from multiple open-source OCR s...

Journal: :Applied sciences 2022

With the increase of soil consolidation degree, pore water pressure induced by thermal loading drops dramatically. To conveniently and quickly calculate inside under different overconsolidation states quantify effect on pressure, a calculation method considering for saturated clay is proposed. The verified relevant experimental data, good agreements were achieved. Through analyzing influence me...

2004
Kazem Taghva Julie Borsack Thomas A. Nartker Jeffrey S. Coombs Ron Young

Hundreds of experiments over the last decade on the retrieval of OCR documents performed by the Information Science Research Institute have shown that OCR errors do not significantly affect retrievability. We extend those results to show that in the case of proximity searching, the removal of running headers and footers from OCR text will not improve retrievability for such searches.

Journal: :Archives of ophthalmology 2010
Manokaraananthan Chandrakumar Zahra Hirji Herbert C Goltz Giuseppe Mirabella Alan W Blakeman Linda Colpa Agnes M F Wong

OBJECTIVE To investigate whether static ocular counterroll (OCR) gain is reduced during viewing of an earth-fixed vs a head-fixed target. METHODS Twelve healthy individuals were recruited. The target consisted of a red fixation cross against a grid pattern at a viewing distance of 33 cm. The target was mounted on a wall (earth fixed) or was coupled to the head (head fixed). Changes in mean to...

2009
Patrick Röder

This bachelor thesis investigates the use of the RWTH-OCR system on the French handwriting database RIMES. The RWTH-OCR system is based on the RWTH-ASR speech recognition system. The field of offline handwriting recognition is an open topic in research and in the past the RWTH-OCR system has been adapted to several languages as English or Arabic handwriting. The RWTH-OCR is a hidden Markov mode...

2009
Sohail A. Sattar Shamsul Haque Mahmood K. Pathan

In this paper we have presented a novel segmentation technique for the implementation of an OCR (Optical Character Recognition) for printed Nastalique text, a calligraphic style of Urdu which uses the Arabic script for its writing. OCR for many of the world major languages have been developed and are being used but at present an OCR for Nastalique is not available and the published research on ...

2001
J. C. Lecoq Laurent Najman Olivier Gibot Éric Trupin

The choice of a commercial Optical Character Recognition (OCR) engine is important for the process of automatically indexing technical drawings from their title blocks. We would like to benchmark commercial OCR engines with respect to their inclusion in the global digitalisation chain from scanning to understanding the text information contained in a technical drawing document. The crucial (cos...

2010
Khalil Dahab Anja Belz

We present a methodology that takes as input scanned documents of typed or hand-written text, and produces transcriptions of the text as output. Instead of using OCR technology, the methodology is game-based and produces such transcriptions as a by-product. The approach is intended particularly for languages for which language technology and resources are scarce and reliable OCR technology may ...

2008
Songhua Xu James McCusker Martin Schultz Michael Krauthammer

Today’s information retrieval (IR) techniques are mostly text-based. As a consequence, some types of information are beyond the reach of text-based IR systems, which fail in situations where textual information can not be easily accessed, e.g. textual information in biomedical images and figures. To tackle such situations, we propose to augment IR systems with the ability to perform optical cha...

2010
Martin Volk

This paper describes our efforts in building a heritage corpus of Alpine texts. We have already digitized the yearbooks of the Swiss Alpine Club from 1864 until 1982. This corpus poses special challenges since the yearbooks are multilingual and vary in orthography and layout. We discuss methods to improve OCR performance and experiment with combining two different OCR programs with the goal to ...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید