COMPREHENSIVE STUDY OF DEEP LEARNING BASED TELUGU OCR
نویسندگان
چکیده
The aim of the project is to understand offline One most popular and difficult pattern recognition subjects use optical character (OCR) read handwritten Telugu letters. This study suggests a three-stage OCR solution for documents that includes pre-processing, feature extraction, classification. For extraction boundary edge pixel points during preprocessing, we used median filtering on input characters as well normalisation skeletonization techniques. Each initially divided into three 3x3 grids stage, associated centroid each nine zones assessed. allows us recognise in various styles. Following that, drew projection angel's horizontal vertical symmetry character's closest pixel.
منابع مشابه
Telugu OCR Framework using Deep Learning
In this paper, we address the task of Optical Character Recognition(OCR) for the Telugu script. We present an end-to-end framework that segments the text image, classifies the characters and extracts lines using a language model. The segmentation is based on mathematical morphology. The classification module, which is the most challenging task of the three, is a deep convolutional neural networ...
متن کاملA Survey of Telugu Ocr System
Optical character recognition is usually abbreviated as OCR. The object of OCR is automatic reading of optically sensed document text materials to translate human-readable characters into machine-readable codes. Today, reasonably efficient and inexpensive OCR packages are commercially available to recognize printed texts in widely used languages such as English, Chinese, and Japanese. These sys...
متن کاملOCR for Telugu Script Using Back-Propagation Based Classifier
This paper deals with the theory and implementation of an Optical Character Recognition (OCR) system for printed Telugu script, which exploits the inherent characteristics of Telugu scripts, one of the major scheduled language of India, spoken by more than 66 million people, especially in South India. The principle idea is to convert images of text documents such as those obtained from scanning...
متن کاملOCR of Printed Telugu Text with High Recognition Accuracies
Telugu is one of the oldest and popular languages of India spoken by more than 66 million people especially in South India. Development of Optical Character Recognition systems for Telugu text is an area of current research. OCR of Indian scripts is much more complicated than the OCR of Roman script because of the use of huge number of combinations of characters and modifiers. Basic Symbols are...
متن کاملCandidate Search and Elimination Approach for Telugu OCR
In this paper we propose an OCR system for Telugu based on the candidate search and elimination technique. The initial candidates for recognition are found by applying a zoning method on input glyphs. We propose cavities as a structural approach suited specifically for Telugu script, where cavity vectors are used to prune the candidates found by zoning. A final template matching stage using con...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International journal of engineering technology and management sciences
سال: 2023
ISSN: ['2581-4621']
DOI: https://doi.org/10.46647/ijetms.2023.v07i03.133