CT-Net: Cascade T-shape deep fusion networks for document binarization
نویسندگان
چکیده
• We present a Cascade T-Shape Deep Networks framework for document image binarization. The T-Net is multi-task enhancement and features learned by the are transferred to binarization which can improve performance of evaluated on nine public DIBCO datasets achieves new state-of-the-art performance. Document key step in most analysis tasks. However, historical-document images usually suffer from various degradations, making this very challenging processing stage. has improved dramatically recent years use Convolutional Neural (CNNs). In paper, dual-task, T-shaped neural network proposed that main task an auxiliary enhancement. learns degradations specific CNN-kernel be adapted towards training process. addition, considered as version input image, fed into fine-tuning, it possible design chained-cascade (CT-Net). Experimental results competition (DIBCO datasets) MCS dataset show our method outperforms competing methods cases.
منابع مشابه
Document Binarization Combining with Graph Cuts and Deep Neural Networks
Most data mining applications on collections of historical documents require binarization of the digitized images as a pre-processing step. Historical documents are often subjected to degradations such as parchment aging, smudges and bleed through from the other side. The text is sometimes printed, but more often handwritten. Mathematical modeling of the appearance of the text, as well as the b...
متن کاملShape based local thresholding for binarization of document images
This paper presents a novel local threshold algorithm for the binarization of document images. Stroke width of handwritten and printed characters in documents is utilized as the shape feature. As a result, in addition to the intensity analysis, the proposed algorithm introduces the stroke width as shape information into local thresholding. Experimental results for both synthetic and practical d...
متن کاملDeep Adversarial 3D Shape Net
3D object understanding and reconstruction are important research questions in computer graphics and computer vision. In graphics, researchers tried to solve this question by breaking the shape down into separate shape parts (Huang et al. (2015)). This approach has achieved plausible results in shape synthesis. However, this requires pre-processing steps, including shape segmentation and resamp...
متن کاملLoss-aware Binarization of Deep Networks
Deep neural network models, though very powerful and highly successful, are computationally expensive in terms of space and time. Recently, there have been a number of attempts on binarizing the network weights and activations. This greatly reduces the network size, and replaces the underlying multiplications to additions or even XNOR bit operations. However, existing binarization schemes are b...
متن کاملDocument Image Binarization
Principal stage of the document image analysis procedure is the binarization, according to which the pixels are classified into text and background. It is a crucial stage that can affect further stages including the final character recognition stage. This thesis is focused on document image binarization, including both binarization techniques and evaluation methodologies. Specifically, accordin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Pattern Recognition
سال: 2021
ISSN: ['1873-5142', '0031-3203']
DOI: https://doi.org/10.1016/j.patcog.2021.108010