Identiication of Text on Colored Book and Journal Covers
نویسندگان
چکیده
In this paper an approach to automatic text location and identiication on colored book and journal covers is proposed. To reduce the amount of small variations in color, a clustering algorithm is applied in a preprocessing step. Two methods have been developed for extracting text hypotheses. One is based on a top-down analysis using successive splitting of image regions. The other is a bottom-up region growing algorithm. The results of both methods are combined to robustly distinguish between text and non-text elements. Text elements are binarized using automatically extracted information about text color. The binarized text regions can be used as input for a conventional OCR module. Results are shown for several book and journal covers of diierent complexity. The proposed method is not restricted to book and journal cover pages, but can be applied to the extraction of text from other types of color images as well.
منابع مشابه
Identification of Text on Colored Book and Journal Covers
In this paper an approach to automatic text location and identification on colored book and journal covers is proposed. To reduce the amount of small variations in color, a clustering algorithm is applied in a preprocessing step. Two methods have been developed for extracting text hypotheses. One is based on a top-down analysis using successive splitting of image regions. The other is a bottom-...
متن کاملIntracranial Arterial Aneurysms
In its scope, it occupies an intermediate position between such works as are primarily concerned with parasitology and the outstanding text-book by Dr. Strong on "Diagnosis, Prevention, and Treatment of Tropical Diseases" which covers so completely and thoroughly not only "tropical medicine" in its narrower sense, but also so many of the contributing sciences as well. The content of Dr. Bercovi...
متن کاملJournals Subheadlines Text Extraction Using Wavelet Thresholding and New Projection Profile
In this paper a new robust and efficient algorithm to automatic text extraction from colored book and journal cover sheets is proposed. First, we perform wavelet transform. Next for edge detecting from detail wavelet coefficient, we use dynamic threshold. By blurring approximate coefficients with alternative heuristic thresholding, achieve effective edge,. Afterward, with ROI technique get bina...
متن کاملINVESTIGATION OF BARRIERS AND REQUIREMENTS AFFECTING E-SHOPPING BEHAVIOR OF CUSTOMERS IN THE BOOK MARKET
<span style="color: #000000; font-family: Tahoma, sans-serif; font-size: 13px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: justify; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; display: inline !important; float: none; backgro...
متن کامل