Recognize, Categorize, and Retrieve

Authors

  • Kazem Taghva
  • Thomas A. Nartker
  • Julie Borsack

Abstract

A successful text categorization experiment divides a textual collection into pre-defined classes. A true representative for each class is generally obtained during training of the categorizer. In this paper, we report on our experiments on training and categorization of optically recognized documents. In particular, we will address the issues regarding the effects OCR errors may have on training, dimensionality reduction, and categorization. We further report on ways that categorization may help error correction and retrieval effectiveness.

Similar resources

Object Recognition and Object Categorization in Animals

One of the most important attributes of cognitive activities in both human and nonhuman animals is the ability to recognize individual objects and to categorize a variety of objects that share some properties. Wild-living spider monkeys, for example, individually recognize their partners and a large number of other conspecifics quickly and accurately regardless of their highly variable spatial ...

Applying Technology in the Architectural Design Process

Designers draw diagrams to think about architectural concepts and design concerns. Scientists are interested in programming a computer to recognize and interpret design diagrams to deliver appropriate tools for the design task at hand. Researchers conducted empirical studies to find out if designers share drawing conventions when designing. Rapid improvements in technology guide us to develope...

The Contribution of fMRI in the Study of Visual Categorization and Expertise

We recognize and categorize objects around us within a fraction of a second and in a number of different ways, depending on context, our experience with them, and the purpose of the categorization. For example the same animal can be a dog, a bow wow or a bulldog, a mammal or a Canis lupus familiaris. We are also able to recognize it in a variety of lighting conditions, orientations and position...

A Belief Revision Approach to Textual Entailment Recognition

An artificial believer has to recognize textual entailment to categorize beliefs. We describe our system – the Fuzzy Believer system – and its application to the TAC/RTE three-way task.

Social Health Signals

Recently, Twitter has emerged as one of the primary media for sharing and seeking the latest information on a variety of topics, including health information. Although Twitter is an excellent information source, identifying useful information in the deluge of tweets is one of the major challenges. Twitter search is limited to keyword-based techniques to retrieve information ...

Publication date: 2001