Recognize, Categorize, and Retrieve
نویسندگان
چکیده
A successful text categorization experiment divides a textual collection into pre-defined classes. A true representative for each class is generally obtained during training of the categorizer. In this paper, we report on our experiments on training and categorization of optically recognized documents. In particular, we will address the issues regarding the effects OCR errors may have on training, dimensionality reduction, and categorization. We further report on ways that categorization may help error correction and retrieval effectiveness.
منابع مشابه
Object Recognition and Object Categorization in Animals
One of the most important attributes of cognitive activities in both human and nonhuman animals is the ability to recognize individual objects and to categorize a variety of objects that share some properties. Wild-living spider monkeys, for example, individually recognize their partners and a large number of other conspecifics quickly and accurately regardless of their highly variable spatial ...
متن کاملبکارگیری تکنولوژی در فرآیند طراحی معماری
Designers draw diagrams to think about architectural concepts and design concerns. Scientists are interested in programming a computer to recognize and interpret design diagrams to deliver appropriate tools for the design task at hand. Researchers conducted empirical studies to find out if designers share drawing conventions when designing. Quick improvement in technology guide us to develope...
متن کاملThe Contribution of fMRI in the Study of Visual Categorization and Expertise
We recognize and categorize objects around us within a fraction of a second and in a number of different ways, depending on context, our experience with them, and the purpose of the categorization. For example the same animal can be a dog, a bow wow or a bulldog, a mammal or a Canis lupus familiaris. We are also able to recognize it in a variety of lighting conditions, orientations and position...
متن کاملA Belief Revision Approach to Textual Entailment Recognition
An artificial believer has to recognize textual entailment to categorize beliefs. We describe our system – the Fuzzy Believer system – and its application to the TAC/RTE three-way task.
متن کاملSocial Health Signals
Recently Twitter, has emerged as one of the primary medium for sharing and seeking of the latest information related to variety of the topics including health information. Although Twitter is an excellent information source, identification of useful information from the deluge of tweets is one of the major challenge. Twitter search is limited to keyword based techniques to retrieve information ...
متن کامل