The TELLTALE Dynamic Hypertext Environment : Approaches to
نویسندگان
چکیده
Methods and tools for nding documents relevant to a user's needs in document corpora can be found in the information retrieval, library science, and hypertext communities. Typically, these systems provide retrieval capabilities for fairly static corpora, their algorithms are dependent on the language for which they are written, e.g. English, and they don't perform well when presented with misspelled words or text that has been degraded by OCR (optical character recognition) techniques. In this paper, we present the TELLTALE system. TELLTALE is a dynamic hypertext environment that provides full-text search from a hypertext-style user interface for text corpora that may be garbled by OCR or transmission errors, and that may contain languages other than English by using several techniques based on n-grams (n character sequences of text). In this paper, we identify methods and techniques that we have applied to the n-gram data structures and algorithms to enhance the scalabilty of the TELLTALE Dynamic Hypertext System.
منابع مشابه
The TELLTALE Dynamic Hypertext Environment: Approaches to Scalability
Methods and tools for nding documents relevant to a user's needs in document corpora can be found in the information retrieval, library science, and hypertext communities. Typically, these systems provide retrieval capabilities for fairly static corpora, their algorithms are dependent on the language for which they are written, e.g. English, and they don't perform well when presented with missp...
متن کاملTELLTALE: Experiments in a Dynamic Hypertext Environment for Degraded and Multilingual Data
Methods and tools for finding documents relevant to a user’s needs in document corpora can be found in the information retrieval, library science, and hypertext communities. Typically, these systems provide retrieval capabilities for fairly static corpora, their algorithms are dependent on the language for which they are written, e.g. English, and they do not perform well when presented with mi...
متن کاملPerformance and Scalability of a Large-Scale N-gram Based Information Retrieval System
Information retrieval has become more and more important due to the rapid growth of all kinds of information. However, there are few suitable systems available. This paper presents a few approaches that enable large-scale information retrieval for the TELLTALE system. TELLTALE is a dynamic hypertext information retrieval environment. It provides full-text search for text corpora that may be gar...
متن کاملValuation Links: Formally Extending the Computational Power of Hypertext
We view hypertext as an inherently dynamic concept to incorporate in the interface of dynamic information systems. What challenges does hypertext face in a constantly changing environment? In this paper, we discuss the benefits and the problems we face in our research into hypertext-oriented decision support systems. Then we focus on a new hypertext construct beneficial to this domain: valuatio...
متن کامل