A Heuristic Approach to Natural Language Processing
نویسندگان
چکیده
This paper is concerned with the design of a processor capable of formalizing English language descriptions of problems in the sententlal calculus. The emphasis is on the design of a system with natural language processing capabilities, but the formal languages specified are oriented to the problem context. A series of automata are specified to carry out the necessary functions. The automata identifythe premises in the problem strings~ specify the appropriate logical connectives among the premises and determine which premises are meaning-equivalent. The syntax of each automaton is defined and examples are used to illustrate their functioning. The automata accept statements in the language L1, the set of English statements of problems in the sententlal calculus. The individual premises p @ L1 are recognized by the syntax~, where ~ is chosen so that the language L2 recognized by it is a subset of L1. Furthermore, the strings in L2 are restricted to the declarative sentences. Once the premises and their logical connectives have been identified, those that are meaning-equivalent are located in two additional steps. First the L2 description of the string is mapped into a string in L3. The L3 language consists of a limited set of canonical forms that ease the problem of establishing meaning equivalence of premises. Finally, the automaton applies heuristically a sequence of problem-orlented and meaning-preserving transformations in order to establish meaning-equivalence. Two premises are taken to be meaning-equivalent if one can be deduced from the other. Otherwise~ they are taken to be not meaning-equlvalent.
منابع مشابه
Heuristic Approach for Specially Structured Two Stage Flow Shop Scheduling to Minimize the Rental Cost, Processing Time, Set Up Time Are Associated with Their Probabilities Including Transportation Time and Job Weightage
The present paper is an attempt to develop a new heuristic algorithm, find the optimal sequence to minimize the utilization time of the machines and hence their rental cost for two stage specially structured flow shop scheduling under specified rental policy in which processing times and set up time are associated with their respective probabilities including transportation time. Further jo...
متن کاملروش جدید متنکاوی برای استخراج اطلاعات زمینه کاربر بهمنظور بهبود رتبهبندی نتایج موتور جستجو
Today, the importance of text processing and its usages is well known among researchers and students. The amount of textual, documental materials increase day by day. So we need useful ways to save them and retrieve information from these materials. For example, search engines such as Google, Yahoo, Bing and etc. need to read so many web documents and retrieve the most similar ones to the user ...
متن کاملNatural Language Inspired Approach for Handwritten Text Line Detection in Legacy Documents
Document layout analysis is an important task needed for handwritten text recognition among other applications. Text layout commonly found in handwritten legacy documents is in the form of one or more paragraphs composed of parallel text lines. An approach for handwritten text line detection is presented which uses machinelearning techniques and methods widely used in natural language processin...
متن کاملBARY at the NTCIR-11 MedNLP-2 Task for Complaints and Diagnosis Recognition
This paper describes a machine-learning based approach to recognizing diagnosed disease names and corresponding temporal expressions. Using CRFs (conditional random fields) to learn and predict tags, the systems described in this paper are characterized by a character-level formulation and heuristic features extracted from medical terminologies. Experimental results on the NTCIR-11 MedNLP-2 dat...
متن کاملCorpus based coreference resolution for Farsi text
"Coreference resolution" or "finding all expressions that refer to the same entity" in a text, is one of the important requirements in natural language processing. Two words are coreference when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...
متن کاملA Hybrid Meta-heuristic Approach to Cope with State Space Explosion in Model Checking Technique for Deadlock Freeness
Model checking is an automatic technique for software verification through which all reachable states are generated from an initial state to finding errors and desirable patterns. In the model checking approach, the behavior and structure of system should be modeled. Graph transformation system is a graphical formal modeling language to specify and model the system. However, modeling of large s...
متن کامل