DCKR - Knowledge Representation in Prolog and Its Application to Natural Language Processing
نویسنده
چکیده
Semantic processing is one of the important tasks for natural language processing. Basic to semantic processing is descriptions of lexical items. The most frequently used form of description of lexical items is probably Frames or Objects. Therefore in what form Frames or Objects are expressed is a key issue for natural language processing. A method of the Object representation in Prolog called DCKR will be introduced. It will be seen that if part of general knowledge and a dictionary are described in DCKR, part of context-processing and the greater part of semantic processing can be left to the functions built in Prolog. 1. Introduction Relationships between knowledge represented in predicate logic formulas and knowledge represented in Frames or ~Kt~i~K~ fihi~i~ are clarified by [Hayes 80], [Nilsson 80], [Goebel 85],[Bowen 85], et al, but their methods requires separately an interpreter for their representation. The authors have developed a knowledge representation form called DCKR (Definite Clause Knowledge Representation) [Koyama 85]. In DCKR, each of the ~i~%~ composing of a Structured Object (hereinafter simply called an ~hJ~Gi) is represented by a Horn clause (a Prolog statement) with the "sem" predicate (to be explained in Section 2) as its head. Therefore, an Object can he regarded as a set of Horn clauses (slots) headed by the sem predicate with the same first argument. From the foregoing it follows that almost all of a program for performing semantic intepretations relative to lexical items described in DCKR can be replaced by functions built in Prolog. That is, most of programming efforts of semantic processing can be left to the functions built in Prolog. DCKR will be described in detail in Section 2. Section 3 will discuss applications of DCKR to semantic processing of natural languages. 2. Knowledge Representation in DCKR The following examples of knowledge representation in DCKR will be used in Section 3 and later. Now the meanings of the sem, i~ and h~a predicates, which are important to descriptions in DCKR, are explained later using the DCKR examples given above. The first argument in the sem predicate is the Qhl~! ngm~. Objects are broadly divided into two types, in~iEi~i~ and RE~!Q%X~. Psychologists often refer to prototypes as stereotypes. An Object name with # represents an in~%~i~i n~ and the one without #, a ~K~!~!Z~ n~. For example, clyde#1 and elephant, which appears in 01l and 05), represent an individual name and a prototype name, …
منابع مشابه
Dependency parsing for medical language and concept representation
The theory of conceptual structures serves as a common basis for natural language processing and medical concept representation. We present a PROLOG-based formalization of dependency grammar that can accommodate conceptual structures in its dependency rules. First results indicate that this formalization provides an operational basis for the implementation of medical language parsers and for th...
متن کاملTowards Overcoming the Knowledge Acquisition Bottleneck in Answer Set Prolog Applications: Embracing Natural Language Inputs
Answer set Prolog, or AnsProlog in short, is one of the leading knowledge representation (KR) languages with a large body of theoretical and building block results, several implementations and reasoning and declarative problem solving applications. But it shares the problem associated with knowledge acquisition with all other KR languages; most knowledge is entered manually by people and that i...
متن کاملAttempto - From Specifications in Controlled Natural Language towards Executable Specifications
Deriving formal specifications from informal requirements is difficult since one has to take into account the disparate conceptual worlds of the application domain and of software development. To bridge the conceptual gap we propose controlled natural language as a textual view on formal specifications in logic. The specification language Attempto Controlled English (ACE) is a subset of natural...
متن کاملUsing Prolog for Biological Descriptions
We describe a system which performs biological identification on the basis of natural language descriptions. The system parses texts containing large sets of biological descriptions in restricted natural language and constructs a knowledge base. The system can semi-automatically adapt to a text by extending its lexicon and, to a limited extent, its grammar. Prolog features are important in both...
متن کاملSpecifying Logic Programs in Controlled Natural Language
Writing specifications for computer programs is not easy since one has to take into account the disparate conceptual worlds of the application domain and of software development. To bridge this conceptual gap we propose controlled natural language as a declarative and application-specific specification language. Controlled natural language is a subset of natural language that can be accurately ...
متن کامل