Meaning Representation and Text Planning
نویسندگان
چکیده
The data flow in natural language generation (NLG) starts with a 'world' state, represented by structures of an application program (e.g., an expert system) that has text generation needs and an impetus to produce a natural language text. The output of generation is a natural language text. The generation process involves the tasks of a) delimiting the content of the eventual text, b) plano ning its structure, c) selecting lexieal, syntactic and word order me,'ms of realizing this structure and d) actually realizing the textusing the latter. In advanced generation systems these processes are treated not in a monolithic way, but rather as components of a large, modular generator. NLG researchers experiment with various ways of delimiting the modules of the generation process and control architectures to drive these modules (see, for instance, McKeown, 1985, Hovy, 1987 or Meteer, 1989). But regardless of the decisions about general (intermodular) or local (intramodular) control flow, knowledge structures have to be defined to support processing and facilitate communication among the modules. The natural language generator DIOGENES(e.g., Nirenburg et al., 1989) has been originally designed for use in machine translation. This means that the content delimitation stage is unnecessary, as the set of meanings to be realized by the generator is obtained in machine translationas a result of source text analysis. The first processing component in DIOGENES is, therefore, its text planner which, takes as input a text meaning representation (TMR) and a set of static pragmatic factors (similar to Hovy's (1987) rhetorical goals) and produces a text plan (TP), a structure containing information about the order and boundaries of target language sentences; the decisions about reference realization and lexical selection, t At the next stage, a set of semantics-to-syntax mapping rules are used to produce a set of target-language syntactic structures (we are using the f-structures of LFG see, e.g., Nirenburg and Levin, 1989). Finally, a syntactic realizer produces a target language text from the set of f-structures. To produce texts of adequate quality, natural language generation needs a sufficiently expressive input language. In this paper we discuss several important aspects of the knowledge and the processing at the text planning stage of a generation system. First, we describe a comprehensive language processing paradigm which underlies work on both generation and analysis of natural language in our environment. Next, we illustrate the features of our meaning representation languages, the text meaning representation language TAMERLAN and the text plan representation language TPL. Finally, we describe the mechanism of text planning in DIOGENES and illustrate the formalism and the strategy for acquiring text planning rules.
منابع مشابه
Towards Generating Text from Discourse Representation Structures
We argue that Discourse Representation Structures form a suitable level of languageneutral meaning representation for micro planning and surface realisation. DRSs can be viewed as the output of macro planning, and form the rough plan and structure for generating a text. We present the first ideas of building a large DRS corpus that enables the development of broad-coverage, robust text generato...
متن کاملThe Significance of Multimodality/Multiliteracies in Iranian EFL Learners’ Meaning- Making Process
The main objective of this study was to investigate how Iranian EFL learners used their literacy practices and multimodal resources to mediate interpretation and representation of an advertisement text and construct their understanding of it. Fifteen female adolescents at an intermediate level of proficiency read the "مبلمان برلیان" (“Brelian Furniture”) advertisement text and re-created their ...
متن کاملA Joint Semantic Vector Representation Model for Text Clustering and Classification
Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...
متن کاملText Meaning Representation For Chinese
This paper describes text meaning representation for Chinese. Text meaning representation is composed of a set of ontological concept instances along with ontological links among them. It integrates lexical, textual and world knowledge into a single hierarchical framework. In NLP application it serves as an interlingual representation for various processing. The methodology and implementation o...
متن کاملIntroduction to the teachings of the transcendental paradigm in the process of teaching-learning and its critique
The purpose of this study is to study the teachings of the transcendental paradigm in the process of teaching-learning and its critique. In order to achieve the purpose of the research, three methods of conceptual, inference and critical analysis have been used to analyze and critique the foreman paradigm. Findings of the research indicate that meta-text instead of oral text emphasizes written ...
متن کامل