ITRI-99-19 Using centering theory to plan coherent texts
نویسندگان
چکیده
This paper describes an approach to text planning, one of the distinct tasks identified in Ehud Reiter's " consensus " architecture for Natural Language Generation (Reiter 1994, Reiter and Dale 1997). This consists a " pipeline " of distinct tasks: Text Planning-deciding the content of a message, and organising the component propositions into a text tree; Sentence Planning-aggregating propositions into clausal units and choosing lexical items corresponding to concepts in the knowledge base; Linguistic realisation-surface details such as agreement, orthography etc. (See also (Cahill et al. 1999) who propose a more elaborate model which allows the " pipeline " as a concrete instantiation.) We assume that the component propositions to be realised in a text are organised in a tree structure in which terminal nodes are elementary propositions and non-terminal nodes represent discourse relations as defined by e.g., Rhetorical Structure Theory (RST, Mann and Thompson 1987). This structure only partially determines the linear order in which the propositions will be realised-in other words, any RST structure specifies a range of possible text plans. We propose as an additional constraint that the generator should seek to maximise continuity of reference as determined by the rules and constraints of centering theory, and we argue that this enables us to select the most cohesive variants from a set of text plans. Centering theory (CT) is a theory of discourse structure which models the interaction of cohesion and salience in the internal organisation of a text. (See Grosz et al 1995; cf Hardt 1998 for a more formal treatment.) The main assumptions of the theory are: 1. For each utterance in a discourse there is precisely one entity which is the centre of attention or center. The center in an utterance U n is the most grammatically salient entity realised in U n?1 which is also realised in U n. This is also referred to as the backward-looking center or Cb. 2. There is a preference for consecutive utterances within a discourse segment to keep the same entity as the center, and for the center to be realised as Subject or preferred center (Cp). We refer to these principles as cohesion and salience respectively. (Hardt's (1998) formalization only covers the first of these.) Pairs of successive utterances hU n ; U n+1 i are classified into the transition types shown in Fig. 1, in order of preference. 3. The center is the entity which …
منابع مشابه
ITRI-00-08 An integrated framework for text planning and pronominalisation
This paper describes an implemented system which uses centering theory for planning of coherent texts and choice of referring expressions. We argue that text and sentence planning need to be driven in part by the goal of maintaining referential continuity and thereby facilitating pronoun resolution: obtaining a favourable ordering of clauses, and of arguments within clauses, is likely to increa...
متن کاملOptimizing Referential Coherence in Text Generation
This article describes an implemented system which uses centering theory for planning of coherent texts and choice of referring expressions. We argue that text and sentence planning need to be driven in part by the goal of maintaining referential continuity and thereby facilitating pronoun resolution: Obtaining a favorable ordering of clauses, and of arguments within clauses, is likely to incre...
متن کاملAn investigation into the use of Centering transitions for summarisation
This paper investigates the usefulness of Centering Theory, a theory of local coherence and salience, for automatic summarisation. The transitions computed for utterances in different types of texts are examined for patterns which relate to the coherence of summaries. These transitions are also employed in preliminary experiments as an evaluation method by which to judge extracts as readable an...
متن کاملITRI - 00 - 19 Using an abstract rhetorical representation to generate a variety of pragmatically congruent texts
In order for a text planner to produce all the possible pragmatically congruent texts and only these, we distinguish between abstract and concrete rhetorical representations of a text. We discuss these representations and present our methodology for exploring the mappings from the underlying message to the actual surface discourse.
متن کاملAn empirical study for generating zero pronoun in Korean based on Cost-based centering model
In Korean, in order to generate a coherent text, a redundantly prominent noun should be replaced by a non-zero pronoun or zero pronoun. Otherwise, the text becomes unnatural. Specifically, a redundant noun in Korean is frequently omitted while a redundant noun in English is replaced by a pronoun. This paper proposes a generation algorithm of the zero pronoun, using a Cost-based Centering Model ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999