A Parser For Real-Time Speech Synthesis Of Conversational Texts
نویسندگان
چکیده
In this paper, we concern ourselves with an application of text-to-speech for speech-impaired, deaf, and hard of hearing people. The application is unusual because it requires real-time synthesis of unedited, spontaneously generated conversational texts transmitted via a Telecommunications Device for the Deaf (TDD). We describe a parser that we have implemented as a front end for a version of the Bell Laboratories text-to-speech synthesizer (Olive and Liberman 1985). The parser prepares TDD texts for synthesis by (a) performing lexical regularization of abbreviations and some non-standard forms, and (b) identifying prosodic phrase boundaries. Rules for identifying phrase boundaries are derived from the prosodic phrase grammar described in Bachenko and Fitzpatrick (1990). Following the parent analysis, these rules use a mix of syntactic and phonological factors to identify phrase boundaries but, unlike the parent system, they forgo building any hierarchical structure in order to bypass the need for a stacking mechanism; this permits the system to operate in near real time. As a component of the text-to-speech system, the parser has undergone rigorous testing during a successful three-month field trial at an AT&T telecommunications center in California. In addition, laboratory evaluations indicate that the parser's performance compares favorably with human judgments about phrasing.
منابع مشابه
Lexicalized Tree Automata-Based Grammars For Translating Conversational Texts
We propose a new lexicalizcd grammar formalism called Lexicalized Tree Automata-based Grammar, which lcxicalizes tree acccptors instead of trees themselves. We discuss the properties of the grammar and present a chart parsing algorithm. Wc have implemented a translation module for conversational texts using this formalism, and applied it to an experimental automatic interpretation system (speec...
متن کاملAn Experimental Real - Time Speech - to - Speech Translation System *
This paper reports the current progress in the SPEECHTRANS project at the Center for Machine Translation which is a speech-to-speech translation project for real-time processing of speaker-independent noisy continuous speech input. SPEECHTRANS uses a custom speech recognition hardware and a phoneme-based generalized LR parser that uses a unification-based grammar formalism and a natural languag...
متن کاملTowards conversational speech synthesis; lessons learned from the expressive speech processing project
This paper discusses some ideas for the requirements and methods of conversational speech synthesis, based on experience gained from the collection and analysis of a very large corpus of conversational speech in a variety of real-life everyday contexts. It shows that because variation in voice quality plays a significant part in the transmission of interpersonal and affect-related social inform...
متن کاملReal-Time Interfaces for Speech and Singing
This paper introduces a new concept in speech synthesisers by constructing devices and methods by which they can be operated in real-time. Further development of this concept may lead to an improvement in the conversational capabilities of people with ‘speech communicators’. This paper outlines the current limitations of such systems and then describes the methods used to give the user real-tim...
متن کاملStudying impressive parameters on the performance of Persian probabilistic context free grammar parser
In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...
متن کامل