A Parser-based Text Preprocessor for Romanian Language Tts Synthesis
ثبت نشده
چکیده
Text preprocessing plays an important role in a textto-speech (TTS) synthesis system. The correct detection and interpretation of input strings influence the overall system accuracy and contribute to the conversion of an unrestricted text into synthetic speech. This paper describes the design philosophy of a preprocessing module for a TTS system in Romanian language. The preprocessor is implemented using the standard flex/bison lexer and parser generators. The paper discusses the text preprocessing task and the major difficulties connected with Romanian language, proposes a set of definitions and rules, gives some implementation details and concludes with a few considerations about the TTS system and performances of the preprocessing module.
منابع مشابه
A parser-based text preprocessor for romanian language TTS synthesis
Text preprocessing plays an important role in a textto-speech (TTS) synthesis system. The correct detection and interpretation of input strings influence the overall system accuracy and contribute to the conversion of an unrestricted text into synthetic speech. This paper describes the design philosophy of a preprocessing module for a TTS system in Romanian language. The preprocessor is impleme...
متن کاملA text analyzer for Korean text-to-speech systems
In developing a text-to-speech system, it is well known that the accuracy of information extracted from a text is crucial to produce high quality synthesized speech. In this paper, by transferring probabilistic natural language processing techniques into TTS system eld, we develop a more robust text analyzer with high accuracy for Korean TTS systems. The proposed system is composed of ve module...
متن کاملMultilingual text analysis for text-to-speech synthesis
We present a model of text analysis for text-to-speech (TTS) synthesis based on (weighted) finite-state transducers, which serves as the text-analysis module of the multilingual Bell Labs TTS system. The transducers are constructed using a lexical toolkit that allows declarative descriptions of lexicons, morphological rules, numeral-expansion rules, and phonological rules, inter alia. To date, ...
متن کاملA Phonetic Converter for Speech Synthesis in Romanian
Letter-to-phone conversion, as part of the natural language processing stage, plays a very important role in text-to-speech (TTS) synthesis because it associates an appropriate phonetic transcription with each word of the sentence to be pronounced. The classical approach for the phonetic conversion is based in most TTS systems on either a dictionary or a set of rules. Because both methods have ...
متن کاملMixed-lingual text analysis for polyglot TTS synthesis
Text-to-speech (TTS) synthesis is more and more confronted with the language mixing phenomenon. An important step towards the solution of this problem and thus towards a socalled polyglot TTS system is an analysis component for mixedlingual texts. In this paper it is shown how such an analyzer can be realized for a set of languages, starting from a corresponding set of monolingual analyzers whi...
متن کامل