A language-independent, data-oriented architecture for grapheme-to-phoneme conversion
Authors
Abstract
We report on an implemented grapheme-to-phoneme conversion architecture. Given a set of examples (spelling words with their associated phonetic representation) in a language, a grapheme-to-phoneme conversion system is automatically produced for that language which takes as its input the spelling of words, and produces as its output the phonetic transcription according to the rules implicit in the training data. This paper describes the architecture and focuses on our solution to the alignment problem: given the spelling and the phonetic transcription of a word (often differing in length), these two representations have to be aligned in such a way that grapheme symbols or strings of grapheme symbols are consistently associated with the same phonetic symbol. If this alignment has to be done by hand, it is extremely labour-intensive.
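To make the alignment problem concrete, here is a minimal sketch of one way to align a spelling with a shorter phonetic transcription. It is not the algorithm from the paper, which the abstract does not spell out. It assumes a hypothetical table `score` that rates how well a grapheme string matches a phoneme, and uses dynamic programming to split the spelling into one chunk of consecutive graphemes per phoneme. The names `align`, `score`, and `max_chunk`, the penalty of -1.0 for unseen pairs, and the toy phoneme symbols are all illustrative assumptions.

```python
def align(spelling, phonemes, score, max_chunk=3):
    """Split `spelling` into len(phonemes) consecutive chunks, one per phoneme.

    `score` maps (grapheme_chunk, phoneme) pairs to a number (hypothetical
    association strengths); unseen pairs get a penalty of -1.0.
    Returns a list of (chunk, phoneme) pairs, or None if no split exists.
    """
    n, m = len(spelling), len(phonemes)
    neg_inf = float("-inf")
    # best[i][j]: best total score for aligning spelling[:i] with phonemes[:j]
    best = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    # back[i][j]: length of the last chunk in that best alignment
    back = [[0] * (m + 1) for _ in range(n + 1)]
    best[0][0] = 0.0

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            for k in range(1, min(max_chunk, i) + 1):
                prev = best[i - k][j - 1]
                if prev == neg_inf:
                    continue
                chunk = spelling[i - k:i]
                total = prev + score.get((chunk, phonemes[j - 1]), -1.0)
                if total > best[i][j]:
                    best[i][j] = total
                    back[i][j] = k

    if best[n][m] == neg_inf:
        return None  # e.g. more phonemes than graphemes

    # Walk the back-pointers from the end to recover the chunks.
    pairs = []
    i, j = n, m
    while j > 0:
        k = back[i][j]
        pairs.append((spelling[i - k:i], phonemes[j - 1]))
        i -= k
        j -= 1
    return pairs[::-1]


# Toy association scores (made up for illustration).
toy_score = {("b", "b"): 1.0, ("oo", "u"): 1.0, ("k", "k"): 1.0}
print(align("book", ["b", "u", "k"], toy_score))
```

With the toy scores, the four letters of "book" are grouped as "b", "oo", "k" against the three phonemes. In a data-oriented setting such scores would have to be estimated from the training examples rather than written by hand.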
Similar resources
Language-independent Data-oriented Grapheme-to-phoneme Conversion
We describe an approach to grapheme-to-phoneme conversion which is both language-independent and data-oriented. Given a set of examples (spelling words with their associated phonetic representation) in a language, a grapheme-to-phoneme conversion system is automatically produced for that language which takes as its input the spelling of words, and produces as its output the phonetic transcripti...
A Language-Independent, Data-Oriented Architecture for Grapheme-to
We report on an implemented grapheme-to-phoneme conversion architecture. Given a set of examples (spelling words with their associated phonetic representation) in a language, a grapheme-to-phoneme conversion system is automatically produced for that language which takes as its input the spelling of words, and produces as its output the phonetic transcription according to the rules implicit in t...
Language-Independent Data-Oriented Grapheme
We describe an approach to grapheme-to-phoneme conversion which is both language-independent and data-oriented. Given a set of examples (spelling words with their associated phonetic representation) in a language, a grapheme-to-phoneme conversion system is automatically produced for that language which takes as its input the spelling of words, and produces as its output the phonetic transcription ...
Letter-to-Phoneme Conversion for a German Text-to-Speech System
This thesis deals with the conversion from letters to phonemes, syllabification and word stress assignment for a German text-to-speech system. In the first part of the thesis (chapter 5), several alternative approaches for morphological segmentation are analysed and the benefit of such a morphological preprocessing component is evaluated with respect to the grapheme-to-phoneme conversion algori...
Language-independent Grapheme-phoneme Conversion and Word Stress Assignment as a Web Service
We introduce a new language-independent procedure for grapheme-phoneme conversion, syllabification, and word stress assignment. Grapheme-phoneme conversion and syllabification is carried out by means of fallback sequences of decision trees trained on varying context sizes. Word stress is determined within an analogy-based framework by means of a Bayes classifier. Evaluation results on six langu...