Language-independent Data-oriented Grapheme-to-phoneme Conversion
Authors
Abstract
We describe an approach to grapheme-to-phoneme conversion which is both language-independent and data-oriented. Given a set of examples (spelling words with their associated phonetic representation) in a language, a grapheme-to-phoneme conversion system is automatically produced for that language which takes as its input the spelling of words, and produces as its output the phonetic transcription according to the rules implicit in the training data. We describe the design of the system, and compare its performance to knowledge-based and alternative data-oriented approaches.
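The data-oriented idea in the abstract (deriving a converter purely from spelling/transcription examples) can be illustrated with a minimal sketch. This is not the authors' system: it assumes the training pairs are already aligned so that each letter corresponds to exactly one phoneme symbol, and it simply memorises letter-in-context windows, backing off to narrower contexts for unseen windows. All names (`windows`, `train`, `transcribe`, `EXAMPLES`) and the toy data are hypothetical.

```python
from collections import Counter, defaultdict

PAD = "_"  # hypothetical padding symbol for word boundaries

# Toy, hand-made training pairs (assumption: one phoneme symbol per letter).
EXAMPLES = [
    ("cat", ["k", "a", "t"]),
    ("cot", ["k", "o", "t"]),
    ("city", ["s", "i", "t", "i"]),
]


def windows(word, k):
    """Yield, for each letter, that letter with k letters of context on each side."""
    padded = PAD * k + word + PAD * k
    for i in range(len(word)):
        yield padded[i:i + 2 * k + 1]


def train(examples, max_k=2):
    """Count which phoneme each context window maps to, for context sizes 0..max_k."""
    tables = [defaultdict(Counter) for _ in range(max_k + 1)]
    for spelling, phonemes in examples:
        if len(spelling) != len(phonemes):
            raise ValueError("sketch assumes one phoneme symbol per letter")
        for k in range(max_k + 1):
            for window, phoneme in zip(windows(spelling, k), phonemes):
                tables[k][window][phoneme] += 1
    return tables


def transcribe(word, tables):
    """Transcribe letter by letter, backing off from the widest context to narrower ones."""
    max_k = len(tables) - 1
    per_k = [list(windows(word, k)) for k in range(max_k + 1)]
    out = []
    for i in range(len(word)):
        for k in range(max_k, -1, -1):
            counts = tables[k].get(per_k[k][i])
            if counts:
                # Most frequent phoneme observed for this window in the training data.
                out.append(counts.most_common(1)[0][0])
                break
        else:
            out.append("?")  # letter never seen in training
    return out
```

Calling `transcribe("cit", train(EXAMPLES))` uses the widest matching context for each letter, so the initial "c" follows the pattern learned from "city" rather than from "cat". Nothing language-specific is coded in: changing the language only means changing the example pairs.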
Similar resources
Language-Independent Data-Oriented Grapheme
We describe an approach to grapheme-to-phoneme conversion which is both language-independent and data-oriented. Given a set of examples (spelling words with their associated phonetic representation) in a language, a grapheme-to-phoneme conversion system is automatically produced for that language which takes as its input the spelling of words, and produces as its output the phonetic transcription ...
A language-independent, data-oriented architecture for grapheme-to-phoneme conversion
We report on an implemented grapheme-to-phoneme conversion architecture. Given a set of examples (spelling words with their associated phonetic representation) in a language, a grapheme-to-phoneme conversion system is automatically produced for that language which takes as its input the spelling of words, and produces as its output the phonetic transcription according to the rules implicit in the ...
A Language-Independent, Data-Oriented Architecture for Grapheme-to
We report on an implemented grapheme-to-phoneme conversion architecture. Given a set of examples (spelling words with their associated phonetic representation) in a language, a grapheme-to-phoneme conversion system is automatically produced for that language which takes as its input the spelling of words, and produces as its output the phonetic transcription according to the rules implicit in t...
Language-independent Grapheme-phoneme Conversion and Word Stress Assignment as a Web Service
We introduce a new language-independent procedure for grapheme-phoneme conversion, syllabification, and word stress assignment. Grapheme-phoneme conversion and syllabification is carried out by means of fallback sequences of decision trees trained on varying context sizes. Word stress is determined within an analogy-based framework by means of a Bayes classifier. Evaluation results on six langu...
Investigations on joint-multigram models for grapheme-to-phoneme conversion
We present a fully data-driven, language independent way of building a grapheme-to-phoneme converter. We apply the joint-multigram approach to the alignment problem and use standard language modelling techniques to model transcription probabilities. We study model parameters, training procedures and effects of corpus size in detail. Experiments were conducted on English and German pronunciation...