A Generic Methodology of Converting Transliterated Text to Phonetic Strings. Case Study: Greeklish
Abstract
In this work, we present a generic methodology for converting transliterated text (a native language written with a non-native alphabet) to phonetic sequences. The goal is to produce the same phonetic result that a native speaker would produce when uttering the original text written in the native alphabet. We implemented this methodology as a front-end to a Text-to-Speech (TTS) server. To evaluate our algorithms, we considered the case of the Greek language written in the Latin alphabet, known as Greeklish.
Similar resources
Generic Analysis of Literary Translation: A Case Study of Contemporary English Short Stories
Translating a literary text is a difficult task, because understanding literature requires knowledge of the various linguistic levels of a literary text in addition to strategies and methods of translation. To this should be added cognitive-based translation training, which helps practitioners preserve the aesthetic aspects of a literary text. Focusing on the short story as a genre with both ...
Backward Machine Transliteration by Learning Phonetic Similarity
In many cross-lingual applications we need to convert a transliterated word into its original word. In this paper, we present a similarity-based framework to model the task of backward transliteration, and provide a learning algorithm to automatically acquire phonetic similarities from a corpus. The learning algorithm is based on the Widrow-Hoff rule with some modifications. The experiment results sho...
A Random Forests Text Transliteration System for Greek Digraphia
Greeklish-to-Greek transcription is a challenging task, since it cannot be accomplished by directly mapping each Greek character to a corresponding symbol of the Latin alphabet. Because Greeklish users do not follow a standardized way of transliteration, the ambiguity in how people write Greeklish makes the process of transcribing Greeklish back to the Greek alphabet ch...
A Full Inspection on Chinese Characters Used in the Secrete History of the Mongols
The Secrete History of the Mongols (SHM) is a special Mongolian historical document transliterated with Chinese characters as phonetic symbols, which feature causes many multivariate analyses in later ages. The discussion of this paper mainly focuses on the data of used Chinese characters, the frequencies, and the rules of transliteration on SHM. All the statistic data and the analysis are base...
Translating Transliterations
Translating new entity names is important for improving performance in Natural Language Processing (NLP) applications such as Machine Translation (MT) and Cross Language Information Retrieval (CLIR). Usually, transliteration is used to obtain phonetic equivalents in a target language for a given source language word. However, transliteration across different writing systems often results in dif...