Phoneme-to-phoneme alignment and conversion
نویسندگان
چکیده
This paper deals with new methods for phoneme-to-phoneme (P2P) alignment and conversion. Alignment is carried out by dynamic programming for Levenshtein distance calculation. Cost functions based on phoneme co-occurrence statistics and on distinctive feature vector distances accounting for connected speech processes are comparatively evaluated. Given the aligned data, decision trees for P2P conversion across word boundaries are trained and evaluated. Amongst others it turned out, that while accounting for assimilation processes improved alignment quality, these quality differences showed no impact on P2P conversion performance.
منابع مشابه
Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM
Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...
متن کاملA Novel Approach to Unsupervised Grapheme–to–phoneme Conversion
Automatic, data-driven grapheme-to-phoneme conversion is a challenging but often necessary task. The top-down strategy implicitly adopted by traditional inductive learning techniques tends to dismiss relevant contexts when they have been seen too infrequently in the training data. This paper proposes instead a bottom-up approach which, by design, exhibits better generalization properties. For e...
متن کاملAllophone-based acoustic modeling for Persian phoneme recognition
Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...
متن کاملLetter-Phoneme Alignment: An Exploration
Letter-phoneme alignment is usually generated by a straightforward application of the EM algorithm. We explore several alternative alignment methods that employ phonetics, integer programming, and sets of constraints, and propose a novel approach of refining the EM alignment by aggregation of best alignments. We perform both intrinsic and extrinsic evaluation of the assortment of methods. We sh...
متن کاملA latent analogy framework for grapheme-to-phoneme conversion
Data-driven grapheme-to-phoneme conversion involves either (top-down) inductive learning or (bottom-up) pronunciation by analogy. As both approaches rely on local context information, they typically require some external linguistic knowledge, e.g., individual grapheme/phoneme correspondences. To avoid such supervision, this paper proposes an alternative solution, dubbed pronunciation by latent ...
متن کامل