Based Persian Viseme Clustering
نویسندگان
چکیده
Viseme (Visual Phoneme) clusterin every language is among the most important conducting various multimedia researches as reading, lip synchronization and com pronunciation training applications. With re that clustering and analyzing visemes are lan processes, we concentrated our research on P which indeed has suffered from lack of su paper, we used a hierarchical approach for c in Persian language based on principal compo polynomial kernel matrix considering coar Having obtained feature vector of each phon unweighted pair group method with arithme projected viseme on constructed manifold neighbor of the weight value as a result of rec as the criterion for comparing viseme dissimi indicate the robustness of the proposed alg experiments was conducted on Persian datab syllables were examined. Comparing the clustering algorithm with that of the percept an expert proves a reasonable evaluation algorithm. KeywordsAudio/Visual processing, C pronunciation training, Persian Viseme clu manifold.
منابع مشابه
Toward Clustering Persian Vowel Viseme: A New Clustering Approach based on HMM
This paper sorts out the problem of Persian Vowel viseme clustering. Clustering audio-visual data has been discussed for a decade or so. However, it is an open problem due to shortcoming of appropriate data and its dependency to target language. Here, we propose a speaker-independent and robust method for Persian viseme class identification as our main contribution. The overall process of the p...
متن کاملPersian Viseme Classification Using Interlaced Derivative Patterns and Support Vector Machine
Viseme (Visual Phoneme) classification and analysis in every language are among the most important preliminaries for conducting various multimedia researches such as talking head, lip reading, lip synchronization, and computer assisted pronunciation training applications. With respect to the fact that analyzing visemes is a language dependent process, we concentrated our research on Persian lan...
متن کاملAutomatic Viseme Clustering for Audiovisual Speech Synthesis
A common approach in visual speech synthesis is the use of visemes as atomic units of speech. In this paper, phonemebased and viseme-based audiovisual speech synthesis techniques are compared in order to explore the balancing between data availability and an improved audiovisual coherence for synthesis optimization. A technique for automatic viseme clustering is described and it is compared to ...
متن کاملImproved visual speech synthesis using dynamic viseme k-means clustering and decision trees
We present a decision tree-based viseme clustering technique that allows visual speech synthesis after training on a small dataset of phonetically-annotated audiovisual speech. The decision trees allow improved viseme grouping by incorporating k-means clustering into the training algorithm. The use of overlapping dynamic visemes, defined by tri-phone time-varying oral pose boundaries, allows im...
متن کاملComprehensive many-to-many phoneme-to-viseme mapping and its application for concatenative visual speech synthesis
The use of visemes as atomic speech units in visual speech analysis and synthesis systems is well-established. Viseme labels are determined using a many-to-one phoneme-to-viseme mapping. However, due to the visual coarticulation effects, an accurate mapping from phonemes to visemes should define a many-to-many mapping scheme. In this research it was found that neither the use of standardized no...
متن کامل