An Automatic Syllable Segmentation Method for Mandarin Speech
Abstract
An automatic syllable segmentation method for Mandarin speech is proposed. The method uses five features together with the corresponding phonetic transcriptions. First, the speech signal is pre-filtered. Second, the pre-filtered signal is divided into 30 ms segments, and the five features are computed for each segment. Finally, syllable segmentation is performed based on the phonetic transcriptions and the computed feature values. The performance of the method has been evaluated using a large speech database. The method is shown to perform well on both clean and noise-degraded speech.
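The first two stages described in the abstract (pre-filtering, then cutting the signal into 30 ms segments and computing per-segment features) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not name the pre-filter or the five features, so the first-order pre-emphasis filter, the non-overlapping framing, and the two features shown (short-time energy and zero-crossing rate) are all assumptions, as are the function names `pre_filter`, `split_frames`, `frame_features`, and `analyze`.

```python
from typing import List, Tuple


def pre_filter(signal: List[float], alpha: float = 0.97) -> List[float]:
    """First-order pre-emphasis: y[n] = x[n] - alpha * x[n-1].

    Assumed stand-in; the abstract does not specify the pre-filter.
    """
    if not signal:
        return []
    out = [signal[0]]
    for n in range(1, len(signal)):
        out.append(signal[n] - alpha * signal[n - 1])
    return out


def split_frames(signal: List[float], sample_rate: int,
                 frame_ms: float = 30.0) -> List[List[float]]:
    """Cut the signal into 30 ms frames.

    Non-overlapping frames are an assumption; a trailing partial
    frame is dropped.
    """
    frame_len = int(round(sample_rate * frame_ms / 1000.0))
    if frame_len < 2:
        raise ValueError("sample rate too low for the requested frame length")
    n_frames = len(signal) // frame_len
    return [signal[i * frame_len:(i + 1) * frame_len] for i in range(n_frames)]


def frame_features(frame: List[float]) -> Tuple[float, float]:
    """Return (short-time energy, zero-crossing rate) for one frame.

    These two features are placeholders for the paper's five features,
    which the abstract does not list.
    """
    energy = sum(x * x for x in frame) / len(frame)
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    zcr = crossings / (len(frame) - 1)
    return energy, zcr


def analyze(signal: List[float], sample_rate: int) -> List[Tuple[float, float]]:
    """Pre-filter the signal, frame it, and compute features per frame."""
    frames = split_frames(pre_filter(signal), sample_rate)
    return [frame_features(f) for f in frames]
```

In the method as described, the final stage would combine these per-segment feature values with the phonetic transcription to place syllable boundaries. That stage is not sketched here because the abstract gives no detail on how the two are combined.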
Related resources
Automatic initial and final segmentation in cleft palate speech of Mandarin speakers
The speech unit segmentation is an important pre-processing step in the analysis of cleft palate speech. In Mandarin, one syllable is composed of two parts: initial and final. In cleft palate speech, the resonance disorders occur at the finals and the voiced initials, while the articulation disorders occur at the unvoiced initials. Thus, the initials and finals are the minimum speech units, whi...
Decision Tree Classification Approach for Model Selection in Segmenting Mandarin TTS Corpus
High-accuracy automatic segmentation of a Mandarin TTS (text-to-speech) corpus is vital for obtaining high-quality syllable boundaries for corpus-based speech synthesis. Most existing studies on automatic segmentation are based on a single model, ignoring the diverse time marks produced by different models in specific Mandarin boundary environments. In this paper, three hidden Marko...
Automatic Segmentation and Labeling for Mandarin Chinese Speech Corpus for Concatenation-based TTS
Cheng-Yuan Lin, Jyh-Shing Roger Jang, Kuan-Ting Chen (Multimedia Information Retrieval Laboratory, Dept. of Computer Science, National Tsing Hua University, HsingChu, Taiwan). Precise phone/syllable boundary labeling of utterances in a speech corpus plays an important role in constructing corpus-b...
Prosody Modeling of Spontaneous Mandarin Speech and Its Application to Automatic Speech Recognition
A prosody-assisted ASR approach for spontaneous Mandarin speech is proposed. It employs the joint prosody labeling and modeling algorithm proposed previously to construct a hierarchical prosodic model (HPM) and uses it in two-stage speech recognition. A word lattice is first generated by the HMM method using tri-phone AM and bigram LM. Then, the lattice is extended by replacing LM to a trigram ...
Analysis on command sequences of a F0 generation model for Mandarin speech and its application to their automatic extraction
In this paper, we report on Mandarin F0 characteristics analyses using an F0 generation model and present experimental results on the automatic extraction of their control parameters. To cope with difficulties in automatic extraction of control parameters for F0 generation model proposed by Fujisaki, generation command sequences were extracted and analyzed for two-syllable and three-syllable Ma...
Publication date: 2012