An Automatic Syllable Segmentation Method for Mandarin Speech

Author

  • Runshen Cai
Abstract

An automatic syllable segmentation method for Mandarin speech is proposed. The method uses five features together with the corresponding phonetic transcriptions. First, the speech signal is pre-filtered. Second, the pre-filtered signal is divided into 30 ms segments, and the five features are computed for each segment. Finally, syllable segmentation is performed based on the phonetic transcriptions and the computed feature values. The performance of the method has been evaluated on a large speech database. The method is shown to perform well on both clean and noise-degraded speech.
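
The first two steps of the abstract (pre-filtering, then cutting the signal into 30 ms segments and computing per-segment features) can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract names neither the pre-filter nor the five features, so the first-order pre-emphasis filter, the non-overlapping segmentation, and the two features shown (short-time energy and zero-crossing rate) are stand-ins chosen here. The function names are hypothetical. The final step, which combines the feature values with the phonetic transcriptions, is not described in enough detail to sketch.

```python
import numpy as np


def pre_emphasize(signal, coeff=0.97):
    """Assumed pre-filter: first-order pre-emphasis (the abstract does not name the filter)."""
    signal = np.asarray(signal, dtype=float)
    if signal.size == 0:
        return signal
    # y[0] = x[0]; y[n] = x[n] - coeff * x[n-1] for n >= 1
    return np.append(signal[0], signal[1:] - coeff * signal[:-1])


def split_into_segments(signal, sample_rate, segment_ms=30):
    """Cut the signal into non-overlapping segments of segment_ms milliseconds.

    Trailing samples that do not fill a whole segment are dropped.
    Non-overlapping segments are an assumption; the abstract only gives the length.
    """
    signal = np.asarray(signal, dtype=float)
    seg_len = int(sample_rate * segment_ms / 1000)
    n_segments = len(signal) // seg_len
    return signal[: n_segments * seg_len].reshape(n_segments, seg_len)


def segment_features(segments):
    """Compute two illustrative features per segment (not necessarily the paper's five)."""
    # Short-time energy: sum of squared samples in each segment.
    energy = np.sum(segments ** 2, axis=1)
    # Zero-crossing rate: fraction of adjacent sample pairs whose sign differs.
    signs = np.signbit(segments)
    zcr = np.mean(signs[:, 1:] != signs[:, :-1], axis=1)
    return np.column_stack([energy, zcr])
```

For a signal `x` sampled at 16 kHz, `segment_features(split_into_segments(pre_emphasize(x), 16000))` returns one row per 30 ms segment (480 samples each) and one column per feature.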

Similar resources

Automatic initial and final segmentation in cleft palate speech of Mandarin speakers

The speech unit segmentation is an important pre-processing step in the analysis of cleft palate speech. In Mandarin, one syllable is composed of two parts: initial and final. In cleft palate speech, the resonance disorders occur at the finals and the voiced initials, while the articulation disorders occur at the unvoiced initials. Thus, the initials and finals are the minimum speech units, whi...

Decision Tree Classification Approach for Model Selection in Segmenting Mandarin TTS Corpus

High-accuracy automatic segmentation of a Mandarin TTS (text-to-speech) corpus is vital for obtaining high-quality syllable boundaries for corpus-based speech synthesis. Among the existing methods, most studies on automatic segmentation are based on a single model, ignoring the diverse time marks produced by different models in specific Mandarin boundary environments. In this paper, three hidden Marko...

Automatic Segmentation and Labeling for Mandarin Chinese Speech Corpus for Concatenation-based TTS

Cheng-Yuan Lin, Jyh-Shing Roger Jang, Kuan-Ting Chen (Multimedia Information Retrieval Laboratory, Dept. of Computer Science, National Tsing Hua University, HsingChu, Taiwan; {gavins, jang, marco}@wayne.cs.nthu.edu.tw). Precise phone/syllable boundary labeling of utterances in a speech corpus plays an important role in constructing corpus-b...

Prosody Modeling of Spontaneous Mandarin Speech and Its Application to Automatic Speech Recognition

A prosody-assisted ASR approach for spontaneous Mandarin speech is proposed. It employs the joint prosody labeling and modeling algorithm proposed previously to construct a hierarchical prosodic model (HPM) and uses it in two-stage speech recognition. A word lattice is first generated by the HMM method using tri-phone AM and bigram LM. Then, the lattice is extended by replacing LM to a trigram ...

Analysis on command sequences of a F0 generation model for Mandarin speech and its application to their automatic extraction

In this paper, we report on Mandarin F0 characteristics analyses using an F0 generation model and present experimental results on the automatic extraction of their control parameters. To cope with difficulties in automatic extraction of control parameters for F0 generation model proposed by Fujisaki, generation command sequences were extracted and analyzed for two-syllable and three-syllable Ma...

Publication date: 2012