segmental hmm

Unsupervised training of an HMM-based self-organizing unit recognizer with applications to topic classification and keyword discovery

Journal: :Computer Speech & Language 2014

Man-Hung Siu Herbert Gish Arthur Chan William Belfield Steve Lowe

We present our approach to unsupervised training of speech recognizers. Our approach iteratively adjusts sound units that are ptimized for the acoustic domain of interest. We thus enable the use of speech recognizers for applications in speech domains here transcriptions do not exist. The resulting recognizer is a state-of-the-art recognizer on the optimized units. Specifically we ropose buildi...

متن کامل

Unsupervised training of an HMM-based speech recognizer for topic classification

2009

Herbert Gish Man-Hung Siu Arthur Chan William Belfield

HMM-based Speech-To-Text (STT) systems are widely deployed not only for dictation tasks but also as the first processing stage of many automatic speech applications such as spoken topic classification. However, the necessity of transcribed data for training the HMMs precludes its use in domains where transcribed speech is difficult to come by because of the specific domain, channel or language....

متن کامل

A Fast Framework for the Constrained Mean Trajectory Segment Model by Avoidance of Redundant Computation on Segment

Journal: :IJCLCLP 2006

Yun Tang Wenju Liu Yiyan Zhang Bo Xu

The segment model (SM) is a family of methods that use the segmental distribution rather than frame-based density (e.g. HMM) to represent the underlying characteristics of the observation sequence. It has been proved to be more precise than HMM. However, their high level of complexity prevents these models from being used in practical systems. In this paper, we propose a framework that can redu...

متن کامل

Research on Segmentation and Labeling of Speech Corpora

1999

HE Xiaodong LIU Jian YU Tiecheng

In this paper, we suggested a Reference Sentence Alignment (RSA) method to segment and label the speech automatically based on the multiple pronunciation phoneme segmental kmeans algorithm and HMM. Furthermore, based on the search path created by this method, information of pitch and energy of speech can be obtained and labeled synchronously. This segmentation and labeling strategy was applied ...

متن کامل

An HMM-Based Mandarin Chinese Text-To-Speech System

2006

Yao Qian Frank K. Soong Yining Chen Min Chu

In this paper we present our Hidden Markov Model (HMM)-based, Mandarin Chinese Text-to-Speech (TTS) system. Mandarin Chinese or Putonghua, “the common spoken language”, is a tone language where each of the 400 plus base syllables can have up to 5 different lexical tone patterns. Their segmental and supra-segmental information is first modeled by 3 corresponding HMMs, including: (1) spectral env...

متن کامل

Protein homology detection by HMM-HMM comparison

Journal: :Bioinformatics 2004

متن کامل

Joint recognition and segmentation using phonetically derived features and a hybrid phoneme model

1998

Naomi Harte Saeed Vaseghi Ben P. Milner

This paper encompasses the approaches of segmental modelling and the use of dynamic features in addressing the constraints of the IID assumption in standard HMM. Phonetic features are introduced which capture the transitional dynamics across a phoneme unit via a DCT transformation of a variable length segment. Alongside this, the use of a hybrid phoneme model is proposed. Classification experim...

متن کامل

Part-of-Speech Effects on Text-to-Speech Synthesis

2010

Georg I. Schlünz Etienne Barnard Gerhard B. van Huyssteen

One of the goals of text-to-speech (TTS) systems is to produce natural-sounding synthesised speech. Towards this end various natural language processing (NLP) tasks are performed to model the prosodic aspects of the TTS voice. One of the fundamental NLP tasks being used is the part-of-speech (POS) tagging of the words in the text. This paper investigates the effects of POS information on the na...

متن کامل

HMM training strategy for incremental speech synthesis

2015

Maël Pouget Thomas Hueber Gérard Bailly Timo Baumann

Incremental speech synthesis aims at delivering the synthetic voice while the sentence is still being typed. One of the main challenges is the online estimation of the target prosody from a partial knowledge of the sentence’s syntactic structure. In the context of HMM-based speech synthesis, this typically results in missing segmental and suprasegmental features, which describe the linguistic c...

متن کامل

The USTC System for Blizzard Challenge 2012

2012

Zhen-Hua Ling Xian-Jun Xia Yang Song Chen-Yu Yang Ling-Hui Chen Li-Rong Dai

This paper introduces the speech synthesis system developed by USTC for Blizzard Challenge 2012. An audiobook speech corpus is adopted as the training data for system construction this year. Similar to our previous systems, the hidden Markov model (HMM) based unit selection and waveform concatenation approach is followed to develop our speech synthesis system using this corpus. Considering the ...

متن کامل