A two-layer lexical tree based beam search in continuous Chinese speech recognition
نویسندگان
چکیده
In this paper, an approach to continuous speech recognition based on a two-layer lexical tree is proposed. The search network is maintained by the two-layer lexical tree, in which the first layer reflects the word net and the phone net while the second layer the dynamic programming (DP). Because the acoustic information is tied in the second layer, the memory cost is so small that it has the ability to process some complicated applications, such as the use of cross-word context-dependent (CD) triphone models, the Chinese fuzzy syllable mapping and the pronunciation modeling. The search algorithm based on the two-layer lexical tree is also proposed, which is derived from the token-passing algorithm. Finally, an implementation of the two-layer lexical tree using the crossword context-dependent triphone models is presented, and the experimental results show that the highly efficient decoding can be achieved without too much memory cost.
منابع مشابه
Improvements in search algorithm for large vocabulary continuous speech recognition
Current time-synchronous beam-search algorithm is improved from two aspects for speeding up large vocabulary continuous speech recognition. Single-triphone-tree structure is proposed to take instead of the tree copy technique for simplifying the search computation and saving the memory . By one kind of special-designed token propagation strategy, the n-gram language model can be integrated into...
متن کاملA syllable-synchronous network search algorithm for word decoding in Chinese speech recognition
The Chinese language is syllabic in nature with frequent homonym phenomena and severe word boundary uncertainty problem. This makes the Chinese continuous speech recognition (CSR) slightly difficult. In order to solve these problems, a Chinese syllable-synchronous network search (SSNS) algorithm is proposed. Together with the vocabulary word search tree and the N-gram based language model, the ...
متن کاملLexical access to large vocabularies for speech recognition
A large vocabulary isolated word recognition system based on the hypothesize-and-test paradigm is described. The system has been, however, devised as a word hypothesizer for a continuous speech understanding system able to answer to queries put to a geographical database. Word preselection is achieved by segmenting and classifying the input signal in terms of broad phonetic classes. Due to low ...
متن کاملImproved context-dependent acoustic modeling for continuous Chinese speech recognition
This paper describes the new framework of context-dependent (CD) Initial/Final (IF) acoustic modeling using the decision tree based state tying for continuous Chinese speech recognition. The Extended Initial/Final (XIF) set is chosen as the basic speech recognition unit (SRU) set according to the Chinese language characteristics, which outperforms the standard IF set. An adaptive mixture increa...
متن کاملStatistical knowledge based frame synchronous search strategies in continuous speech recognition
In this paper, we propose a novel and efficient search algorithm for the Continuous Speech Recognition (CSR). The proposed algorithm is on the basis of the traditional Frame Synchronous Search (FSS) algorithm. It makes full use of some statistical knowledge, such as the Differential State Dwelling Distribution (DSDD), as one of the control factors for the state transition. It also incorporates ...
متن کامل