Accent type recognition and syntactic boundary detection of Japanese using statistical modeling of moraic transitions of fundamental frequency contours
نویسندگان
چکیده
Experiments on accent type recognition and syntactic boundary detection of Japanese speech were conducted based on the statistical modeling of voice fundamental frequency contours formerly proposed by the authors. In the proposed modeling, fundamental frequency contours are segmented into moraic units to generate moraic contours, which are further represented by discrete codes. After modeling the accent types and syntactic boundaries, their recognition/detection was done for ATR speech corpus. As for the accent type recognition, 4-mora words were used for the training and testing, and recognition rates around 74 % were obtained for speaker open experiments. As for the syntactic boundary detection, detectability of accent phrase boundaries was tested for sentence speech. Although the experiments were conducted only for the closed condition due to availability of speech corpus, the result indicated the usefulness of separating the boundary model into two depending on whether the boundary is accompanied by a pause or not.
منابع مشابه
A method of representing fundamental frequency contours of Japanese using statistical models of moraic transition
A statistical modeling of voice fundamental frequency contours was proposed for the purpose of developing effective ways to utilize prosodic features in speech recognition. In view of the fact that prosodic features should be treated in longer units, the proposed modeling represents the transition in moraic units. A fundamental frequency contour was rst segmented into moraic units and then each...
متن کاملRepresenting prosodic words using statistical models of moraic transition of fundamental frequency contours of Japanese
We have formerly proposed a statistical model of moraic transitions of fundamental frequency (F0) contours and showed its e ectiveness for prosodic boundary detection and accent type recognition. This model represented F0 contours of prosodic words to simultaneously detect and recognize prosodic word boundaries and accent types. This paper proposes a method where prosodic word F0 contours are m...
متن کاملProsodic word boundary detection using statistical modeling of moraic fundamental frequency contours and its use for continuous speech recognition
A new method for prosodic word boundary detection in continuous speech was developed based on the statistical modeling of moraic transitions of fundamental frequency (F 0 ) contours, formerly proposed by the authors. In the developed method, F 0 contours of prosodic words were modeled separately according to the accent types. An input utterance was matched against the models and was divided int...
متن کاملDetection of prosodic word boundaries by statistical modeling of mora transitions of fundamental frequency contours and its use for continuous speech recognition
We have been developing a reliable method of prosodic word boundary detection for Japanese continuous speech based on the statistical modeling of mora transitions of fundamental frequency contours of prosodic words. Modifications in the codebook sizes and in the HMM topologies improved the boundary detection performance. When using mora boundary information obtainable from the phoneme recogniti...
متن کاملDetection of phrase boundaries in Japanese by low-pass filtering of fundamental frequency contours
Major syntactic boundaries are often accompanied by a rise in the phrase component of the fundamental frequency (F0) contour. Detecting such rises, therefore, can be signi cantly helpful to the speech recognition process. We developed a method to detect syntactic boundaries with phrasecomponent rise (henceforth, phrase boundaries), based on the compression of the accent component of the F0 cont...
متن کامل