Improved Prediction of Japanese Word Accent Sandhi Using CRF

Authors

Nobuaki Minematsu

Shumpei Kobayashi

Shinya Shimizu

Keikichi Hirose

Abstract

In Japanese, every content word has its own mora-based H/L pitch pattern when uttered in isolation, called its accent type. When a written sentence is read aloud, however, this lexical H/L pattern often changes according to context, a phenomenon known as word accent sandhi. In our previous work, an accent sandhi predictor was developed using CRF [1]. In this paper, the predictor is improved through feature engineering, focusing especially on phrases that include numerals and phrases that include loanwords, because our previous work showed relatively low prediction performance for those phrases. To optimize the features used for the CRF, it is critical to take the mechanism of word accent sandhi into account. We review the linguistic and technical literature that has attempted to characterize accent sandhi in phrases including numerals and loanwords, and redesign the features to reflect these characteristics. Experiments show that the proposed predictor improved performance by a relative 37% and 41% for phrases including numerals and loanwords, respectively.
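As an illustration of what a mora-based H/L pattern looks like, the following minimal sketch expands an accent type (the position of the accent nucleus, with 0 meaning no nucleus) into an H/L string. It follows the common textbook simplification for Tokyo Japanese and is not the feature representation used in the paper; the function name `hl_pattern` and its parameters are hypothetical.

```python
def hl_pattern(n_morae: int, accent_type: int) -> str:
    """Return a simplified H/L pitch string for an isolated word.

    Assumption: textbook Tokyo-Japanese convention, where accent_type is
    the 1-based mora index of the accent nucleus and 0 means unaccented.
    """
    if n_morae < 1:
        raise ValueError("a word needs at least one mora")
    if not 0 <= accent_type <= n_morae:
        raise ValueError("accent_type must lie between 0 and n_morae")

    if accent_type == 0:
        # Unaccented (flat) type: low first mora, high thereafter.
        return "L" + "H" * (n_morae - 1)
    if accent_type == 1:
        # Nucleus on the first mora: high, then low to the end.
        return "H" + "L" * (n_morae - 1)
    # Nucleus on mora k >= 2: low first mora, high up to k, low after k.
    return "L" + "H" * (accent_type - 1) + "L" * (n_morae - accent_type)


if __name__ == "__main__":
    for morae, acc in [(4, 0), (3, 1), (4, 2), (3, 3)]:
        print(morae, acc, hl_pattern(morae, acc))
```

Accent sandhi prediction can then be viewed as deciding how these isolated-word patterns change once words are combined into a phrase, which is the labeling problem the abstract assigns to the CRF.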


Similar resources

CRF-based statistical learning of Japanese accent sandhi for developing Japanese text-to-speech synthesis systems

In Japanese, every content word has its own H/L pitch pattern when it is uttered in isolation, called accent type. In a TTS system, this lexical information is usually stored in a dictionary and is referred to for prosody generation. When converting a written sentence to speech, however, this lexical H/L pattern is often changed according to the context, known as word accent sandhi. This accen...


Improvement of CRF-Based Accent Sandhi Prediction Using The Features Derived from Accent Rules

When developing Japanese text-to-speech (TTS) systems, algorithms that accurately predict the accent type of each constituent phrase are essential for better output speech quality. In our previous studies on accent type estimation, a CRF-based method was realized. Although this method outperformed the conventional rule-based method, the estimation accuracy of particular phrases such as those incl...


Accent Sandhi Estimation of Tokyo Dialect of Japanese Using Conditional Random Fields

When synthesizing speech from Japanese text, correct assignment of accent nuclei for input text with arbitrary content is indispensable for obtaining natural-sounding synthetic speech. A phenomenon called accent sandhi occurs in Japanese utterances: when a word is uttered in a sentence, its accent nucleus may change depending on the context of preceding and succeeding words. This paper descri...


Development and Evaluation of Online Infrastructure to Aid Teaching and Learning of Japanese Prosody

This paper develops an online, freely available framework to aid teaching and learning of the prosodic control of Tokyo Japanese: how to generate adequate word accent and phrase intonation. This framework is called OJAD (Online Japanese Accent Dictionary) [1] and it provides three features. 1) Visual, auditory, systematic, and comprehensive illustration of patterns of accent change (accent ...


Development of a web framework for teaching and learning Japanese prosody: OJAD (online Japanese accent dictionary)

This paper introduces the first online and free framework for teaching and learning Japanese prosody including word accent and phrase intonation. This framework is called OJAD (Online Japanese Accent Dictionary) [1] and it provides three functions. 1) Visual, auditory, systematic, and comprehensive illustration of patterns of accent change (accent sandhi) of verbs and adjectives. Here only the ...



Publication date: 2012
