Voice Quality Dependent Speech Recognition

نویسندگان

  • Tae-Jin Yoon
  • Xiaodan Zhuang
  • Jennifer Cole
  • Mark Hasegawa-Johnson
چکیده

Voice quality conveys both linguistic and paralinguistic information, and can be distinguished by acoustic source characteristics. We label objective voice quality categories based on the harmonic structure (H1-H2) and the mean autocorrelation ratio of each phone. Results from a Support Vector Machine (SVM) classification experiment show that these features are predictive of Perceptual Linear Predictive Cepstra (PLPC) used in speech recognition. We further demonstrate that by incorporating voice quality knowledge into a speech recognition system, we can improve word recogni-

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Voice-based Age and Gender Recognition using Training Generative Sparse Model

Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...

متن کامل

Speech recognition using voice-characteristic-dependent acoustic models

This paper proposes a speech recognition technique based on acoustic models considering voice characteristic variations. Context-dependent acoustic models, which are typically triphone HMMs, are often used in continuous speech recognition systems. This work hypothesizes that the speaker voice characteristics that humans can perceive by listening are also factors in acoustic variation for constr...

متن کامل

Voice Quality after Using Speech Recognition Software: Perceptual Results and Reliability

This study investigates the influence of using speech recognition software on voice quality. Two different groups of speakers (one group of subjects with a heavy daily vocal load and one control group) were subjected to different speech recognition tasks for 2 hours (either using discrete or continuous speech recognition software). Five listeners assessed the voice quality (14 parameters) befor...

متن کامل

Eigenvoices for Hmm-based

This paper describes an eigenvoice technique for an HMMbased speech synthesis system which can synthesize speech with various voice qualities. In the eigenvoice technique, which has successfully been applied to fast speaker adaptation in an HMM based speech recognition, a large number of speaker dependent HMM sets are represented by a few parameters through a dimensionality reduction technique,...

متن کامل

The Analysis of Voice Quality in Speech Processing

Voice quality has been defined as the characteristic auditory colouring of an individual's voice, derived from a variety of laryngeal and supralaryngeal features and running continuously through the individual's speech. The distinctive tone of speech sounds produced by a particular person yields a particular voice. Voice quality is at the centre of several speech processing issues. In speech re...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006