Transient map method in stop consonant discrimination

نویسندگان

  • Jari Kangas
  • Teuvo Kohonen
چکیده

Discrimination between the voiceless stop consonants !k,p,tl is a subproblern in phoneme-based speech recognition systems. Lack of energy during the pronunciation and the fast transient effects at the end of the phoneme make the recognition difficult. A method of so called Phonotopic Maps [2] was studied in order to develop simple and effective solutions for discrimination. In the following studies the method of 'fransient Maps, a derivative of Phonotopic Maps, was found to be an easy-to-implement and powerful algorithm for real-time speech recognition systems. It contains an automatic learning algorithm that tunes the discrimination elements to detect the differences between the spectra at the end of the stop consonant. Using Transient Maps it is possible to classify correctly 80 to 90 percent of all voiceless stop consonants in our speech recognition system. Thus the recognition accuracy of voiceless stop consonants is comparable to that of the other phonemes.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Application of Local Binary Patterns for SVM based Stop Consonant Detection

Detection of acoustic phonetic landmarks is useful for a variety of speech processing applications such as automatic speech recognition.The majority of existing methods use Melfrequency Cepstral Coefficients (MFCCs) describing the short time power spectral envelope of the speech signal. This paper hypothesizes that a different feature extraction method can be used to complement MFCCs by capturi...

متن کامل

Consonant Class Discrimination in Dysarthric Speech Based on Support Vector Machine Using Class- Dependent Acoustic Parameters

In this paper, we propose a consonant class discrimination (CCD) method in dysarthric speech, where a support vector machine (SVM) is employed by using class-dependent acoustic parameters. To this end, each consonant is categorized into one of five classes according to the manner of articulation such as stop, affricate, fricative, nasal and glide. In the proposed CCD method using SVM, acoustic ...

متن کامل

Consonant discrimination in elicited and spontaneous speech: a case for signal-adaptive front ends in ASR

The constant frame length in typical ASR front ends is too long to capture transient phenomena in speech, such as stop bursts. However, current HMM systems have consistently outperformed systems based solely on non-uniform units. This work investigates an approach to “add back” such transient information to a speech recognizer, without losing the robustness of the standard acoustic models. We d...

متن کامل

Consonant burst enhancement: a possible means to improve intelligibility for the hard of hearing.

The possibility of using a circuit to amplify selectively the burst of a stop consonant is investigated. It is shown that such a circuit used in the speech channel of an amplifying system can improve the discrimination between stop consonants. Such a system could be of value in assisting the hard of hearing.

متن کامل

Temporal encoding of the voice onset time phonetic parameter by field potentials recorded directly from human auditory cortex.

Voice onset time (VOT) is an important parameter of speech that denotes the time interval between consonant onset and the onset of low-frequency periodicity generated by rhythmic vocal cord vibration. Voiced stop consonants (/b/, /g/, and /d/) in syllable initial position are characterized by short VOTs, whereas unvoiced stop consonants (/p/, /k/, and t/) contain prolonged VOTs. As the VOT is i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1989