Pitch perception in speech: a time domain approach (defended in 1996)

Author

  • Henning Reetz
Abstract

This thesis deals with the perception of pitch of speech signals and ways to measure this pitch. The term 'pitch' is used as a collective term for speech production (where it is used for the fundamental frequency of quasi-periodically vibrating vocal folds), for acoustic transmission (where it is used for the periodicity of signals), and for speech perception (where it is used for the perceived pitch of speech signals). In the last case, the frequency of the perceived pitch of a signal is expressed by the fundamental frequency of a reference signal with a rather simple structure (e.g. a pure tone). Often, the values for 'pitch' are identical in speech production, transmission, and perception. But the frequency of the quasi-periodically vibrating vocal folds might be attenuated by the vocal tract shape, and a higher harmonic of the F0 can be amplified, which appears as clear periodicity in the acoustic signal and is perceived as pitch. Consequently, pitch and fundamental frequency are not always the same. As argued in Chapter 2, what is important for speech communication is what is perceived as pitch and not what is produced as fundamental frequency. Voice source and vocal tract filter information are not linearly independent, and they are perceived as one signal rather than being decomposed into separate components. This does not lessen the impact of Fant's (1960) source-filter model as an adequate way to describe speech production, but it states that perception is not simply an inversion of the production process. From this it follows directly that the motor theory of speech perception (Liberman & Mattingly, 1985) is untenable, because it predicts the perception of the intended gestures that generated a speech sound.
The view that source and filter information are perceived together as one entity is compatible with the invariance theory of speech perception (Blumstein & Stevens, 1981), because here all information in a speech signal is used as an integrated percept, without considering how a sound might have been produced. Finally, the proposed view of pitch perception is also in accordance with the auditory theory of speech perception (Seneff, 1985), because here the acoustic signal is perceived as one percept which is represented in two different ways simultaneously. This thesis argues for an integrated
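The abstract's point that an amplified higher harmonic can show up as the clearest periodicity in the signal can be illustrated with a small time-domain experiment. The sketch below is not the method developed in the thesis; it is a generic autocorrelation peak picker applied to a synthetic two-component tone. The function names (`synth`, `first_strong_period`), the 16 kHz sampling rate, the 50 to 500 Hz search range, the 0.9 threshold, and the amplitudes are all assumptions chosen for illustration.

```python
import numpy as np

def synth(f0, a1, a2, fs=16000, dur=0.5):
    """Synthetic tone: fundamental with amplitude a1 plus 2nd harmonic with amplitude a2."""
    t = np.arange(int(fs * dur)) / fs
    return a1 * np.sin(2 * np.pi * f0 * t) + a2 * np.sin(2 * np.pi * 2 * f0 * t)

def first_strong_period(x, fs=16000, fmin=50.0, fmax=500.0, thresh=0.9):
    """Frequency (Hz) of the first autocorrelation peak exceeding thresh * r[0].

    Hypothetical helper: a naive time-domain periodicity estimate,
    not the algorithm proposed in the thesis.
    """
    n = len(x)
    lo, hi = int(fs / fmax), int(fs / fmin)      # lag range to search
    r0 = np.dot(x, x)                            # energy, i.e. lag-0 autocorrelation
    # Normalized (biased) autocorrelation for lags 0 .. hi + 1
    r = np.array([np.dot(x[:n - k], x[k:]) for k in range(hi + 2)]) / r0
    for k in range(lo, hi + 1):
        is_local_max = r[k] > r[k - 1] and r[k] >= r[k + 1]
        if is_local_max and r[k] > thresh:
            return fs / k
    return None                                  # no sufficiently strong periodicity

# Strong fundamental: the estimate lands near the 100 Hz fundamental.
print(first_strong_period(synth(100.0, a1=1.0, a2=0.1)))
# Attenuated fundamental, amplified 2nd harmonic: the estimate lands near 200 Hz.
print(first_strong_period(synth(100.0, a1=0.1, a2=1.0)))
```

In the second case the signal still repeats every 10 ms, but the autocorrelation already comes very close to its maximum at a lag of 5 ms, so a simple peak picker reports the harmonic rather than the produced fundamental. Whether listeners hear that periodicity as the pitch is the perceptual question the thesis addresses; the sketch only shows the acoustic side.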

Similar resources

The effect of speech sample duration on habitual pitch in normal women aged 18 to 30

Introduction: Habitual pitch is the perceptual correlate of the mean fundamental frequency of speech. Clinical evaluation addresses whether a person's habitual pitch falls within the normal range. Because abnormal pitch is a common feature of many voice disorders, assessing habitual pitch and the factors affecting it may help scientists to determine the exist...

Emotions in time domain synthesis

1.2 Resynthesis. A preliminary test exploring four emotions showed that conveying emotions by time domain synthesis may be possible. Therefore, a more sophisticated test was carried out in order to determine the influence of the prosodic parameters in the perception of a speaker's emotional state. Six different emotional states were investigated. The stimuli of the second test were used in three di...

Reliability of Interaural Time Difference-Based Localization Training in Elderly Individuals with Speech-in-Noise Perception Disorder

Background: Previous studies have shown that interaural-time-difference (ITD) training can improve localization ability. Surprisingly little is, however, known about localization training vis-à-vis speech perception in noise based on interaural time difference in the envelope (ITD ENV). We sought to investigate the reliability of an ITD ENV-based training program in speech-in-noise perception a...

Perceptual pitch deficits coexist with pitch production difficulties in music but not Mandarin speech

Congenital amusia is a musical disorder that mainly affects pitch perception. Among Mandarin speakers, some amusics also have difficulties in processing lexical tones (tone agnosics). To examine to what extent these perceptual deficits may be related to pitch production impairments in music and Mandarin speech, eight amusics, eight tone agnosics, and 12 age- and IQ-matched normal native Mandari...

New Time-Frequency Domain Pitch Estimation Methods for Speech Signals under Low Levels of SNR

Celia Shahnaz, Ph.D., Concordia University, 2009. Pitch estimation of speech signals is the key to understanding most acoustical phenomena as well as accurately designing many practical systems in speech communication. It is to determine the fundamental frequency or period of a vocal cord vibration causi...

Publication date: 2016