Classification of Fricative Consonants for Speech Enhancement in Hearing Devices
نویسندگان
چکیده
OBJECTIVE To investigate a set of acoustic features and classification methods for the classification of three groups of fricative consonants differing in place of articulation. METHOD A support vector machine (SVM) algorithm was used to classify the fricatives extracted from the TIMIT database in quiet and also in speech babble noise at various signal-to-noise ratios (SNRs). Spectral features including four spectral moments, peak, slope, Mel-frequency cepstral coefficients (MFCC), Gammatone filters outputs, and magnitudes of fast Fourier Transform (FFT) spectrum were used for the classification. The analysis frame was restricted to only 8 msec. In addition, commonly-used linear and nonlinear principal component analysis dimensionality reduction techniques that project a high-dimensional feature vector onto a lower dimensional space were examined. RESULTS With 13 MFCC coefficients, 14 or 24 Gammatone filter outputs, classification performance was greater than or equal to 85% in quiet and at +10 dB SNR. Using 14 Gammatone filter outputs above 1 kHz, classification accuracy remained high (greater than 80%) for a wide range of SNRs from +20 to +5 dB SNR. CONCLUSIONS High levels of classification accuracy for fricative consonants in quiet and in noise could be achieved using only spectral features extracted from a short time window. Results of this work have a direct impact on the development of speech enhancement algorithms for hearing devices.
منابع مشابه
Nasalance Scores of Sentences in Children with Hearing Loss
Background and purpose: Proper resonance is a major factor for the comprehension of speech in individuals with hearing loss. These people have low speech intelligibility caused by inappropriate resonance. Therefore, nasalance measurement is a principal aspect of the assessment of people with hearing loss. This study aimed at determining nasalance in children with hearing loss. Materials and me...
متن کاملPercentage of Consonants Correct for 3-5 Years Old Kurdish-Speaking Children With Middle Kurmanji-Mukryani Dialect
Objectives: The present research aims to study the normal development of Percentage of Consonant Correct (PCC) in Kurdish-speaking children, with Middle Kurmanji-Mukryani Dialect as an Articulation Competency Index (ACI). PCC was examined in terms of the manner of articulation and position of sound in the word. Methods: In this descriptoanalytical cross-sectional study, 120 Kurdish-speak...
متن کاملClassification of Fricatives Using Feature Extrapolation of Acoustic-Phonetic Features in Telephone Speech
This paper proposes a classification module for fricative consonants in telephone speech using an acoustic-phonetic feature extrapolation technique. In channel-deteriorated telephone speech, acoustic cues of fricative consonants are expected to be degraded or missing due to limited bandwidth. This paper applies an extrapolation technique to acoustic-phonetic features based on Gaussian mixture m...
متن کاملSpeech enhancement for bandlimited speech
Throughout the history of telecommunication, speech has rarely been transmitted with its full analog bandwidth (0 to 8 kHz or more) due to limitations in channel bandwidth. This impaired legacy continues with tactical voice communication. The passband of a voice terminal is typically 0 to 4 kHz. Hence, high-frequency speech components (4 to 8 kHz) are removed prior to transmission. As a result,...
متن کاملWeighting of Static and Transition Cues in Voiceless Fricatives and Stops in Children Wearing Cochlear Implants
OBJECTIVES To determine how normal-hearing adults (NHA), normal-hearing children (NHC) and children wearing cochlear implants (CI) differ in the perceptual weight given cues for fricative consonants (having a comparatively long static cue and short transition cue) versus stop consonants (having a comparatively short static cue and long transition cue). METHODS Ten NHA, eleven 5- to 8-year-old...
متن کامل