The effect of MPEG audio compression on multidimensional set of voice parameters.

نویسندگان

  • J Gonzalez
  • T Cervera
چکیده

The MPEG-1 Layer 3 compression schema of audio signal, or commonly known as mp3, has caused a great impact in recent years as it has reached high compression rates while also conserving a high sound quality. Previous listening tests have shown that music and speech samples compressed at high bitrates are virtually indistinguishable from the original samples, but very little is known about how compression acoustically affects the voice signal. In Experiment 1 the spectral composition of original and compressed speech signals were analyzed by means of the Long-Term Average Spectrum using the Computerized Speech Laboratory (Kay Elemetrics Corp. (Pine Brook, NJ, USA)). In Experiment 2 a set of 29 voice parameters extracted by using the Multidimensional Voice Program of Kay are compared between original and compressed voices at different bitrates. Results show a high fidelity at high-bitrate compressions (128 and 160 kbit per second (kbps)) both in voice parameters and the amplitude-frequency spectrum. Compressions at 64 kbps or lower bitrates introduces substantial modifications in the voice signal, seriously altering parameters related with tremor, amplitude perturbation, noise, subharmonics and voice irregularities, likewise the signal spectrum is altered in its high frequency region.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Acoustic analysis of pathological voices compressed with MPEG system.

The MPEG-1 Layer 3 compression schema of audio signal, commonly known as mp3, has caused a great impact in recent years as it has reached high compression rates while conserving a high sound quality. Music and speech samples compressed at high bitrates are perceptually indistinguishable from the original samples, but very little was known about how compression acoustically affects the voice sig...

متن کامل

Effect of MPEG audio compression on HMM-based speech synthesis

In this paper, the effect of MPEG audio compression on HMMbased speech synthesis is studied. Speech signals are encoded with various compression rates and analyzed using the GlottHMM vocoder. Objective evaluation results show that the vocoder parameters start to degrade from encoding with bitrates of 32 kbit/s or less, which is also confirmed by the subjective evaluation of the vocoder analysis...

متن کامل

A Single Core Hardware Approach of MPEG Audio Decoder for Real-Time Transmission

The decoding of the voice audio bit stream is an issue in terms of real-time transmission of high quality voice audio over the Internet. A stand-alone chip to perform decoding is a better solution over software approach. The MPEG audio compression provides high compression with minimal loss. This study describes a VHDL model of MPEG audio layer 1 decoder that perform concurrent processing while...

متن کامل

Coding of natural audio in MPEG-4

MPEG-4 standardizes natural audio coding at bitrates ranging from 2 kbit/s, suitable for intelligible speech coding, to 64 kbitls per channel, suitable for high-quality audio coding. Within this range, three categories of coding are defined: parametric coding, Code Excited Linear Predictive coding (CELP) and time/frequency (T/F) coding. The unique contribution of MPEG4 audio is that not only do...

متن کامل

Performance of MPEG-7 low level audio descriptors with compressed data

This paper presents a detailed analysis of lossy compression effects on a set of the MPEG-7 low-level audio descriptors. The analysis results show that lossy compression has a detrimental effect on the integrity of practical search and retrieval schemes that utilize the low level audio descriptors. Methods are then proposed to reduce the detrimental effects of compression in searching schemes. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Logopedics, phoniatrics, vocology

دوره 26 3  شماره 

صفحات  -

تاریخ انتشار 2001