Audio Coding Using Perceptually Controlled Bitstream Buffering
نویسنده
چکیده
Perceptual audio coders use a varying number of bits to encode subsequent frames according to the perceptual entropy of the audio signal. For transmission over a constant bitrate channel the bitstream must be buffered. The buffer must be large enough to absorb variations in the bitrate, otherwise the quality of the audio will be compromised. We present a new scheme for buffer control of perceptual audio coders. In contrast to conventional schemes the proposed scheme systematically reduces the variation in a perceptual distortion measure over time. The new scheme applied to a perceptual audio coder (PAC) improves the quality of the encoded signal for a given buffer size. The same technique can be used to increase the performance of other coders such as MPEG-1 Layer III or MPEG-2 AAC while maintaining backward compatibility.
منابع مشابه
A fine granular scalable perceptually lossy and lossless audio coder
This paper presents Advanced Audio Zip (AAZ), an audio codec that provides the fine granular bit-rate scalability from lossy to lossless coding. Perceptually embedded coding principle is employed in AAZ to provide lossy reconstruction with optimal perceptual quality at intermediate bit-rates. AAZ also provides the backward compatibility where the lossless bit-stream embeds a compliant MPEG-4 AA...
متن کاملFine grain scalable perceptual and lossless audio coding based on IntMDCT
This papers presents an embedded fine grain scalable perceptual and lossless audio coding scheme. The enabling technology for this combined perceptual and lossless audio coding approach is the Integer Modified Discrete Cosine Transform (IntMDCT), which is an integer approximation of the MDCT based on the lifting scheme. It maintains the perfect reconstruction property and therefore enables effi...
متن کاملA bitstream scalable audio coder using a hybrid WLPC-wavelet representation
In this paper, we present a novel bitstream scalable audio coder. In the proposed coder, the full bandwidth of input audio is first split into two. A hybrid WLPC–wavelet representation is used to encode the low frequency components ( 11 kHz). In this method, the excitation to the WLPC synthesis filter is decomposed into subbands using a wavelet filterbank, and perceptually encoded. Two stage qu...
متن کاملPerception-based partial encryption of compressed speech
Mobile multimedia applications, the focus of many forthcoming wireless services, increasingly demand low-power techniques implementing content protection and customer privacy. In this paper low complexity perception-based partial encryption schemes for speech are presented. Speech compressed by a widely-used speech coding algorithm, the ITU-T G.729 standard at 8 kb/s, is partitioned in two clas...
متن کاملSource-driven packet marking for speech transmission over differentiated-services networks
We present a source-driven approach to packet marking for speech transmission over packet networks implementing the Differentiated Services model. Packets generated by the speech coder are examined: if deemed perceptually critical, they are marked as premium and sent on a “virtual wire;” otherwise, they are sent as regular best-effort traffic. Applied to speech coded with the ITU-T 8 kb/s speec...
متن کامل