Multi-Gabor Dictionaries for Audio Time-frequency Analysis
Authors
Abstract
In this paper we consider the construction of multiresolution Gabor dictionaries appropriate for audio signal analysis. Motivated by a desire for parsimony and efficiency, we propose and formalise the idea of reduced multi-Gabor systems, showing that they constitute a frame for L²(R) and other Hilbert spaces of interest. In order to demonstrate the practicality of such a scheme, we apply it to the atomic decomposition of music and speech signals observed in noise. Qualitative results indicate the potential of this method to yield a salient representation of typical audio signals while at the same time reducing computational costs as compared to a full multiresolution decomposition.
Similar resources
Sparsity and persistence in time-frequency sound representations
It is a well known fact that the time-frequency domain is very well adapted for representing audio signals. The main two features of time-frequency representations of many classes of audio signals are sparsity (signals are generally well approximated using a small number of coefficients) and persistence (significant coefficients are not isolated, and tend to form clusters). This contribution pr...
Time-scaling of Audio Signals with Multi-scale Gabor Analysis
The phase vocoder is a standard frequency domain time-scaling technique suitable for polyphonic audio, but it generates annoying artifacts called phasiness, or loss of presence, and transient smearing, especially for high values of the time-scale parameter. In this paper, a new time-scaling algorithm for polyphonic audio signals is described. It uses a multi-scale Gabor analysis for lowfrequenc...
Multi-View Face Detection in Open Environments using Gabor Features and Neural Networks
Multi-view face detection in open environments is a challenging task, due to the wide variations in illumination, face appearances and occlusion. In this paper, a robust method for multi-view face detection in open environments, using a combination of Gabor features and neural networks, is presented. Firstly, the effect of changing the Gabor filter parameters (orientation, frequency, standard d...
A Gabor Regression Scheme for Audio Signal Analysis
Here we describe novel Bayesian models for time-frequency analysis of non-stationary audio waveforms. These models are based on the idea of a Gabor regression, in which a time series is represented as a superposition of time-frequency shifted versions of a simple window function. Prior distributions over the corresponding time-frequency coefficients are constructed in a manner which favours bot...
Numerical Performance of Time-Frequency Transforms in Lossy Audio Coding
Time-frequency analysis and the transforms that it gives rise to play an important role in digital signal processing. In lossy audio compression, for instance, it has been found that working in the transform domain leads to methods that achieve a much higher level of signal compression than could be achieved in the temporal domain. This article gives an introduction to time-frequency analysis t...