Gabor frames and deep scattering networks in audio processing

نویسندگان

  • Roswitha Bammer
  • Monika Dörfler
چکیده

In this paper a feature extractor based on Gabor frames and Mallat’s scattering transform, called Gabor scattering, is introduced. This feature extractor is applied to a simple signal model for audio signals, i.e. a class of tones consisting of fundamental frequency and its multiples and an according envelope. Within different layers, different invariances to certain signal features occur. In this paper we give a mathematical explanation for the first and the second layer which are illustrated by numerical examples. Deformation stability of this feature extractor will be shown by using a decoupling technique, previously suggested for the scattering transform of Cartoon functions. Here it is used to see if the feature extractor is robust to changes in spectral shape and frequency modulation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Image processing by alternate dual Gabor frames

‎We present an application of the dual Gabor frames to image‎ ‎processing‎. ‎Our algorithm is based on finding some dual Gabor‎ ‎frame generators which reconstructs accurately the elements of the‎ ‎underlying Hilbert space‎. ‎The advantages of these duals‎ ‎constructed by a polynomial of Gabor frame generators are compared‎ ‎with their canonical dual‎.

متن کامل

Theory, implementation and applications of nonstationary Gabor frames

Signal analysis with classical Gabor frames leads to a fixed time-frequency resolution over the whole time-frequency plane. To overcome the limitations imposed by this rigidity, we propose an extension of Gabor theory that leads to the construction of frames with time-frequency resolution changing over time or frequency. We describe the construction of the resulting nonstationary Gabor frames a...

متن کامل

Combining pattern recognition and deep-learning-based algorithms to automatically detect commercial quadcopters using audio signals (Research Article)

Commercial quadcopters with many private, commercial, and public sector applications are a rapidly advancing technology. Currently, there is no guarantee to facilitate the safe operation of these devices in the community. Three different automatic commercial quadcopters identification methods are presented in this paper. Among these three techniques, two are based on deep neural networks in whi...

متن کامل

Numerical Performance of Time-Frequency Transforms in Lossy Audio Coding

Time-frequency analysis and the transforms that it gives rise to play an important role in digital signal processing. In lossy audio compression, for instance, it has been found that working in the transform domain leads to methods that achieve a much higher level of signal compression than could be achieved in the temporal domain. This article gives an introduction to time-frequency analysis t...

متن کامل

Multi-View Face Detection in Open Environments using Gabor Features and Neural Networks

Multi-view face detection in open environments is a challenging task, due to the wide variations in illumination, face appearances and occlusion. In this paper, a robust method for multi-view face detection in open environments, using a combination of Gabor features and neural networks, is presented. Firstly, the effect of changing the Gabor filter parameters (orientation, frequency, standard d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1706.08818  شماره 

صفحات  -

تاریخ انتشار 2017