Pooling Robust Shift-Invariant Sparse Representations of Acoustic Signals
نویسندگان
چکیده
In recent years, designing the coding and pooling structures in layered networks has been shown to be a useful method for learning high-level feature representations for visual data. Yet, such learning structures have not been extensively studied for audio signals. In this paper, we investigate different pooling strategies based on the sparse coding scheme and propose a temporal pyramid pooling method to extract discriminative and shiftinvariant feature representations. We demonstrate the superiority of our new feature representation over traditional features on the acoustic event classification task.
منابع مشابه
Translation Invariant Approach for Measuring Similarity of Signals
In many signal processing applications, an appropriate measure to compare two signals plays a fundamental role in both implementing the algorithm and evaluating its performance. Several techniques have been introduced in literature as similarity measures. However, the existing measures are often either impractical for some applications or they have unsatisfactory results in some other applicati...
متن کاملTranslation Invariant Approach for Measuring Similarity of Signals
In many signal processing applications, an appropriate measure to compare two signals plays a fundamental role in both implementing the algorithm and evaluating its performance. Several techniques have been introduced in literature as similarity measures. However, the existing measures are often either impractical for some applications or they have unsatisfactory results in some other applicati...
متن کاملLearning An Invariant Speech Representation
Recognition of speech, and in particular the ability to generalize and learn from small sets of labeled examples like humans do, depends on an appropriate representation of the acoustic input. We formulate the problem of finding robust speech features for supervised learning with small sample complexity as a problem of learning representations of the signal that are maximally invariant to intra...
متن کاملAirplane detection based on rotation invariant and sparse coding in remote sensing images
Airplane detection has been taking a great interest to researchers in the remote sensing filed. In this paper, we propose a new approach on feature extraction for airplane detection based on sparse coding in high resolution optical remote sensing images. However, direction of airplane in images brings difficulty on feature extraction. We focus on the airplane feature possessing rotation invaria...
متن کاملLearning An Invariant Speech Representation by
Recognition of speech, and in particular the ability to generalize and learn from small sets of labeled examples like humans do, depends on an appropriate representation of the acoustic input. We formulate the problem of finding robust speech features for supervised learning with small sample complexity as a problem of learning representations of the signal that are maximally invariant to intra...
متن کامل