Monophonic sound source separation with an unsupervised network of spiking neurones
نویسندگان
چکیده
We incorporate auditory-based features into an unconventional pattern classification system, consisting of a network of spiking neurones with dynamical and multiplicative synapses. Although the network does not need any training and is autonomous, the analysis is dynamic and capable of extracting multiple features and maps. The neural network allows computing a binary mask that acts as a dynamic switch on a speech vocoder made of an FIR gammatone analysis/synthesis bank of 256 filters. We report experiments on separation of speech from various intruding sounds (siren, telephone bell, speech, etc.) and compare our approach to other techniques by using the Log Spectral Distortion (LSD) metric.
منابع مشابه
Cochleotopic/AMtopic (CAM) and Cochleotopic/Spectrotopic (CSM) map based sound sourcce separation using relaxatio oscillatory neurons
We use a two-layered unsupervised bio-inspired neural network to segregate sound sources, e.g. double-vowels or vowels intruded by nonstationary noise sources. The network consists of spiking neurons. The spiking neurons in both layers are modelized by relaxation oscillators. The first layer of the network is locally connected, while the second layer is a fully connected network. We show that i...
متن کاملTowards Neurocomputational Speech and Sound Processing
From physiology we learn that the auditory system extracts simultaneous features from the underlying signal, giving birth to simultaneous representations of audible signals. We also learn that pattern analysis and recognition are not separated processes (in opposition to the engineering approach of pattern recognition where analysis and recognition are usually separated processes). Furthermore,...
متن کاملSource Separation with One Ear: Proposition for an Anthropomorphic Approach
Wepresent an example of an anthropomorphic approach, in which auditory-based cues are combined with temporal correlation to implement a source separation system. The auditory features are based on spectral amplitude modulation and energy information obtained through 256 cochlear filters. Segmentation and binding of auditory objects are performed with a two-layered spiking neural network. The fi...
متن کاملReal - Time Pitch Detection
Finally, there is a definition problem between monophonic and polyphonic sounds. In the case of monophonic sound, the obvious definition is to pick the lowest partial as the fundamental frequency. In the case of polyphonic sounds, resulting either from one source (e.g., a piano) or from many sources (e.g., orchestra, choir), the definition is far more difficult, and approaches close to the prob...
متن کاملRemixing musical audio on the web using source separation
Research in audio source separation has progressed a long way, producing systems that are able to approximate the component signals of sound mixtures. In recent years, many efforts have focused on learning time-frequency masks that can be used to filter a monophonic signal in the frequency domain. Using current web audio technologies, time-frequency masking can be implemented in a web browser i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Neurocomputing
دوره 71 شماره
صفحات -
تاریخ انتشار 2007