An Audio Attention Computational Model Based on Spatial Cues Gradient
نویسندگان
چکیده
Present bottom-up audio attention computational model extracts the underlying characteristics of single channel audio such as energy, pitch, zero crossing rate etc., to calculate the audio signal attention level. In audio surveillance, sound source whose directions change rapidly should have higher attention level. But present audio computational models cannot effectively express the audio attention caused by such signals. It may cause some audio events, which should be paid attention to, cannot bedetected. To solve this problem, based on the psychological principles that spatial information affects attention, this paper proposes a model that introducing the short-term spatial gradient cues to measure the attention caused by the fast changing of single audio source space direction. This model calculates the mean short-term changes of the spatial cues vector ofsub-bands as spatialcues gradient. Compared to the traditional audio attention computational model, the recall of detection of attention audio events increased 4.5 percentage points in experiments.
منابع مشابه
A General Framework for Online Audio Source Separation
We consider the problem of online audio source separation. Existing algorithms adopt either a sliding block approach or a stochastic gradient approach, which is faster but less accurate. Also, they rely either on spatial cues or on spectral cues and cannot separate certain mixtures. In this paper, we design a general online audio source separation framework that combines both approaches and bot...
متن کاملVisual Attention and the Semantics of Space: Evidence for Two Forms of Symbolic Control
In this paper, we investigate the functional differences between word cues and arrow cues in a spatial cuing task and provide a novel computational model fit to the empirical data that provides (1) a conceptually parsimonious explanation of the observed differences and (2) evidence for the existence of two forms of symbolic attentional control. We briefly discuss the implications of the model f...
متن کاملVisual Attention Driven by Auditory Cues - Selecting Visual Features in Synchronization with Attracting Auditory Events
Human visual attention can be modulated not only by visual stimuli but also by ones from other modalities such as audition. Hence, incorporating auditory information into a human visual attention model would be a key issue for building more sophisticated models. However, the way of integrating multiple pieces of information arising from audio-visual domains still remains a challenging problem. ...
متن کاملA Neurodynamic Model of Feature-Based Spatial Selection
Huang and Pashler (2007) suggested that feature-based attention creates a special form of spatial representation, which is termed a Boolean map. It partitions the visual scene into two distinct and complementary regions: selected and not selected. Here, we developed a model of a recurrent competitive network that is capable of state-dependent computation. It selects multiple winning locations b...
متن کاملContextual Awareness, Messaging and Communication in Nomadic Audio Environments
Nomadic Radio provides an audio-only wearable interface to unify remote information services such as email, voice mail, hourly news broadcasts, and personal calendar events. These messages are automatically downloaded to a wearable device throughout the day and users can browse them using speech recognition and tactile input. To provide an unobtrusive interface for nomadic users, the audio/text...
متن کامل