An Audio Attention Computational Model Based on Spatial Cues Gradient

نویسندگان

Bo Hang

Yi Wang

Changqing Kang

چکیده

Present bottom-up audio attention computational model extracts the underlying characteristics of single channel audio such as energy, pitch, zero crossing rate etc., to calculate the audio signal attention level. In audio surveillance, sound source whose directions change rapidly should have higher attention level. But present audio computational models cannot effectively express the audio attention caused by such signals. It may cause some audio events, which should be paid attention to, cannot bedetected. To solve this problem, based on the psychological principles that spatial information affects attention, this paper proposes a model that introducing the short-term spatial gradient cues to measure the attention caused by the fast changing of single audio source space direction. This model calculates the mean short-term changes of the spatial cues vector ofsub-bands as spatialcues gradient. Compared to the traditional audio attention computational model, the recall of detection of attention audio events increased 4.5 percentage points in experiments.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A General Framework for Online Audio Source Separation

We consider the problem of online audio source separation. Existing algorithms adopt either a sliding block approach or a stochastic gradient approach, which is faster but less accurate. Also, they rely either on spatial cues or on spectral cues and cannot separate certain mixtures. In this paper, we design a general online audio source separation framework that combines both approaches and bot...

متن کامل

Visual Attention and the Semantics of Space: Evidence for Two Forms of Symbolic Control

In this paper, we investigate the functional differences between word cues and arrow cues in a spatial cuing task and provide a novel computational model fit to the empirical data that provides (1) a conceptually parsimonious explanation of the observed differences and (2) evidence for the existence of two forms of symbolic attentional control. We briefly discuss the implications of the model f...

متن کامل

Visual Attention Driven by Auditory Cues - Selecting Visual Features in Synchronization with Attracting Auditory Events

Human visual attention can be modulated not only by visual stimuli but also by ones from other modalities such as audition. Hence, incorporating auditory information into a human visual attention model would be a key issue for building more sophisticated models. However, the way of integrating multiple pieces of information arising from audio-visual domains still remains a challenging problem. ...

متن کامل

A Neurodynamic Model of Feature-Based Spatial Selection

Huang and Pashler (2007) suggested that feature-based attention creates a special form of spatial representation, which is termed a Boolean map. It partitions the visual scene into two distinct and complementary regions: selected and not selected. Here, we developed a model of a recurrent competitive network that is capable of state-dependent computation. It selects multiple winning locations b...

متن کامل

Contextual Awareness, Messaging and Communication in Nomadic Audio Environments

Nomadic Radio provides an audio-only wearable interface to unify remote information services such as email, voice mail, hourly news broadcasts, and personal calendar events. These messages are automatically downloaded to a wearable device throughout the day and users can browse them using speech recognition and tactile input. To provide an unobtrusive interface for nomadic users, the audio/text...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2016

An Audio Attention Computational Model Based on Spatial Cues Gradient

نویسندگان

چکیده

منابع مشابه

A General Framework for Online Audio Source Separation

Visual Attention and the Semantics of Space: Evidence for Two Forms of Symbolic Control

Visual Attention Driven by Auditory Cues - Selecting Visual Features in Synchronization with Attracting Auditory Events

A Neurodynamic Model of Feature-Based Spatial Selection

Contextual Awareness, Messaging and Communication in Nomadic Audio Environments

عنوان ژورنال:

اشتراک گذاری