ICIP-98 Audio-Visual Content-Based Violent Scene Characterization
Authors
Abstract
We present a novel technique to characterize and index violent scenes in general TV drama and movies. Our goal is to identify violent signatures and localize violent events within a movie to support "high-level" video indexing. In particular, we exploit multiple "audiovisual" signatures to create a perceptual relation for conceptually meaningful violent scene identification. Potential applications are automatic blocking of violence in movies watched by children, hiding violence using data hiding or information filtering, and genre classification in digital video databases.
Similar resources
MediaEval 2011 Affect Task: Violent Scene Detection combining audio and visual Features with SVM
We propose an approach for violence analysis of movies in a multi-modal (visual and audio) manner with one-class and two-class support vector machine (SVM). We use the scale-invariant feature transform (SIFT) features with the Bag-of-Words (BoW) approach for visual content description of movies, where audio content description is performed with the mel-frequency cepstral coefficients (MFCCs) fe...
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Trajectory-based Features
The Violent Scene Detection task offers a very practical challenge in detecting complex and diverse violent video clips in movies. In this working note paper, we will briefly describe our system and discuss the results, which achieved top performance in mAP@20 and runner-up in mAP@100, among all 35 submissions worldwide. The central component of our system is a set of features derived from the ...
RUCMM at MediaEval 2015 Affective Impact of Movies Task: Fusion of Audio and Visual Cues
This paper summarizes our efforts for the first time participation in the Violent Scene Detection subtask of the MediaEval 2015 Affective Impact of Movies Task. We build violent scene detectors using both audio and visual cues. In particular, the audio cue is represented by bag-of-audio-words with fisher vector encoding. The visual cue is exploited by extracting CNN features from video frames. ...
A multimedia content modeling and classification methodology using visual information for the protection of sensitive user groups
The thesis concerns the problems of visual tracking and violence detection in video sequences. For the visual tracking problem, two feature fusion frameworks are presented. For violence detection, a system that classifies movie segments as violent or non-violent is proposed. The first tracking framework, called 'Model Fusion via Proposal' (MFP), provides a way to efficiently fuse visua...
Audio-Assisted Scene Segmentation for Story Browsing
Content-based video retrieval requires an effective scene segmentation technique to divide a long video file into meaningful high-level aggregates of shots called scenes. Each scene is part of a story. Browsing these scenes unfolds the entire story of a film. In this paper, we first investigate recent scene segmentation techniques that belong to the visual-audio alignment approach. This approac...