Combining Audio-Based and Video-Based Shot Classification Systems for News Videos Segmentation
نویسندگان
چکیده
In this paper we propose an innovative combination strategy for a system using video and audio stream of a news video to automatically segment it into stories. In our approach, the segmentation is performed in two steps: first, shots are classified by combining three different anchor shot detection algorithms using video information only. Then, the shot classification is improved by using a novel anchor shot detection method based on features extracted from the audio track. Experimental results demonstrate that the combined use of audio and video allows our system to perform better than approaches based only on video information in terms of both shot classification and news story segmentation.
منابع مشابه
Unsupervised News Video Segmentation by Combined Audio-Video Analysis
Segmenting news video into stories is among key issues for achieving efficient treatment of news-based digital libraries. In this paper we present a novel unsupervised algorithm that combines audio and video information for automatic partitioning news videos into stories. The proposed algorithm is based on the detection of anchor shots within the video. In particular, a set of audio/video templ...
متن کاملA Comparison of Unsupervised Shot Classification Algorithms for News Video Segmentation
Automatic classification of shots extracted by news video plays an important role in the context of news video segmentation. In spite of the efforts of the researchers involved in this field, a definite solution for the shot classification problem does not yet exist. Moreover, the authors of each novel algorithm usually provide results supporting the claim that their method performs well on a s...
متن کاملA Two-Level Multi-Modal Approach for Story Segmentation of Large News Video Corpus
This paper presents an enhanced work from our previous paper [Chaisorn et al. 2002]. The system is enhanced to perform news story segmentation on a large video corpus used in TRECVID 2003 evaluation. We use a combination of features include visual-based features such as color, object-based features such as face, video-text, temporal features such as audio and motion, and semantic feature such a...
متن کاملA Scalable Video Search Engine Based on Audio Content Indexing and Topic Segmentation
One important class of online videos is that of news broadcasts. Most news organisations provide near-immediate access to topical news broadcasts over the Internet, through RSS streams or podcasts. Until lately, technology has not made it possible for a user to automatically go to the smaller parts, within a longer broadcast, that might interest them. Recent advances in both speech recognition ...
متن کاملAudio-video based Segmentation and Classification using SVM and AANN
In this paper, we propose a method for combining audio and video for segmentation and classification. The objective of segmentation is to detect the category change point such news to advertisement. The classification system classify the audio-video data into one of the predefined categories such as news, advertisement, sports, serial and movies. Mel frequency cepstral coefficients( MFCC) are u...
متن کامل