Learning Semantic Visual Concepts from Video

نویسندگان

  • Jingchun Liu
  • Bir Bhanu
چکیده

Increasing amounts of digital video data have become available with the rapid growth in video technology. As a result, there is a great need for automatic extraction of concepts or events of interest from video. In this paper, we present an approach for learning concepts from video. The approach consists of three steps. In the first step, video shot boundaries are detected, and from these shots key frames are extracted, which are representatives of the shots. In the second step, key frames are segmented and a variety of features are computed. In the third step, a classification by feature partitioning method is employed for learning different semantic concepts. The results are presented for successfully learning semantic concepts such as ocean, mountain, people, and building from a variety of digital videos.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recognition of Visual Events using Spatio-Temporal Information of the Video Signal

Recognition of visual events as a video analysis task has become popular in machine learning community. While the traditional approaches for detection of video events have been used for a long time, the recently evolved deep learning based methods have revolutionized this area. They have enabled event recognition systems to achieve detection rates which were not reachable by traditional approac...

متن کامل

The Effect of Using Visual Aids, Semantic Elaboration, and Visual Aids plus Semantic Elaboration on Iranian Learners' Vocabulary Learning

This study investigated the effect of using visual aids, semantic elaboration, and visual aids plus semantic elaboration on the Iranian EFL learners' vocabulary learning. To conduct the study, the researchers assigned 49 elementary learners to three homogeneous groups according to their proficiency level. Then, a pre-test of Paribakht and Wesche's Vocabulary Knowledge Scale was given to each gr...

متن کامل

Multimodal Video Concept Detection via Bag of Auditory Words and Multiple Kernel Learning

State-of-the-art systems for video concept detection mainly rely on visual features. Some previous approaches have also included audio features, either using low-level features such as mel-frequency cepstral coefficients (MFCC) or exploiting the detection of specific audio concepts. In this paper, we investigate a bag of auditory words (BoAW) approach that models MFCC features in an auditory vo...

متن کامل

Semantic Video Retrieval Using High Level Context

Video retrieval – searching and retrieving videos relevant to a user defined query – is one of the most popular topics in both real life applications and multimedia research. This thesis employs concepts from Natural Language Understanding in solving the video retrieval problem. Our main contribution is the utilization of the semantic word similarity measures for video retrieval through the tra...

متن کامل

Assessing Semantic Relevance by Using Audiovisual Cues

This paper presents two complementary approaches for assessing semantic relevance in video retrieval—(1) adaptive video indexing and (2) elemental concept indexing. Both approaches make extensive use of audiovisual cues. In the former, retrieval is performed by using implicit semantic indices through audio and visual features. Audio features are extracted by statistical time-frequency analysis ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002