Fine-granularity semantic video annotation: An approach based on automatic shot level concept detection and object recognition

Authors

  • Vanessa El-Khoury
  • Martin Jergler
  • Getnet Abebe Bayou
  • David Coquil
  • Harald Kosch
Abstract

Purpose – Fine-grained video content indexing, retrieval, and adaptation require accurate metadata describing the video structure and semantics down to the lowest granularity, i.e. the object level. The authors address these requirements by proposing the semantic video content annotation tool (SVCAT) for structural and high-level semantic video annotation. SVCAT is a semi-automatic, MPEG-7 standard-compliant annotation tool that produces metadata according to a new object-based video content model introduced in this work. Videos are temporally segmented into shots, and shot-level concepts are detected automatically using ImageNet as background knowledge. These concepts serve as a guide for locating and selecting objects of interest, which are then tracked automatically to generate object-level metadata. Integrating shot-based concept detection with object localization and tracking substantially reduces the annotator's workload. The paper aims to discuss these issues.

Design/methodology/approach – Keyframes are systematically classified into ImageNet categories, and this classification is the basis for automatic concept detection in temporal units. An object tracking algorithm then provides exact spatial information about objects.

Findings – Experimental results showed that SVCAT is able to provide accurate object-level video metadata.

Originality/value – The paper's new contribution is an approach that uses ImageNet to obtain shot-level annotations automatically. This approach assists video annotators significantly by minimizing the effort required to locate salient objects in the video.
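
The shot-level concept detection step described in the abstract can be illustrated roughly as follows. This is a minimal sketch, not SVCAT's implementation: it assumes some image classifier has already produced per-keyframe confidence scores for ImageNet category labels, and the function name `shot_level_concepts`, the averaging rule, and the `top_k` and `min_score` parameters are hypothetical choices made for illustration only.

```python
from collections import defaultdict


def shot_level_concepts(keyframe_scores, top_k=3, min_score=0.2):
    """Aggregate per-keyframe label scores into ranked shot-level concepts.

    keyframe_scores: list of dicts, one per keyframe of a single shot,
    each mapping a category label to a classifier confidence in [0, 1].
    (Assumed input format; the abstract does not specify one.)
    """
    if not keyframe_scores:
        return []

    # Sum the confidence of each label over all keyframes of the shot.
    totals = defaultdict(float)
    for scores in keyframe_scores:
        for label, score in scores.items():
            totals[label] += score

    # Average over every keyframe, so a label seen in only one
    # keyframe is damped relative to one seen throughout the shot.
    n = len(keyframe_scores)
    averaged = {label: total / n for label, total in totals.items()}

    # Rank by descending score, breaking ties alphabetically.
    ranked = sorted(averaged.items(), key=lambda kv: (-kv[1], kv[0]))
    kept = [(label, round(score, 3)) for label, score in ranked
            if score >= min_score]
    return kept[:top_k]


# Two keyframes from one hypothetical shot.
shot = [
    {"dog": 0.8, "ball": 0.3},
    {"dog": 0.6, "grass": 0.5},
]
print(shot_level_concepts(shot))  # -> [('dog', 0.7), ('grass', 0.25)]
```

In a tool of this kind, the returned labels would be shown to the annotator as hints about which objects to select and track in the shot. Averaging is only one possible aggregation rule; taking the maximum score per label would favor objects that appear briefly.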

Similar resources

Efficient Semantic Video Annotation by Object and Shot Re-Detection

Manual video annotation on shot and on object level is a very time consuming and therefore cost intensive task. Automatic object and shot re-detection is one step forward in order to provide a cost efficient solution for temporally detailed video annotation. In this demonstration a tool will be shown which integrates novel video visualisation, navigation and interactive object re-detection tech...

Knowledge – Assisted Video Analysis and Object Detection

Intelligent video analysis is a problem of great importance for applications such as surveillance and automatic annotation. We present, in this paper, a hybrid, knowledge-based approach for object recognition in video sequences. Objects are modelled, at the signal level, through the visual descriptors defined by MPEG-7, the ISO standard for description of audiovisual content and in the semant...

Recognition of Visual Events using Spatio-Temporal Information of the Video Signal

Recognition of visual events as a video analysis task has become popular in the machine learning community. While the traditional approaches for detection of video events have been used for a long time, the recently evolved deep learning based methods have revolutionized this area. They have enabled event recognition systems to achieve detection rates which were not reachable by traditional approac...

Application of Combined Local Object Based Features and Cluster Fusion for the Behaviors Recognition and Detection of Abnormal Behaviors

In this paper, we propose a novel framework for behaviors recognition and detection of certain types of abnormal behaviors, capable of achieving high detection rates on a variety of real-life scenes. The new proposed approach here is a combination of the location based methods and the object based ones. First, a novel approach is formulated to use optical flow and binary motion video as the loc...

Assisted video sequences indexing: shot detection and motion analysis based on interest points

This work deals with content-based video indexing. It is part of a multidisciplinary project about television archives. We focus on semi-automatic compressed video analysis mainly as a means of assisting semantic indexing, i.e. we take into account interaction between automatic analysis and the operator. First, we have developed such an assistant for shot cut detection, using adaptiv...

Journal title:
  • Int. J. Pervasive Computing and Communications

Volume 9  Issue 

Pages  -

Publication date 2013