A review on multimodal video indexing
نویسندگان
چکیده
Efficient and effective handling of video documents depends on the availability of indexes. Manual indexing is unfeasible for large video collections. Efficient, single modality based, video indexing methods have appeared in literature. Effective indexing, however, requires a multimodal approach in which either the most appropriate modality is selected or the different modalities are used in collaborative fashion. In this paper we present a framework for multimodal video indexing, which views a video document from the perspective of its author. The framework serves as a blueprint for a generic and flexible multimodal video indexing system, and generalizes different state-of-the-art video indexing methods. It furthermore forms the basis for categorizing these different methods.
منابع مشابه
A State-of-the-art Review on Multimodal Video Indexing
Efficient and effective handling of video documents depends on the availability of indexes. Manual indexing is unfeasible for large video collections. Effective indexing requires a multimodal approach in which either the most appropriate modality is selected or the different modalities are used in collaborative fashion. In this paper we focus on the similarities and differences between the moda...
متن کاملAchieving Multimodal Cohesion during Intercultural Conversations
How do English as a lingua franca (ELF) speakers achieve multimodal cohesion on the basis of their specific interests and cultural backgrounds? From a dialogic and collaborative view of communication, this study focuses on how verbal and nonverbal modes cohere together during intercultural conversations. The data include approximately 160-minute transcribed video recordings of ELF interactions ...
متن کاملEvent Based Video Indexing by Intermodal Collaboration
In this paper, we propose event based video indexing, which is a kind of indexing by semantical contents. To achieve this, we exploit the idea of intermodal collaboration, i.e. collaborative processing taking account of the semantical dependency between multimodal information streams consisting of visual, auditory, and text (closed caption: CC) streams. The proposed method attempts to make temp...
متن کاملGrowing Trend from Uni-to-Multimodal Video Indexing
The ultimate challenge of the semantic multimedia database research is to provide a system that can retrieve the most relevant and semantically accurate multimedia data (image, audio, text) according to the user requirement. In order to cope with this challenge there is a need for a system that can efficiently and effectively index the multimedia data. This paper provides an overview of multimo...
متن کاملVideo Content Modeling: An Overview
This paper provides an overview of different video content modeling techniques employed in existing content-based video indexing and retrieval (CBVIR) systems. Based on the modeling requirements of a hypothetical (somewhat ideal) CBVIR system, we analyze and categorize existing modeling approaches. Starting with a review of techniques to model raw video data, we study approaches used to describ...
متن کامل