Enst/uob/lu@trecvid2007 High Level Feature Extraction Using 2-level Piecewise Gmm
نویسندگان
چکیده
We describe a high level feature extraction system for video. Video sequences are modeled using Gaussian Mixture Models. We have used those models in the past to segment video sequences into 2D+time objects. The segmentation result has been used with great success in a compression scheme. In the present work, the Gaussian components of the model are considered to completely model the corresponding objects in the video. Their parameters are used as low-level features for a high-level model used for the detection of a topic or high level feature. The system is not optimized for a particular feature and is thus scalable to any number of features. A threshold is manually selected for each feature after normalization. The only difference between runs was normalization. We tested two runs: B_ENST_1 uses znorm per topic per video. B_ENST_2 use znorm per video. The second system provided better results. This is an initial system that will be used to explore the effectiveness of the modeling of videos using GMMs.
منابع مشابه
Florida International University and University of Miami TRECVID 2008 - High Level Feature Extraction
This paper describes the FIU-UM group TRECVID 2008 high level feature extraction task submission. We have used a correlation based video semantic concept detection system for this task submission. This system first extracts shot based low-level audiovisual features from the raw data source (audio and video files). The resulting numerical feature set is then discretized. Multiple correspondence ...
متن کاملTokyoTech's TRECVID2007 Notebook
For the high-level feature extraction task, we use visual features (visual words of keyframe images and motion features), and do not use any audio information. Maximum entropy models [1] are employed to model these visual features. Because there was a material mistake in our submission, the inferred Average Precisions of our runs was almost zero. Therefore, in this notebook, we also show the re...
متن کاملTRECVID 2006 by NUS - I
NUS and IR joint participated in the high-level feature extraction and automated search task for TRECVID 2006. In both task, we only make use of the standard TRECVID available annotation results. For HLF task, we develop 2 methods to perform automated concept annotation: (a) fully machine learning approach using SVM, LDF and GMM; and (b) Bi-gram model for Pattern Discovery and Matching. As for ...
متن کاملImproved Closed Set Text-Independent Speaker Identification by Combining MFCC with Evidence from Flipped Filter Banks
A state of the art Speaker Identification (SI) system requires a robust feature extraction unit followed by a speaker modeling scheme for generalized representation of these features. Over the years, Mel-Frequency Cepstral Coefficients (MFCC) modeled on the human auditory system has been used as a standard acoustic feature set for SI applications. However, due to the structure of its filter ban...
متن کاملDevelopment of an Automatic Land Use Extraction System in Urban Areas using VHR Aerial Imagery and GIS Vector Data
Lack of detailed land use (LU) information and efficient data collection methods have made the modeling of urban systems difficult. This study aims to develop a novel hierarchical rule-based LU extraction framework using geographic vector and remotely sensed (RS) data, in order to extract detailed subzonal LU information, residential LU in this study. The LU extraction system is developed to ex...
متن کامل