Weighted subspace modeling for semantic concept retrieval using gaussian mixture models
نویسندگان
چکیده
At the era of digital revolution, social media data are growing at an explosive speed. Thanks to the prevailing popularity of mobile devices with cheap costs and high resolutions as well as the ubiquitous Internet access provided by mobile carriers, Wi-Fi, etc., numerous numbers of videos and pictures are generated and uploaded to social media websites such as Facebook, Flickr, and Twitter everyday. To efficiently and effectively search and retrieve information from the large amounts of multimedia data (structured, semi-structured, or unstructured), lots of algorithms and tools have been developed. Among them, a variety of data mining and machine learning methods have been explored and proposed and have shown their effectiveness and potentials in handling the growing requests to retrieve semantic information from those large-scale multimedia data. However, it is well-acknowledged that the performance of such multimedia semantic information retrieval is far from satisfactory, due to the challenges like rare events, data imbalance, etc. In this paper, a novel weighted subspace modeling framework is proposed that is based on the Gaussian Mixture Model (GMM) and is able to effectively retrieve semantic concepts, even from the highly Chao Chen Department of Electrical and Computer Engineering, University of Miami, 1251 Memorial Drive, Coral Gables, FL 33146, USA Tel.: +305-284-6503 E-mail: [email protected] Mei-Ling Shyu Department of Electrical and Computer Engineering, University of Miami, 1251 Memorial Drive, Coral Gables, FL 33146, USA Tel.: +305-284-5566 Fax: +305-284-4044 E-mail: [email protected] Shu-Ching Chen School of Computing and Information Sciences, Florida International University, 11200 SW 8th Street, Miami, FL 33199, USA Tel.: +305-348-3480 Fax: +305-348-3549 E-mail: [email protected]
منابع مشابه
Developing a BIM-based Spatial Ontology for Semantic Querying of 3D Property Information
With the growing dominance of complex and multi-level urban structures, current cadastral systems, which are often developed based on 2D representations, are not capable of providing unambiguous spatial information about urban properties. Therefore, the concept of 3D cadastre is proposed to support 3D digital representation of land and properties and facilitate the communication of legal owners...
متن کاملSemantic-Oriented 3D Model Classification and Retrieval Using Gaussian Processes
The need of retrieving 3D models is constantly emerging. To improve the performance of a shape-based 3D model retrieval system, an approach is introduced to classify and retrieve 3D model by integrating shape features and semantic information. First, a new type of shape feature based on 2D views (called ZA) is proposed. Then we use Gaussian processes as supervised learning to mode the mapping f...
متن کاملSemantic Similarity for Music Retrieval
We present a query-by-example system for content-based music information retrieval by ranking items in a database based on semantic similarity, rather than acoustic similarity, to a query example. The retrieval system is based on semantic concept models that are learned from the CAL500 data set containing both audio examples and their text captions. Using the concept models, the audio tracks ar...
متن کاملDependency Models based on Generalized Gaussian Scale Mixtures and Normal Variance Mean Mixtures
We extend the Gaussian scale mixture model of dependent subspace source densities to include non-radially symmetric densities using Generalized Gaussian random variables linked by a common variance. We also introduce the modeling of skew using the Normal Variance-Mean mixture model. We give closed form expressions for likelihoods and parameter updates in the EM algorithm.
متن کاملTper Hcaeser Pidi Application of Subspace Gaussian Mixture Models in Contrastive Acoustic Scenarios
This paper describes experimental results of applying Subspace Gaussian Mixture Models (SGMMs) in two completely diverse acoustic scenarios: (a) for Large Vocabulary Continuous Speech Recognition (LVCSR) task over (well-resourced) English meeting data and, (b) for acoustic modeling of underresourced Afrikaans telephone data. In both cases, the performance of SGMM models is compared with a conve...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Information Systems Frontiers
دوره 18 شماره
صفحات -
تاریخ انتشار 2016