Effective retrieval and visual analysis in multimedia databases

نویسنده

  • Tobias Schreck
چکیده

Based on advances in acquisition, storage, and dissemination technology, increasing amounts of multimedia content such as images, audio, video, or 3D models, become available. The Feature Vector (FV) paradigm is one of the most popular approaches for managing multimedia content due to its simplicity and generality. It maps multimedia elements from object space to metric space, allowing to infer object similarity relationships from distances in metric space. The distances in turn are used to implement similarity-based multimedia applications. For a given multimedia data type, many different FV mappings are possible, and the effectiveness of a FV mapping can be understood as the degree of resemblance of object space similarity relationships by distances in metric space. The effectiveness of the FV mapping is essential for any application based on it. Two main ideas motivate this thesis. We first recognize that the FV approach is promising, but needs attention of FV selection and engineering in order to serve as a basis for building effective multimedia applications. Secondly, we believe that visualization can contribute to building powerful user interfaces for analysis of the FV as well as the object space. This thesis focuses on supporting a number of important user tasks in FV-based multimedia databases. Specifically, we propose innovative methods for (a) effective processing of content-based similarity queries, (b) FV space visualization for discrimination analysis, and (c) visualization layout generation for content presentation. The methods are applied and evaluated on a number of specific multimedia data types such as 3D models, images, and time series data, and are expected to be useful in many other multimedia domains. Effective retrieval in 3D databases (Chapter 2). We review and classify a significant number of recently proposed FV extractors supporting the 3D model domain. Extensive effectiveness evaluation experiments are performed for many FV extractors on a number of benchmarks. Methods for improving retrieval effectiveness by forming static and query-dependent combinations of FVs are researched. Experiments show significant improvements in retrieval precision (quality of the answer sets) to be achievable. Visual FV space analysis (Chapter 3). We explore the usage of interactive 2D projections for retrieval and organization of multimedia content in a multi-FV 3D retrieval system. Selforganizing maps (SOMs) have shown to be appropriate to this end. We propose a PCA-based visualization method for supervised visual discrimination analysis in FV space. Also, SOMbased techniques are explored for unsupervised estimation of FV space discrimination power. Both visualizations can be used for addressing the FV selection problem, and for fine tuning FV-based multimedia applications.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Proposing an effective approach for Network security and multimedia documents classically using encryption and watermarking

Local binary pattern (LBP) operators, which measure the local contrast within a pixel's neighborhood, successfully applied to texture analysis, visual inspection, and image retrieval. In this paper, we recommend a semi blind and informed watermarking approach. The watermark has been built from the original image using Weber Law. The approach aims is to present a high robustness and imperceptibi...

متن کامل

Multimedia Information Retrieval: Promises and Challenges

The explosion of multimedia content in databases, broadcasts, streaming media, etc. has generated new requirements for more effective access to these global information repositories. Content extraction, indexing, and retrieval of multimedia data continues to be one of the most challenging and fastest-growing research areas. A consequence of the growing consumer demand for multimedia information...

متن کامل

Data Mining-Based CBIR System

Multimedia mining primarily involves information analysis and retrieval based on implicit knowledge. The ever increasing digital image databases on the internet has created a need for using multimedia mining on these databases for effective and efficient retrieval of images.

متن کامل

Eye-Tracking Method’ Usage for Understanding the Cognitive Processes in Multimedia Learning

Introduction: Designing multimedia learning environments should consist of the evidence-based study and principals about the human learning process. Eye tracking is a way based on the learner processing of learning materials which presented in multimedia learning environments. The aim of the study was to examine the use of the eye-tracking method to investigate the cognitive processes in m...

متن کامل

The Design of A Web-based Multimedia Information Retrieval System

In a large-scale and distributed visual information retrieval system, various multimedia databases are distributed throughout the Internet. Accessing multimedia databases distributed across remote sites has been made possible by the advent of the World Wide Web using the Internet as the medium. However, accessing such multimedia databases over the Web poses several challenges relating to the se...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007