Telefonica Research at TRECVID 2010 Content-Based Copy Detection

Authors

  • Ehsan Younessian
  • Xavier Anguera Miró
  • Tomasz Adamek
  • Nuria Oliver
Abstract

This notebook paper presents the participation of Telefonica Research in the Video Copy Detection task of TRECVID 2010. This is our second participation. This year we developed two local-feature-based monomodal systems, which we then combine using score-based fusion to obtain a multimodal system output. We submitted 4 runs in total, whose main characteristics are described below:

  • TID.m.[BALANCED/NOFA].fusion: These are our main submissions, for the no false alarm and balanced profiles respectively. They are based on the fusion of the local audio and local video monomodal systems.
  • TID.m.BALANCED.videoonly: This submission is based on the monomodal video-based system, which uses DART local features followed by a temporal consistency postprocessing step.
  • TID.m.BALANCED.audioonly: This submission is based on the monomodal audio-based system, which uses frequency-based audio local features.

Of the four runs submitted, two process only monomodal information (audio or video), while the fusion system takes the output of those two to produce a fused result. Results for the monomodal systems in terms of NDCR are far from optimal, mainly due to an excess of false alarms that our monomodal systems still output. F1 scores are very good in all cases. When the monomodal systems are combined in the fusion, the NDCR scores improve considerably, as most false alarms are eliminated. The proposed fusion turned out to work very well for combining our two monomodal systems. We will investigate it further to improve it for future evaluations.
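The abstract does not specify how the score-based fusion is computed. As an illustration only, the general idea of fusing two monomodal detection lists at the score level can be sketched as below. Everything in this sketch is an assumption and not the authors' method: the min-max normalization, the weighted sum, the weights, the threshold, and the names `min_max_normalize`, `fuse_scores` and `accept` are hypothetical.

```python
from typing import Dict, Tuple

# A detection is keyed by (query_id, reference_id); the value is a raw score.
Key = Tuple[str, str]


def min_max_normalize(scores: Dict[Key, float]) -> Dict[Key, float]:
    """Map raw scores of one system to [0, 1] so the systems are comparable."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All scores are equal: treat them all as maximally confident.
        return {k: 1.0 for k in scores}
    return {k: (v - lo) / (hi - lo) for k, v in scores.items()}


def fuse_scores(
    audio: Dict[Key, float],
    video: Dict[Key, float],
    w_audio: float = 0.5,  # hypothetical weight, not from the paper
    w_video: float = 0.5,  # hypothetical weight, not from the paper
) -> Dict[Key, float]:
    """Weighted sum of normalized scores; a missing modality contributes 0."""
    a = min_max_normalize(audio)
    v = min_max_normalize(video)
    return {
        key: w_audio * a.get(key, 0.0) + w_video * v.get(key, 0.0)
        for key in set(a) | set(v)
    }


def accept(fused: Dict[Key, float], threshold: float) -> Dict[Key, float]:
    """Keep only detections whose fused score reaches the threshold."""
    return {k: s for k, s in fused.items() if s >= threshold}


# Toy scores, invented for illustration.
audio_hits = {("q1", "r1"): 10.0, ("q1", "r2"): 2.0, ("q2", "r5"): 6.0}
video_hits = {("q1", "r1"): 0.9, ("q2", "r7"): 0.1, ("q3", "r3"): 0.5}

fused = fuse_scores(audio_hits, video_hits)
kept = accept(fused, threshold=0.6)
print(sorted(kept))
```

In this toy setup a detection supported by only one modality can reach at most half of the maximum fused score, so a threshold above 0.5 discards it. That is one simple way in which a fusion step can remove false alarms that a single monomodal system would still output.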


Similar resources

Telefonica Research Content-Based Copy Detection TRECVID Submission

This notebook paper presents the systems submitted by Telefonica Research within the MESH team for the task of video copy detection in TRECVID 2009. We participated in the Video-only, Audio-only and Audio+Video tasks. Our main contribution is the combination (when possible) of audio and video features within the same system by using global features extracted both from the reference videos and t...


Telefonica Research at TRECVID 2011 Content-Based Copy Detection

This notebook paper summarizes the algorithms behind Telefonica Research's participation in the NIST-TRECVID 2011 evaluation on the Video Copy Detection task. This year we have focused on 1) improving the image-based matching system to better process video files; 2) implementing and testing a novel audio local fingerprint; and 3) improving the multimodality fusion algorithm from last year. For this ...


NTNU-Academia Sinica at TRECVID 2010 Content Based Copy Detection

This paper presents two video copy detection systems built for the TRECVID 2010 content-based copy detection task. Three runs were submitted using video-only content. Two systems differ in terms of the feature design as well as the matching scheme. In this paper we overview the underlying methodologies and discuss the various design choices for developing a practical video copy detection system.


NTT Communication Science Laboratories at TRECVID 2010 Content Based Copy Detection

In this paper, we describe our approaches that were tested in the TRECVID 2010 Content-Based Copy Detection (CBCD) task. We introduce a method consisting of a feature degeneration and sparse feature selection process for video detection tasks, which is highly robust as regards video signal distortion. For audio detection tasks, we adopt a method based on spectral partitioning to cope with addit...


Combining Features at Search Time: PRISMA at Video Copy Detection Task

Most current Video Copy Detection (VCD) systems perform multimodal detection by dividing the system into subsystems. Each subsystem performs copy detection using a different feature (either visual or audio), and the sets of candidates are combined (fused) to create the final result. We present a VCD system that fuses visual and audio descriptors at the similarity search level. The system...



Publication date: 2010