The TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search Task
نویسندگان
چکیده
This paper describes the system proposed for the Spoken Web Search task at Mediaeval 2012 campaign. We use an audio-only system based on our new called Cumulative Dynamic Time Warping (CDTW) algorithm. This algorithm combines the scores of all the alignment paths and allows for the learning of different cost functions for each kind of step in the alignment matrix (diagonal, horizontal and vertical). The results obtained with basic audio descriptors show the promising potential of our algorithm.
منابع مشابه
SWS task: Articulatory phonetic units and sliding DTW
This paper describes the experiments conducted for spoken web search at MediaEval 2011 evaluations. The task consists of searching for audio segments within audio content using an audio query. The current approach uses a broad articulatory phonetic units for indexing the audio files and to obtain audio segments. Sliding DTW is applied on the audio segments to determine the time instants.
متن کاملBUT2012 Approaches for Spoken Web Search - MediaEval 2012
We submitted two approaches as the required runs: Acoustic Keyword Spotting as the primary one (AKWS) and Dynamic Time Wrapping as the secondary one (DTW) for the Spoken Web Search task. We aimed at building a simple phone based language-dependent system. We experimented with universal context bottle-neck neural network classifier with 3-state phone posterior features or bottle-neck features.
متن کاملTUKE MediaEval 2012: Spoken Web Search using DTW and Unsupervised SVM
This working paper provides the basic information about experiments conducted on audio documents within the MediaEval 2012 spoken web search evaluation project. The main purpose of these experiments was to build a robust and language independent system for spoken term detection. Therefore we have proposed query-by-example searching system based on the minimum-cost alignment of DTW algorithm and...
متن کاملThe L2F Spoken Web Search System for Mediaeval 2013
The INESC-ID’s Spoken Language Systems Laboratory (LF) primary system developed for the Spoken Web Search task of the Mediaeval 2013 evaluation campaign consists of the fusion of six individual sub-systems exploiting 3 different language-dependent phonetic classifiers. For each phonetic classifier, an acoustic keyword spotting (AKWS) sub-system based on connectionist speech recognition and a dy...
متن کاملGTTS Systems for the SWS Task at MediaEval 2013
This paper briefly describes the systems presented by the Software Technologies Working Group (http://gtts.ehu.es, GTTS) of the University of the Basque Country (UPV/EHU) to the Spoken Web Search (SWS) task at MediaEval 2013. GTTS systems consist of four main modules: (1) feature extraction; (2) speech activity detection; (3) DTW-based query matching; and (4) score calibration and fusion. The m...
متن کامل