An Engine for Online Video Search in Large Archives of the Holocaust Testimonies
نویسندگان
چکیده
In this paper we present an online system for cross-lingual lexical (full-text) searching in the large archive of the Holocaust testimonies. Video interviews recorded in two languages (English and Czech) were automatically transcribed and indexed in order to provide efficient access to the lexical content of the recordings. The engine takes advantage of the state-of-the-art speech recognition system and performs fast spoken term detection (STD), providing direct access to the segments of interviews containing queried words or short phrases.
منابع مشابه
Fast Phonetic/Lexical Searching in the Archives of the Czech Holocaust Testimonies: Advancing Towards the MALACH Project Visions
In this paper we describe the system for a fast phonetic/lexical searching in the large archives of the Czech holocaust testimonies. The developed system is the first step to a fulfillment of the MALACH project visions [1,2], at least as for an easier and faster access to the Czech part of the archives. More than one thousand hours of spontaneous, accented and highly emotional speech of Czech h...
متن کاملDesigning and Explaining the Impact Pattern of Online Advertising on Actual Purchasing (Case Study: Atieh Saba Holding)
The purpose of this study is to design and explain the impact pattern of online advertising on actual purchasing (Case Study: Atieh Saba Holding). The study is qualitative based on the objective and data collection process. The population of the study were all marketing and sales experts at Atieh Saba Holding, and among these experts, 10 were selected as the sample. In this study, data were col...
متن کاملDesigning Spontaneous Speech Search Interface for Historical Archives
Spontaneous speech in the form of conversations, meetings, voice-mail, interviews, oral history, etc. is one of the most ubiquitous forms of human communication. Search engines providing access to such speech collections have the potential to better inform intelligence and make relevant data over vast audio/video archives available to users. This project presents a search user interface design ...
متن کاملSystem for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive
The main objective of the work presented in this paper was to develop a complete system that would accomplish the original visions of the MALACH project. Those goals were to employ automatic speech recognition and information retrieval techniques to provide improved access to the large video archive containing recorded testimonies of the Holocaust survivors. The system has been so far developed...
متن کاملUsing Text Surrounding Method to Enhance Retrieval of Online Images by Google Search Engine
Purpose: the current research aimed to compare the effectiveness of various tags and codes for retrieving images from the Google. Design/methodology: selected images with different characteristics in a registered domain were carefully studied. The exception was that special conceptual features have been apportioned for each group of images separately. In this regard, each group image surr...
متن کامل