Lessons Learned from Building a Terabyte Digital Video Library

نویسندگان

Howard D. Wactlar

Michael G. Christel

Yihong Gong

Alexander G. Hauptmann

چکیده

T he Informedia Project at Carnegie Mellon University has created a terabyte digital video library in which automatically derived descriptors for the video are used for indexing , segmenting, and accessing the library contents. Digital video presented a number of interesting challenges for library creation and deployment: the way it embeds information, its voluminous file size, and its temporal characteristics. In the Informedia Project, we addressed these challenges by • automatically extracting information from digitized video, • creating interfaces that allowed users to search for and retrieve videos based on extracted information , and • validating the system through user testbeds. We met these objectives during the course of the project , learning many lessons along the way. Our library consisted of two types of video: news video (from the Cable News Network) and documentary video (from the British Open University, QED Communications, the Discovery Channel, and a number of US government agencies, including NASA, the National Park Service, and the US Geological Survey). As of May 1998, the library contained more than 1,000 hours of news and 400 hours of documentary video, with additional video being added daily. The news video often included tags which marked story boundaries within longer broadcasts. At first, we added these story boundaries manually for the documentary video; in our subsequent experiments we looked at generating story boundaries automatically. The stories, or video segments, averaged a few minutes in length, so that the entire library contained more than 40,000 video segments. We used artificial intelligence techniques to create metadata, the data that describes video content. We found that all the AI techniques we used were applicable to both the news corpus and the documentary corpus. These techniques included speech recognition, image processing, and information retrieval. We learned that by integrating across these three areas we were able to compensate for limitations in accuracy, coverage, and communicative power. We analyzed the audio component of the video with the CMU Sphinx speech recognition system. 1 This created a complete transcript for text-based retrieval from the speech and aligned existing imperfect transcripts to the video. The tightly synchronized transcript was subsequently used in the library interface for quickly locating regions of interest within relevant video segments. In creating transcripts, CMU Sphinx's word error rate is inversely proportional to the amount of processing time devoted to the task. Processing time varies from real time, which offers relatively poor …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Accessing News Video Libraries through Dynamic Information Extraction, Summarization, and Visualization

The Informedia Project has developed and evaluated surrogates, summary interfaces, and visualizations for accessing a digital video library containing thousands of documents and terabytes of data. This paper begins with a review of Informedia surrogates for a single video document, including titles, storyboards, and skims. Incorporating textual elements, considering user context and emphasizing...

متن کامل

Examining User Interactions with Video Retrieval Systems

The Informedia group at Carnegie Mellon University has since 1994 been developing and evaluating surrogates, summary interfaces, and visualizations for accessing digital video collections containing thousands of documents, millions of shots, and terabytes of data. This paper reports on TRECVID 2005 and 2006 interactive search tasks conducted with the Informedia system by users having no knowled...

متن کامل

Interactive Maps for a Digital Video Library

The Informedia Digital Video Library contains over 1200 hours of video. Through automatic processing, descriptors are derived for the video to improve library access. A new extension to the video processing is the extraction of geographic references from these descriptors. The operational library interface shows the geographic entities addressed in a given story, highlighting the regions discus...

متن کامل

Informedia Digital Video Library Accomplishments and Future Directions

The Informedia Digital Video Library Project (IDVL), launched in mid-1994, was one of six Digital Library Initiative (DLI) Phase 1 projects funded jointly by NSF, DARPA and NASA. IDVL was the only DLI project focusing specifically on information extraction from video and audio content, successfully pioneering the automated indexing and retrieval of multimedia documents from over a terabyte of o...

متن کامل

A Distributed Digital Library: Planning, Building, and Using

Cost cutting and personnel restructuring are forcing organizations to make difficult decisions on where to spend money. Among the areas hit hard by budget cuts are education and training. Academic, industrial, and governmental institutions are all seeking means to leverage technology to improve the timeliness, efficiency, and standardization of their required training. One way to extend budgets...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

IEEE Computer

دوره 32 شماره

صفحات -

تاریخ انتشار 1999

Lessons Learned from Building a Terabyte Digital Video Library

نویسندگان

چکیده

منابع مشابه

Accessing News Video Libraries through Dynamic Information Extraction, Summarization, and Visualization

Examining User Interactions with Video Retrieval Systems

Interactive Maps for a Digital Video Library

Informedia Digital Video Library Accomplishments and Future Directions

A Distributed Digital Library: Planning, Building, and Using

عنوان ژورنال:

اشتراک گذاری