نتایج جستجو برای: audio system

تعداد نتایج: 2277301  

2005
Stuart N. Wrigley

An audio-visual localisation and tracking system for meeting scenarios is presented which draws its inspiration from neurobiological processing. Meetings are recorded by a KEMAR binaural manikin and a single camera placed directly above the manikin. Source localisation from the binaural audio and face, object and motion locations from the video frames are used as input to two linked neural osci...

2013
Marc Rébillat Xavier Boutillon Étienne Corteel Brian F.G. Katz

A study on audio, visual, and audio-visual egocentric distance perception by moving participants in virtual environments is presented. Audio-visual rendering is provided using tracked passive visual stereoscopy and acoustic wave eld synthesis (WFS). Distances are estimated using indirect blind-walking (triangulation) under each rendering condition. Experimental results show that distances perce...

Abstract   The streamed audio video (AV) content for IPTV across mobile WiMAX channel, the different schemes were discussed to reduce the noise, packet loss and latency. The objective of this paper is to verify the effectiveness of forward error correction (FEC) techniques and to suggest the techniques for robustness problems and to analysis the issues either due to AV coding encoding or due to...

2013
Sajeevan Joseph

Abstract—Digital water marking is a technique of embedding extra information into the multimedia content, which can be extracted to prove the copy rights. Compared to human visual system, audio system is more sensitive. As a result very few audio watermarking algorithms have been robust and imperceptible. In this paper we are implementing audio watermarking using discrete wavelet transform (DWT...

2014
Pavel Campr Marie Kunesová Jan Vanek Jan Cech Josef Psutka

Our goal is to create speaker models in audio domain and face models in video domain from a set of videos in an unsupervised manner. Such models can be used later for speaker identification in audio domain (answering the question ”Who was speaking and when”) and/or for face recognition (”Who was seen and when”) for given videos that contain speaking persons. The proposed system is based on an a...

2004
V. Kravcenko M. Purat

This paper addresses the problem of high quality audio transmission over an air interface. The analysis is based on an existing embedded audio multi-room system and the usage of standard IEEE802.11 WLAN components. An access scheme based on the emerging IEEE802.11e supplement is proposed to meet the hard real-time requirements of audio streaming. Measurements show the performance of the air int...

2012
Susanne Burger Qin Jin Peter F. Schulam Florian Metze

Audio information retrieval is a difficult problem due to the highly unstructured nature of the data. A general labeling system for identifying audio patterns could unite research efforts in the field. This paper introduces 42 distinct labels, the “noisemes”, developed for the manual annotation of noise segments as they occur in audio streams of consumer captured and semiprofessionally produced...

2003
Declan Murphy Tue Haste Andersen Kristoffer Jensen

This paper presents a system to control the playback of audio files by means of the standard classical conducting technique. Computer vision techniques are developed to track a conductor’s baton, and the gesture is subsequently analysed. Audio parameters are extracted from the sound-file and are further processed for audio beat tracking. The sound-file playback speed is adjusted in order to bri...

2016
Takahiko Tsuchiya Jason Freeman Lee W. Lerner

Creating interactive audio applications for web browsers often involves challenges such as time synchronization between non-audio and audio events within thread constraints and format-dependent mapping of data to synthesis parameters. In this paper, we describe a unique approach for these issues with a data-driven symbolic music application programming interface (API) for rapid and interactive ...

2003
Mingkun Li Gang Wei Valery A. Petrushin Ishwar K. Sethi

This paper presents a methodology for developing audio processing agents for a multi-agent environment that is known as the Community of Multimedia Agents. The Community’s philosophy, objectives and architecture are described. The methodology is illustrated using audio feature extraction agents as example. The algorithms used for extracting audio features are classical and work in general audio...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید