audio system

Physiologically Motivated Audio-Visua

2005

Stuart N. Wrigley

An audio-visual localisation and tracking system for meeting scenarios is presented which draws its inspiration from neurobiological processing. Meetings are recorded by a KEMAR binaural manikin and a single camera placed directly above the manikin. Source localisation from the binaural audio and face, object and motion locations from the video frames are used as input to two linked neural osci...

متن کامل

Audio, visual, and audio-visual egocentric distance perception by moving participants in virtual environments

2013

Marc Rébillat Xavier Boutillon Étienne Corteel Brian F.G. Katz

A study on audio, visual, and audio-visual egocentric distance perception by moving participants in virtual environments is presented. Audio-visual rendering is provided using tracked passive visual stereoscopy and acoustic wave eld synthesis (WFS). Distances are estimated using indirect blind-walking (triangulation) under each rendering condition. Experimental results show that distances perce...

متن کامل

Analysis of Packet Loss and Latency Control for Robust IPTV over Mobile WiMAX and LTE Assessment (RESEARCH NOTE)

Journal: International Journal of Engineering 2013

Fareeha Zafar, Muhammad Akram,

Abstract The streamed audio video (AV) content for IPTV across mobile WiMAX channel, the different schemes were discussed to reduce the noise, packet loss and latency. The objective of this paper is to verify the effectiveness of forward error correction (FEC) techniques and to suggest the techniques for robustness problems and to analysis the issues either due to AV coding encoding or due to...

متن کامل

FPGA implementation of DWT for Audio Watermarking Application

2013

Sajeevan Joseph

Abstract—Digital water marking is a technique of embedding extra information into the multimedia content, which can be extracted to prove the copy rights. Compared to human visual system, audio system is more sensitive. As a result very few audio watermarking algorithms have been robust and imperceptible. In this paper we are implementing audio watermarking using discrete wavelet transform (DWT...

متن کامل

Audio-Video Speaker Diarization for Unsupervised Speaker and Face Model Creation

2014

Pavel Campr Marie Kunesová Jan Vanek Jan Cech Josef Psutka

Our goal is to create speaker models in audio domain and face models in video domain from a set of videos in an unsupervised manner. Such models can be used later for speaker identification in audio domain (answering the question ”Who was speaking and when”) and/or for face recognition (”Who was seen and when”) for given videos that contain speaking persons. The proposed system is based on an a...

متن کامل

Multicast Audio over Wireless LAN for Professional Applications

2004

V. Kravcenko M. Purat

This paper addresses the problem of high quality audio transmission over an air interface. The analysis is based on an existing embedded audio multi-room system and the usage of standard IEEE802.11 WLAN components. An access scheme based on the emerging IEEE802.11e supplement is proposed to meet the hard real-time requirements of audio streaming. Measurements show the performance of the air int...

متن کامل

Noisemes: Manual Annotation of Environmental Noise in Audio Streams

2012

Susanne Burger Qin Jin Peter F. Schulam Florian Metze

Audio information retrieval is a difficult problem due to the highly unstructured nature of the data. A general labeling system for identifying audio patterns could unite research efforts in the field. This paper introduces 42 distinct labels, the “noisemes”, developed for the manual annotation of noise segments as they occur in audio streams of consumer captured and semiprofessionally produced...

متن کامل

Conducting Audio Files via Computer Vision

2003

Declan Murphy Tue Haste Andersen Kristoffer Jensen

This paper presents a system to control the playback of audio files by means of the standard classical conducting technique. Computer vision techniques are developed to track a conductor’s baton, and the gesture is subsequently analysed. Audio parameters are extracted from the sound-file and are further processed for audio beat tracking. The sound-file playback speed is adjusted in order to bri...

متن کامل

Data-Driven Live Coding with DataToMusic API

2016

Takahiko Tsuchiya Jason Freeman Lee W. Lerner

Creating interactive audio applications for web browsers often involves challenges such as time synchronization between non-audio and audio events within thread constraints and format-dependent mapping of data to synthesis parameters. In this paper, we describe a unique approach for these issues with a data-driven symbolic music application programming interface (API) for rapid and interactive ...

متن کامل

Developing Audio Processing Agents for Multi-agent Mpeg-7 Enabled Environment

2003

Mingkun Li Gang Wei Valery A. Petrushin Ishwar K. Sethi

This paper presents a methodology for developing audio processing agents for a multi-agent environment that is known as the Community of Multimedia Agents. The Community’s philosophy, objectives and architecture are described. The methodology is illustrated using audio feature extraction agents as example. The algorithms used for extracting audio features are classical and work in general audio...

متن کامل