Towards Morphological Sound Description Using Segmental Models
نویسندگان
چکیده
We present an approach to model the temporal evolution of audio descriptors using Segmental Models (SMs). This method yields a signal segmentation into a sequence of primitives, constituted by a set of user-defined trajectories . This allows one to consider specific primitive shapes, model their duration and to take into account the time dependence between successive signal frames, contrary to standard Hidden Markov Models. We applied this approach to a database of violin playing. Various types of glissando and dynamics variations were specifically recorded. The results show that our approach using Segmental Models provides a segmentation that can be easily interpreted. Quantitatively, the Segmental Models performed better than standard implementation of Hidden Markov Models.
منابع مشابه
Using morphological description for generic sound retrieval
Systems for sound retrieval are usually “sourcecentred”. This means that retrieval is based on using the proper keywords that define or specify a sound source. Although this type of description is of great interest, it is very difficult to implement it into realistic automatic labelling systems because of the necessity of dealing with thousands of categories, hence with thousands of different s...
متن کاملMorphological Segmentation
Many applications and practices of working with recorded sounds are based on the segmentation and concatenation of fragments of audio streams. In collaborations with composers and sound artists we have observed that a recurrent musical event or sonic shape is often identified by the temporal evolution of the sound features. We would like to contribute to the development of a novel segmentation ...
متن کاملTowards computational morphological description of sound
Research on audio content description deals with limited types of sounds. Most of the work done in this area is applied to automatic transcription of traditional western music, i.e. the conversion of audio into the traditional musical notation pitch/duration/loudness/source or the recognition of the origin of specific sounds (speech, music, applause...) for indexing or retrieval purpose. In tha...
متن کاملInfluences of Segmental Content on the Perception of Word Duration: A First Approach towards a New Perceptual Model of Speech Rhythm
The present research tested the segmental influences on the perception of monosyllabic word durations. 12 listeners of Swiss-German heard pairs of speech and non-speech sounds (monosyllabic words and rectangular-gated sinusoids). They were asked to change the duration of the second sound so that it would match the duration of the first one. Results showed that the Weber Fraction (∆T/T) for tone...
متن کاملEffect of porosity on the characteristics of underwater acoustic sound absorbers using theoretical models
Porous materials have good acoustic damping characteristics over a wide frequency range. As for sound waves, many small-scale pores in the coating materials can convert underwater-coating to rough surfaces. The main property of porous absorbents is their resistance against incident sound wave that leads to damping effect. From a physical point of view, damping occurs due to friction between flu...
متن کامل