Supra-segmental Feature Based Speaker Trait Detection

نویسندگان

Gang Liu

John H.L. Hansen

Erik Jonsson

چکیده

It is well known that speech utterances convey a rich diversity of information concerning the speaker in addition to related semantic content. Such information may contain speaker traits such as personality, likability, health/pathology, etc. To detect speaker traits in human computer interface is an important task toward formulating more efficient and natural computer engagement. This study proposes two groups of supra-segmental features for improving speaker trait detection performance. Compared with the 6125 dimension features based baseline system, the proposed supra-segmental system not only improves performance by 9.0%, but also is computationally attractive and proper for real life application since it derives a less than 63 dimension features, which are 99% less than the baseline system.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust Target Speaker Tracking in Broadcast TV Streams

This paper addresses the problem of audio change detection and speaker tracking in broadcast TV streams. A two-pass audio change detection algorithm, which includes detection of the potential change boundaries and refinement, is proposed. Speaker tracking is performed based on the results of speaker change detection. In speaker tracking, Wiener filtering, endpoint detection of pitch, and segmen...

متن کامل

Speech Recognition Only with Supra - segmental Features — Hearing Speech as Music —

This paper proposes a novel paradigm of speech recognition where only the supra-segmental features are utilized. Absolute properties of speech events such as formants and spectrums are completely discarded and only the relative and differential properties of the events are extracted as phonic contrasts. The phonic contrasts are considered as supra-segmental features and they are mathematically ...

متن کامل

On the use of supra model information from multiple classifiers for robust speaker identification

In this paper, we propose a text-independent speaker identification (SI) scheme under uncertainty. In this scheme, extraction of supra model information about probability distributions in the feature space is proposed. Supra modeling is a model clustering technique which groups the speaker models into model sets where the speakers in these sets have similar properties. The scheme uses the Demps...

متن کامل

The Eﬀect of Using PRAAT Software on Pre-Intermediate EFL Learners’ Supra Segmental Features

The present study investigated the eﬀect of using PRAAT as a free computer software package for the scientific analysis of speech in phonetics on pre-intermediate Iranian English as foreign language (EFL) learners’ supra segmental features (i.e., intonation and stress). The design of the study was a Quasi-experimental research design with a pre and post-test. In doing so...

متن کامل

A novel feature extraction strategy for multi-stream robust emotion identification

We investigate an effective feature extraction front-end for speech emotion recognition, which performs well in clean and noisy conditions. First, we explore the use of perceptual minimum variance distortionless response (PMVDR). These features, originally proposed for accent/dialect and language identification (LID), can better approximate the perceptual scales and are less sensitive to noise ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2014

Supra-segmental Feature Based Speaker Trait Detection

نویسندگان

چکیده

منابع مشابه

Robust Target Speaker Tracking in Broadcast TV Streams

Speech Recognition Only with Supra - segmental Features — Hearing Speech as Music —

On the use of supra model information from multiple classifiers for robust speaker identification

The Eﬀect of Using PRAAT Software on Pre-Intermediate EFL Learners’ Supra Segmental Features

A novel feature extraction strategy for multi-stream robust emotion identification

عنوان ژورنال:

اشتراک گذاری