speaker transformation

نتایج جستجو برای: speaker transformation

تعداد نتایج: 242055 فیلتر نتایج به سال:

Connectionist Transformation Network Features for Speaker Recognition

2010

Alberto Abad Jordi Luque

Alternative approaches to conventional short-term cepstral modelling of speaker characteristics have been proposed and successfully incorporated to current state-of-the art systems for speaker recognition. Particularly, the use of adaptation transforms employed in speech recognition systems as features for speaker recognition is one of the most appealing recent proposals. In this paper, we also...

متن کامل

Transformation of formants for voice conversion using artificial neural networks

Journal: :Speech Communication 1995

M. Narendranath Hema A. Murthy S. Rajendran Bayya Yegnanarayana

In this paper we propose a scheme for developing a voice conversion system that converts the speech signal uttered by a source speaker to a speech signal having the voice characteristics of the target speaker. In particular, we address the issue of transformation of the vocal tract system features from one speaker to another. Formants are used to represent the vocal tract system features and a ...

متن کامل

Probabilistic feature-based transformation for speaker verification over telephone networks

Journal: :Neurocomputing 2007

متن کامل

Voice transformation using principle component analysis based LSF quantization and dynamic programming approach

2005

Özgül Salor-Durna Mübeccel Demirekler

The goal of voice transformation (VT) is to modify the speech of a source speaker such that it is perceived as if spoken by a target speaker. In this paper, we present a speaker specific line spectral frequency (LSF) quantization based on principle component analysis (PCA) and k-means clustering for VT. An LPC based source-filter model is used to model the speech. Transformation is applied to t...

متن کامل

A DYNAMIC PROGRAMMING APPROACH TO CONTEXT−FREE VOICE TRANSFORMATION (MonAmOR3)

2005

Ozgul Salor Mubeccel Demirekler

In this paper, we present a dynamic programming approach to voice transformation (VT). The goal of VT is to modify the speech of a source speaker such that it is perceived as if spoken by a target speaker. The speech model used in this work is based on MELP (Mixed Excitation Linear Prediction) speech coding algorithm. The designed system obtains speaker−specific codebooks of line spectral frequ...

متن کامل

Voice Transformation Using Two-Level Dynamic Warping and Neural Networks

Journal: :Signals 2021

Voice transformation, for example, from a male speaker to female speaker, is achieved here using two-level dynamic warping algorithm in conjunction with an artificial neural network. An outer process which temporally aligns blocks of speech (dynamic time warp, DTW) invokes inner process, spectrally based on magnitude spectra frequency DFW). The mapping function produced by warp used move spectr...

متن کامل

Including dynamic and phonetic information in voice conversion systems

2004

Antonio Bonafonte Alexander Kain Jan P. H. van Santen Helenca Duxans

Voice Conversion (VC) systems modify a speaker voice (source speaker) to be perceived as if another speaker (target speaker) had uttered it. Previous published VC approaches using Gaussian Mixture Models [1] performs the conversion in a frame-by-frame basis using only spectral information. In this paper, two new approaches are studied in order to extend the GMM-based VC systems. First, dynamic ...

متن کامل

Denoising autoencoder-based speaker feature restoration for utterances of short duration

2015

Hitoshi Yamamoto Takafumi Koshinaka

This paper describes a speaker feature restoration method for improving text-independent speaker recognition with short utterances. The method employs a denoising autoencoder (DAE) to compensate speaker features of a short utterance which contains limited phonetic information. It first estimates phonetic distribution in the utterance as posteriors based on speech models and then transforms an i...

متن کامل

Talking-Face Identity Verification, Audiovisual Forgery, and Robustness Issues

Journal: :EURASIP J. Adv. Sig. Proc. 2009

Walid Karam Hervé Bredin Hanna Greige Gérard Chollet Chafic Mokbel

The robustness of a biometric identity verification (IV) system is best evaluated by monitoring its behavior under impostor attacks. Such attacks may include the transformation of one, many, or all of the biometric modalities. In this paper, we present the transformation of both speech and visual appearance of a speaker and evaluate its effects on the IV system. We propose MixTrans, a novel met...

متن کامل

A study on the influence of covariance adaptation on jacobian compensation in vocal tract length normalization

2009

Doddipatla Rama Sanand Shakti Prasad Rath Srinivasan Umesh

In this paper, we first show that accounting for Jacobian in Vocal-Tract Length Normalization (VTLN) will degrade the performance when there is a mismatch between the train and test speaker conditions. VTLN is implemented using our recently proposed approach of linear transformation of conventional MFCC, i.e. a feature transformation. In this case, Jacobian is simply the determinant of the line...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید