Denoising Speech Signals with Hifi-Coulomb-GANs

نویسندگان

چکیده

Recorded speech signals often contain noise that affects the quality of signal and reduces intelligibility. Several studies have used Generative Adversarial Networks (GANs) to remove artifacts improve However, GANs can suffer from gradient vanishing or explosion reduce their effectiveness in denoising. To mitigate vanishing, we applied CoulombGAN architecture denoising using a model structure similar Hifi-GAN, current state art denoiser. We call this new Hifi-CoGAN. WaveNet generator denoise signals, PostNet for general cleanup, Multi-Resolution Discriminator evaluate relative clean signal. Our results show Hifi-CoGAN was able outperform Hifi-GAN many narrowband (signals with limited range frequencies) terms Short-Term Objective Intelligibility (STOI) Perceptual Evaluation Speech Quality (PESQ) metrics. did not perform as well wideband wider such white noise, so future work must be done these signals.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Coulomb Gans: Provably Optimal Nash Equi-

Generative adversarial networks (GANs) evolved into one of the most successful unsupervised techniques for generating realistic images. Even though it has recently been shown that GAN training converges, GAN models often end up in local Nash equilibria that are associated with mode collapse or otherwise fail to model the target distribution. We introduce Coulomb GANs, which pose the GAN learnin...

متن کامل

Controlling a HIFI with a continuous speech understanding system

In this paper we present a speech understanding system that accepts continuous speech sentences as input to command a HIFI set. The string of words obtained from the recogniser is sent to the understanding system that tries to fill in a set of frames specifying the triplet (SUBSYSTEM, PARAMETER, VALUE). The understanding module follows the philosophy presented in [1]. The triplets are finally t...

متن کامل

Inertial Sensor Signals Denoising with Wavelet Transform

In the current paper we propose a new software procedure for processing data from an inertial navigation system boarded on a moving vehicle, in order to achieve accurate navigation information on the displacement of the vehicle in terms of position, speed, acceleration and direction. We divided our research in three phases. In the first phase of our research, we implemented a realtime evaluatio...

متن کامل

Speech enhancement with weighted denoising auto-encoder

A novel speech enhancement method with Weighted Denoising Auto-encoder (WDA) is proposed in this paper. A weighted reconstruction loss function is introduced to the conventional Denoising Auto-encoder (DA), and makes it suitable for the task of speech enhancement. First, the proposed WDA is used to model the relationship between the noisy and clean power spectrums of speech signal. Then, the es...

متن کامل

Denoising base-band communication signals

This paper presents a new denoising method for base-band communication signals corrupted by additive noise. The novelty of this paper is the use of a special MAP filter, called composed bishrink, for communication purposes. A complete statistical analysis of this filter is reported. Some simulations are presented. The results obtained are compared with the results of matched filtering technique...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Student Research

سال: 2022

ISSN: ['2167-1907']

DOI: https://doi.org/10.47611/jsrhs.v11i3.3501