compressed speech

Multichannel compression processing for profound deafness.

Journal: :Journal of rehabilitation research and development 1987

E Villchur

Three-channel amplitude compression followed by frequency shaping was used to process test sentences for five profoundly deaf subjects, and the recognition scores were compared to scores achieved with frequency shaping only. At preferred levels, the scores of three of the five subjects showed a statistically significant but not dramatic advantage for compression; the averages of the scores for ...

متن کامل

Static and dynamic modulation spectrum for speech recognition

2009

Sriram Ganapathy Samuel Thomas Hynek Hermansky

We present a feature extraction technique based on static and dynamic modulation spectrum derived from long-term envelopes in sub-bands. Estimation of the sub-band temporal envelopes is done using Frequency Domain Linear Prediction (FDLP). These sub-band envelopes are compressed with a static (logarithmic) and dynamic (adaptive loops) compression. The compressed sub-band envelopes are transform...

متن کامل

Improving robustness to compressed speech in speaker recognition

2013

Mitchell McLaren Victor Abrash Martin Graciarena Yun Lei Jan Pesán

The goal of this paper is to analyze the impact of codecdegraded speech on a state-of-the-art speaker recognition system and propose mitigation techniques. Several acoustic features are analyzed, including the standard Mel filterbank cepstral coefficients (MFCC), as well as the noise-robust medium duration modulation cepstrum (MDMC) and power normalized cepstral coefficients (PNCC), to determin...

متن کامل

Steganalysis of compressed speech to detect covert voice over Internet protocol channels

Journal: :IET Information Security 2011

Yongfeng Huang Shanyu Tang Chunlan Bao Yau Jim Yip

A network covert channel is a passage along which information leaks across the network in violation of security policy in a completely undetectable manner. This paper reveals our findings in analysing the principle of G.723.1 codec that there are ‘unused’ bits in G.723.1 encoded audio frames, which can be used to embed secret messages. A novel steganalysis method that employs the second detecti...

متن کامل

Perceptual adjustment to highly compressed speech: effects of talker and rate changes.

Journal: :Journal of experimental psychology. Human perception and performance 1997

E Dupoux K Green

This study investigated the perceptual adjustments that occur when listeners recognize highly compressed speech. In Experiment 1, adjustment was examined as a function of the amount of exposure to compressed speech by use of 2 different speakers and compression rates. The results demonstrated that adjustment takes place over a number of sentences, depending on the compression rate. Lower compre...

متن کامل

A Novel Algorithm of Sparse Representations for Speech Compression/Enhancement and Its Application in Speaker Recognition System

2016

Satyanand Singh Mansour H. Assaf Abhay Kumar

This paper proposes sparse and redundancy representation spectral domain compression of the speech signal using novel sparsing algorithms to the problem of speech compression (SC)/enhancement (SE). In Automatic Speaker Recognition (ASR) sparsification can play a major role to resolve big data issues in speech compression and its storage in the database, where the speech signal can be uncompress...

متن کامل

Compression of speech database by feature separation and pattern clustering using STRAIGHT

2004

Zhen-Hua Ling Yu Hu Zhiwei Shuang Ren-Hua Wang

This paper presents an alternative solution for speech database compression aiming at the embedded application of concatenative synthesis systems. The waveform of a speech segment is firstly decomposed into a prosodic pattern and a spectral pattern by STRAIGHT – a powerful speech analysissynthesis algorithm. Then all the prosodic and spectral patterns are clustered respectively to remove the re...

متن کامل

Steganalysis of Compressed Speech to Detect Covert VoIP Channels

2010

Yongfeng Huang Shanyu Tang Chunlan Bao Jim Yip

A network covert channel is a passage along which information leaks across the network in violation of security policy in a completely undetectable manner. This paper reveals our findings in analysing the principle of G.723.1 codec that there are ‘unused’ bits in G.723.1 encoded audio frames, which can be used to embed secret messages. A novel steganalysis method that employs the second detecti...

متن کامل

Published in proceedings of the Symposium on Signal Processing for

2003

Cristina Videira Lopes Anshuman Chadha

We describe a method and an implementation for producing a highly compressed representation of speech, in the order of 40 bps. This compression method uses a speech recognition engine to analyze the speech signal at the morphological level, i.e. the words. The words are then coded using a wordlevel text compression mechanism. After decompression, the speech message is recovered using Text-To-Sp...

متن کامل

A Study of Rate Discrimination of Time-Compressed Speech

2007

Lisa J. Stifelman

The purpose of this study is to determine the just noticeable differences (JNDs) for speech rate. The results are intended to be used for the design of an interactive speech speed control. The JND at three different speech rates was determined using the psychophysical method of constant stimuli. Speech stimuli were compressed using the SOLA time-compression technique. The findings show a signif...

متن کامل