Voice-based Age and Gender Recognition using Training Generative Sparse Model

author

Samira Mavaddati , University of Mazandaran

Abstract:

Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to general signal classification scheme, our proposed algorithm includes train step to provide related atoms to each signal class and test phase to assess classification performance. Since the classification accuracy depends highly on the selected features, we employ Mel-frequency cepstral coefficients to train basis for better representation of speech structure. These bases are learned over data of male and female speakers using non-negative matrix factorization with sparsity constraint. Then, atom correction is carried out using an energy-based algorithm to decrease coherence between different categories of trained dictionaries. In sparse representation of each data class, atoms related to other sets with the highest energy are replaced with the lowest energy bases if reconstruction error does not exceed from a specified limit. The experimental results show that the proposed algorithm performs better than the earlier methods in this context especially in the presence of background noise.

Download for Free

Already have an account?login

similar resources

Face Recognition in Thermal Images based on Sparse Classifier

Despite recent advances in face recognition systems, they suffer from serious problems because of the extensive types of changes in human face (changes like light, glasses, head tilt, different emotional modes). Each one of these factors can significantly reduce the face recognition accuracy. Several methods have been proposed by researchers to overcome these problems. Nonetheless, in recent ye...

full text

A Novel Text - Independent Voice based Automatic Gender Recognition System

Voice based gender and age classification can be helpful in a number of Information Technology based applications with speech interfaces. The recognition has to be independent of the text of the input speech if the application is online. In this work three different feature sets were tried for text independent gender recognition. The first set is Mel-Frequency Cepstral Coefficients (MFCC) C1 to...

full text

Face Recognition using an Affine Sparse Coding approach

Sparse coding is an unsupervised method which learns a set of over-complete bases to represent data such as image and video. Sparse coding has increasing attraction for image classification applications in recent years. But in the cases where we have some similar images from different classes, such as face recognition applications, different images may be classified into the same class, and hen...

full text

Voice Morphing Using the Generative Topographic Mapping

In this paper we address the problem of Voice Morphing. We attempt to transform the spectral characteristics of a source speakers speech signal so that the listener would believe that the speech was uttered by a target speaker. The voice morphing system transforms the spectral envelope as represented by a Linear Prediction model. The transformation is achieved by codebook mapping using the Gen...

full text

Voice Impersonation using Generative Adversarial Networks

Voice impersonation is not the same as voice transformation, although the latter is an essential element of it. In voice impersonation, the resultant voice must convincingly convey the impression of having been naturally produced by the target speaker, mimicking not only the pitch and other perceivable signal qualities, but also the style of the target speaker. In this paper, we propose a novel...

full text

Feature Extraction based Face Recognition, Gender and Age Classification

The face recognition system with large sets of training sets for personal identification normally attains good accuracy. In this paper, we proposed Feature Extraction based Face Recognition, Gender and Age Classification (FEBFRGAC) algorithm with only small training sets and it yields good results even with one image per person. This process involves three stages: Pre-processing, Feature Extrac...

full text

My Resources

Save resource for easier access later

Save to my library Already added to my library

{@ msg_add @}

Journal title

International Journal of Engineering

volume 31 issue 9

pages 1529- 1535

publication date 2018-09-01

unfollow

{@ msg @}

By following a journal you will be notified via email when a new issue of this journal is published.

Keywords

Gender recognition Sparse Non negative matrix factorization Incoherence Mel frequency cepstral coefficient Voice processing

Hosted on Doprax cloud platform doprax.com