Assamese Vowel Phoneme Recognition Using Zero Crossing Rate and Short-time Energy

نویسنده

  • Bhargab Medhi
چکیده

Speaker recognition is the identification of the person who is speaking by the characteristics of their voices. Assamese is a Indo-Aryan family of languages, mainly spoken in the North-Eastern of India. In this paper text dependent speaker modelling technique is used. The system contains training phase, the testing phase and the recognition phase. The database consists of utterance of 10 speakers with equal number of male and female speaker. Each phoneme is repeated 10 times by each speaker. The feature Zero Crossing Rate (ZCR) and Short-time Energy (STE) are used for the acoustic measures which can be helpful to design an Assamese speaker recognition system. Keywords—Speech recognition, Feature Extraction, Zero Crossing Rate, Short-time Energy, Frame.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

LPC and MFCC Analysis of Assamese Vowel Phonemes

A speech signal contains many levels of information. Speech conveys the information about the language being spoken, the emotion, gender, and the identity of the speaker. Features parameters extracted from speech are very useful for speaker recognition as well as speech recognition. In this paper, the features LPC and MFCC are computed of Assamese vowel phonemes which will be helpful to develop...

متن کامل

Lip Synchronization using Linear Predictive Analysis

Linear Predictive analysis is a widely used technique for speech analysis and encoding. In this paper, we discuss the issues involved in its application to phoneme extraction and lip synchronization. The LP analysis results in a set of reflection coefficients that are closely related to the vocal tract shape. Since the vocal tract shape can be correlated with the phoneme being spoken, LP analys...

متن کامل

Development of a Real-time Embedded System for Speech Emotion Recognition

Speech emotion recognition is one of the latest challenges in speech processing and Human Computer Interaction (HCI) in order to address the operational needs in real world applications. Besides human facial expressions, speech has proven to be one of the most promising modalities for automatic human emotion recognition. Speech is a spontaneous medium of perceiving emotions which provides in-de...

متن کامل

Segmentation and Classification of Vowel Phonemes of Assamese Speech Using a Hybrid Neural Framework

In spoken word recognition, one of the crucial point is to identify the vowel phonemes. Vowel phonemes are used to combine two or more consonant phonemes in most of the words spoken, and the meaning of the words changes with the change of vowels. Therefore, in order to recognize a word, identification of vowel phoneme is as important as the identification of constituent consonant phonemes. This...

متن کامل

Automatic Arabic Speech Segmentation Syste

growth of information and communication technologies has influenced the research trends n speech technologies. This research explains a basic speech segmentation application for Arabic language with the aim to further develop a language tutor. The focus is on rabic as there are standards available which help in obtaining better accuracy. The roblem has been formulated in the form of a number of...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014