Person Identification from Text and Speech Genre Samples

نویسندگان

Jade Goldstein-Stewart

Ransom K. Winder

Roberta Evans Sabin

چکیده

In this paper, we describe experiments conducted on identifying a person using a novel unique correlated corpus of text and audio samples of the person’s communication in six genres. The text samples include essays, emails, blogs, and chat. Audio samples were collected from individual interviews and group discussions and then transcribed to text. For each genre, samples were collected for six topics. We show that we can identify the communicant with an accuracy of 71% for six fold cross validation using an average of 22,000 words per individual across the six genres. For person identification in a particular genre (train on five genres, test on one), an average accuracy of 82% is achieved. For identification from topics (train on five topics, test on one), an average accuracy of 94% is achieved. We also report results on identifying a person’s communication in a genre using text genres only as well as audio genres only.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust audio-based classification of video genre

Video genre classification is a challenging task in a global context of fast growing video collections available on the Internet. This paper presents a new method for video genre identification by audio analysis. Our approach relies on the combination of low and high level audio features. We investigate the discriminative capacity of features related to acoustic instability, speaker interactivi...

متن کامل

Development of a genre-dependent TTS system with cross-speaker speaking-style transplantation

One of the biggest challenges in speech synthesis is the production of contextually-appropriate naturally sounding synthetic voices. This means that a Text-To-Speech system must be able to analyze a text beyond the sentence limits in order to select, or even modulate, the speaking style according to a broader context. Our current architecture is based on a two-step approach: text genre identifi...

متن کامل

A comparative sociopragmatic analysis of wedding invitations in American and Iranian societies and teaching implications

Wedding invitations (WIs), as a uniquely socially and culturally constructed genre, provide a distinct opportunity to compare the sociocultural values of different speech communities as reflected in the textual content and organization of the different moves. Students can be exposed to this genre and its different moves using a genre-based pedagogy. Genre-based ped...

متن کامل

Modeling the acoustic correlates of expressive elements in text genres for expressive text-to-speech synthesis

This paper proposes a novel approach for describing the expressive elements in text genres and modeling their acoustic correlates for expressive text-to-speech synthesis (TTS). We apply the three-dimensional PAD (pleasure-displeasure, arousal-nonarousal and dominance-submissiveness) model in describing expressivity. In particular, we define a set of principles for annotating the P and A values ...

متن کامل

Automated captioning of television programs: development and analysis of a soundtrack corpus

The purpose of this research is to investigate methods for applying speech recognition techniques to improve the productivity of off-line captioning for television. We posit that existing corpora for training continuous speech recognisers are unrepresentative of the acoustic conditions of television soundtracks. To evaluate the use of application specific models to this task we have developed a...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2009

Person Identification from Text and Speech Genre Samples

نویسندگان

چکیده

منابع مشابه

Robust audio-based classification of video genre

Development of a genre-dependent TTS system with cross-speaker speaking-style transplantation

A comparative sociopragmatic analysis of wedding invitations in American and Iranian societies and teaching implications

Modeling the acoustic correlates of expressive elements in text genres for expressive text-to-speech synthesis

Automated captioning of television programs: development and analysis of a soundtrack corpus

عنوان ژورنال:

اشتراک گذاری