Person Identification from Text and Speech Genre Samples
نویسندگان
چکیده
In this paper, we describe experiments conducted on identifying a person using a novel unique correlated corpus of text and audio samples of the person’s communication in six genres. The text samples include essays, emails, blogs, and chat. Audio samples were collected from individual interviews and group discussions and then transcribed to text. For each genre, samples were collected for six topics. We show that we can identify the communicant with an accuracy of 71% for six fold cross validation using an average of 22,000 words per individual across the six genres. For person identification in a particular genre (train on five genres, test on one), an average accuracy of 82% is achieved. For identification from topics (train on five topics, test on one), an average accuracy of 94% is achieved. We also report results on identifying a person’s communication in a genre using text genres only as well as audio genres only.
منابع مشابه
Robust audio-based classification of video genre
Video genre classification is a challenging task in a global context of fast growing video collections available on the Internet. This paper presents a new method for video genre identification by audio analysis. Our approach relies on the combination of low and high level audio features. We investigate the discriminative capacity of features related to acoustic instability, speaker interactivi...
متن کاملDevelopment of a genre-dependent TTS system with cross-speaker speaking-style transplantation
One of the biggest challenges in speech synthesis is the production of contextually-appropriate naturally sounding synthetic voices. This means that a Text-To-Speech system must be able to analyze a text beyond the sentence limits in order to select, or even modulate, the speaking style according to a broader context. Our current architecture is based on a two-step approach: text genre identifi...
متن کاملA comparative sociopragmatic analysis of wedding invitations in American and Iranian societies and teaching implications
Wedding invitations (WIs), as a uniquely socially and culturally constructed genre, provide a distinct opportunity to compare the sociocultural values of different speech communities as reflected in the textual content and organization of the different moves. Students can be exposed to this genre and its different moves using a genre-based pedagogy. Genre-based ped...
متن کاملModeling the acoustic correlates of expressive elements in text genres for expressive text-to-speech synthesis
This paper proposes a novel approach for describing the expressive elements in text genres and modeling their acoustic correlates for expressive text-to-speech synthesis (TTS). We apply the three-dimensional PAD (pleasure-displeasure, arousal-nonarousal and dominance-submissiveness) model in describing expressivity. In particular, we define a set of principles for annotating the P and A values ...
متن کاملAutomated captioning of television programs: development and analysis of a soundtrack corpus
The purpose of this research is to investigate methods for applying speech recognition techniques to improve the productivity of off-line captioning for television. We posit that existing corpora for training continuous speech recognisers are unrepresentative of the acoustic conditions of television soundtracks. To evaluate the use of application specific models to this task we have developed a...
متن کامل