SU@PAN'2015: Experiments in Author Profiling
نویسندگان
چکیده
We describe the submission of the Sofia University team for the Author Profiling Task, part of the PAN 2015 Challenge. Given a set of writing samples by the same person, the task asks to predict some demographical information such as age and gender, as well as the personality type of that person. We experimented with SVM classifiers using variety of features extracted from publicly available resources, achieving the second-best score for Spanish out of 21 submissions, and the sixthbest for English out of 22 submissions.
منابع مشابه
Syntactic N-grams as Features for the Author Profiling Task: Notebook for PAN at CLEF 2015
This paper describes our approach to tackle the Author Profiling task at PAN 2015. Our method relies on syntactic features, such as syntactic based n-grams of various types in order to predict the age, gender and personality traits that has the author of a given text. In this paper, we describe the used features, the employed classification algorithm, and other general ideas concerning the expe...
متن کاملXRCE Personal Language Analytics Engine for Multilingual Author Profiling: Notebook for PAN at CLEF 2015
This technical notebook describes the methodology used – and results achieved – for the PAN 2015 Author Profiling Challenge by the team from Xerox Research Centre Europe (XRCE). This year, personality traits are introduced alongside age and gender in a corpus of tweets in four languages – English, Spanish, Italian and Dutch. We describe a largely language agnostic methodology for classification...
متن کاملSegmenting Target Audiences: Automatic Author Profiling using Tweets: Notebook for PAN at CLEF 2015
This paper describes a methodology proposed for author profiling using natural language processing and machine learning techniques. We used lexical information in the learning process. For those languages without lexicons, we automatically translated them, in order to be able to use this information. Finally, we will discuss how we applied this methodology to the 3rd Author Profiling Task at PA...
متن کاملOverview of the PAN/CLEF 2015 Evaluation Lab
This paper presents an overview of the PAN/CLEF evaluation lab. During the last decade, PAN has been established as the main forum of text mining research focusing on the identification of personal traits of authors left behind in texts unintentionally. PAN 2015 comprises three tasks: plagiarism detection, author identification and author profiling studying important variations of these problem...
متن کاملSU@PAN'2015: Experiments in Author Verification
We describe the submission of the Sofia University team for the Author Identification Task, part of the PAN 2015 Challenge. Given a small set of documents by a single person and a “questioned” document, possibly of a different genre and/or topic, the task is to determine whether the questioned document was written by the same person who wrote the known document set. This is a hard but realistic...
متن کامل