Machine Learning based approach for Human Trait Identification from Blog Data
نویسندگان
چکیده
Emotions form a major part of a person's personality. Emotional intelligence (EI) is the ability to identify, assess, and control the emotions of oneself, of others, and of groups. The written expressions reflect author's personality. Various personality traits can be determined by the analysis of the contents written by a person. This paper proposes a novel technique for human trait identification from the analysis of author's written expressions. The proposed technique is based on the concept of supervised machine learning and uses Support Vector Machine for classifying the personality of a writer. We classify the personality of a writer into five categories namely, highly extrovert, highly introvert, low introvert, low extrovert and ambivert. Experiments have been carried out on the real world blog data and results demonstrate that the proposed technique can determine the personality traits of a writer with accuracy and speed. We have also implemented a PHP based online system, which reads the contents of a blog and can automatically predict the personality of writer of the blog
منابع مشابه
Damage identification of structures using second-order approximation of Neumann series expansion
In this paper, a novel approach proposed for structural damage detection from limited number of sensors using extreme learning machine (ELM). As the number of sensors used to measure modal data is normally limited and usually are less than the number of DOFs in the finite element model, the model reduction approach should be used to match with incomplete measured mode shapes. The second-order a...
متن کاملFuzzy Rule Based Systems for Gender Classification from Blog Data
Gender classification is a popular machine learning task, which has been involved in various application areas, such as business intelligence, access control and cyber security. In the context of information granulation, gender related information can be divided into three types, namely, biological information, vision based information and social network based information. In traditional machin...
متن کاملBehavioral Analysis of Traffic Flow for an Effective Network Traffic Identification
Fast and accurate network traffic identification is becoming essential for network management, high quality of service control and early detection of network traffic abnormalities. Techniques based on statistical features of packet flows have recently become popular for network classification due to the limitations of traditional port and payload based methods. In this paper, we propose a metho...
متن کاملEmotion Detection in Persian Text; A Machine Learning Model
This study aimed to develop a computational model for recognition of emotion in Persian text as a supervised machine learning problem. We considered Pluthchik emotion model as supervised learning criteria and Support Vector Machine (SVM) as baseline classifier. We also used NRC lexicon and contextual features as training data and components of the model. One hundred selected texts including pol...
متن کاملAPPLICATION OF THE HYBRID HARMONY SEARCH WITH SUPPORT VECTOR MACHINE FOR IDENTIFICATION AND CALSSIFICATION OF DAMAGED ZONE AROUND UNDERGROUND SPACES
An excavation damage zone (EDZ) can be defined as a rock zone where the rock properties and conditions have been changed due to the processes related to an excavation. This zone affects the behavior of rock mass surrounding the construction that reduces the stability and safety factor and increase probability of failure of the structure. This paper presents an approach to build a model for the ...
متن کامل