Contrasting Objective and Subjective Portuguese Texts from Heterogeneous Sources

Authors

  • Michel Généreux
  • William Martinez
Abstract

This paper contrasts the content and form of objective versus subjective texts. A collection of on-line newspaper news items serves as objective texts, while parliamentary speeches (debates) and blog posts form the basis of our subjective texts, all in Portuguese. The aim is to provide general linguistic patterns as used in objective written media and subjective speeches and blog posts, to help construct domain-independent templates for information extraction and opinion mining. Our hybrid approach combines statistical data with linguistic knowledge to filter out irrelevant patterns. As resources for subjective classification are still limited for Portuguese, we use a parallel corpus and tools developed for English to build our subjective spoken corpus: annotations produced for English are projected onto the Portuguese side of the parallel corpus. A measure for the saliency of n-grams is used to extract relevant linguistic patterns deemed “objective” and “subjective”. Perhaps unsurprisingly, our contrastive approach shows that, in Portuguese at least, subjective texts are characterized by markers such as descriptive, reactive and opinionated terms, while objective texts are characterized mainly by the absence of subjective markers.
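
The abstract does not say which saliency measure is used, so the following is only a minimal sketch of the general idea of contrasting n-gram frequencies between two corpora. It assumes a smoothed log-ratio of relative frequencies as the score; the function names (`ngrams`, `count_ngrams`, `saliency`), the whitespace tokenization, and the toy sentences are all hypothetical and not taken from the paper.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return the n-grams (as tuples) found in a token sequence."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def count_ngrams(docs, n):
    """Count n-grams over documents, using naive lowercase whitespace tokens."""
    counts = Counter()
    for doc in docs:
        counts.update(ngrams(doc.lower().split(), n))
    return counts


def saliency(target_docs, reference_docs, n=2, smoothing=1.0):
    """Score each n-gram by the log-ratio of its smoothed relative frequency
    in the target corpus versus the reference corpus.

    Assumption: this log-ratio stands in for the paper's unspecified measure.
    Positive scores mark n-grams typical of the target corpus.
    """
    target = count_ngrams(target_docs, n)
    reference = count_ngrams(reference_docs, n)
    vocab = set(target) | set(reference)
    # Add-k smoothing so n-grams missing from one corpus do not divide by zero.
    target_total = sum(target.values()) + smoothing * len(vocab)
    reference_total = sum(reference.values()) + smoothing * len(vocab)
    scores = {}
    for gram in vocab:
        p_target = (target[gram] + smoothing) / target_total
        p_reference = (reference[gram] + smoothing) / reference_total
        scores[gram] = math.log(p_target / p_reference)
    return scores


# Toy data (invented for illustration only).
subjective_docs = ["eu acho que isso é péssimo", "eu acho que é ótimo"]
objective_docs = ["o governo anunciou o plano", "o governo aprovou o plano"]

scores = saliency(subjective_docs, objective_docs, n=2)
for gram, score in sorted(scores.items(), key=lambda kv: -kv[1])[:3]:
    print(" ".join(gram), round(score, 3))
```

In this sketch, bigrams that occur only in the subjective toy corpus receive positive scores and those that occur only in the objective one receive negative scores. A linguistic filter of the kind the abstract mentions, for example one based on part-of-speech patterns, would be applied on top of such a ranking.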

Similar resources

Unsupervised Topic Modeling for Short Texts Using Distributed Representations of Words

We present an unsupervised topic model for short texts that performs soft clustering over distributed representations of words. We model the low-dimensional semantic vector space represented by the dense distributed representations of words using Gaussian mixture models (GMMs) whose components capture the notion of latent topics. While conventional topic modeling schemes such as probabilistic l...

A Definition of Literary Literacy: a Content Analysis of Literature Syllabuses and Interviews with Portuguese Lecturers of Literature

Rita Baleiro, School of Management, Hospitality and Tourism, University of the Algarve (Portugal). Abstract: The aim of this paper is to present a definition of literary literacy in the context of majors in languages, literatures and cultures, in P...

Towards an Objective Voice Preference Definition for the Portuguese Language

In this paper, it is our aim to define a set of objective acoustic criteria, based on subjective listeners’ assessment of talent voices, which can help to automatically rate the voice font quality, bearing in mind the objective definition of voice preference for the Portuguese language. For this purpose a multilingual and multispeaker database was recorded and a set of subjective and objective ...

The Relationship between Subjective Evaluation of Stressors and Depression in Menopausal Women: The Mediating Role of Life Satisfaction

Objective: Previous studies have shown that menopausal women are more likely to experience depression. However, there are few studies that investigated the cognitive mechanism that may have a role in developing depression in menopausal women. Thus, the present study aimed to investigate the mediating role of life satisfaction in the relation between subjective evaluation of stressors and depres...

Uma Ferramenta para Identificar Desvios de Linguagem na Língua Portuguesa (A tool to identify the linguistic deviations in the Portuguese Language) [In Portuguese]

Abstract. The revision of formal texts is a complex task and occurs in several areas. The objective of this work is to create a tool to support the revision of texts and promote studies in automatic correction of descriptive texts. We propose a reviewer for automatic identification of language deviations in formal descriptive texts using natural language processing techniques. A case study...

Publication date: 2012