Classifying Message Board Posts with an Extracted Lexicon of Patient Attributes
نویسندگان
چکیده
The goal of our research is to distinguish veterinary message board posts that describe a case involving a specific patient from posts that ask a general question. We create a text classifier that incorporates automatically generated attribute lists for veterinary patients to tackle this problem. Using a small amount of annotated data, we train an information extraction (IE) system to identify veterinary patient attributes. We then apply the IE system to a large collection of unannotated texts to produce a lexicon of veterinary patient attribute terms. Our experimental results show that using the learned attribute lists to encode patient information in the text classifier yields improved performance on this task.
منابع مشابه
Classifying Sentences as Speech Acts in Message Board Posts
This research studies the text genre of message board forums, which contain a mixture of expository sentences that present factual information and conversational sentences that include communicative acts between the writer and readers. Our goal is to create sentence classifiers that can identify whether a sentence contains a speech act, and can recognize sentences containing four different spee...
متن کاملPredicting the Importance of Newsfeed Posts and Social Network Friends
As users of social networking websites expand their network of friends, they are often flooded with newsfeed posts and status updates, most of which they consider to be “unimportant” and not newsworthy. In order to better understand how people judge the importance of their newsfeed, we conducted a study in which Facebook users were asked to rate the importance of their newsfeed posts as well as...
متن کاملIdentifying potential adverse effects using the web: A new approach to medical hypothesis generation
Medical message boards are online resources where users with a particular condition exchange information, some of which they might not otherwise share with medical providers. Many of these boards contain a large number of posts and contain patient opinions and experiences that would be potentially useful to clinicians and researchers. We present an approach that is able to collect a corpus of m...
متن کاملMedpie: an Information Extraction Package for Medical Message Board Posts
SUMMARY We have developed medpie, a software package for preparing medical message board corpora and extracting patient mentions and statistics for drugs, herbs and adverse effects experienced from them. The package is divided into web-crawling, HTML-cleaning, de-identification and information extraction modules. It also includes a sample controlled vocabulary of drugs, herbs and adverse effect...
متن کاملIdentifying Information in Stock Message Boards and Its Implications for Stock Market Efficiency
The information value of stock message boards has often been debated. A main difficulty in assessing the value is the presence of a large number of posts with varying quality. This paper presents an intuitive approach to identify and aggregate information in stock message boards. We weigh each post’s recommendation by its author’s credibility based on accuracy of his past posts. We find that th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013