What Patients Can Tell Us: Topic Analysis for Social Media on Breast Cancer
نویسندگان
چکیده
BACKGROUND Social media dedicated to health are increasingly used by patients and health professionals. They are rich textual resources with content generated through free exchange between patients. We are proposing a method to tackle the problem of retrieving clinically relevant information from such social media in order to analyze the quality of life of patients with breast cancer. OBJECTIVE Our aim was to detect the different topics discussed by patients on social media and to relate them to functional and symptomatic dimensions assessed in the internationally standardized self-administered questionnaires used in cancer clinical trials (European Organization for Research and Treatment of Cancer [EORTC] Quality of Life Questionnaire Core 30 [QLQ-C30] and breast cancer module [QLQ-BR23]). METHODS First, we applied a classic text mining technique, latent Dirichlet allocation (LDA), to detect the different topics discussed on social media dealing with breast cancer. We applied the LDA model to 2 datasets composed of messages extracted from public Facebook groups and from a public health forum (cancerdusein.org, a French breast cancer forum) with relevant preprocessing. Second, we applied a customized Jaccard coefficient to automatically compute similarity distance between the topics detected with LDA and the questions in the self-administered questionnaires used to study quality of life. RESULTS Among the 23 topics present in the self-administered questionnaires, 22 matched with the topics discussed by patients on social media. Interestingly, these topics corresponded to 95% (22/23) of the forum and 86% (20/23) of the Facebook group topics. These figures underline that topics related to quality of life are an important concern for patients. However, 5 social media topics had no corresponding topic in the questionnaires, which do not cover all of the patients' concerns. Of these 5 topics, 2 could potentially be used in the questionnaires, and these 2 topics corresponded to a total of 3.10% (523/16,868) of topics in the cancerdusein.org corpus and 4.30% (3014/70,092) of the Facebook corpus. CONCLUSIONS We found a good correspondence between detected topics on social media and topics covered by the self-administered questionnaires, which substantiates the sound construction of such questionnaires. We detected new emerging topics from social media that can be used to complete current self-administered questionnaires. Moreover, we confirmed that social media mining is an important source of information for complementary analysis of quality of life.
منابع مشابه
The Effect of Social Media on the Breast Cancer Knowledge and Health Beliefs of Women
Introduction: The present study aimed at determining the effect of social media on breast cancer knowledge and health behaviors of the women. Methods: The data were collected from 476 women who had willing to participate in the study, using Google forms on social media from February to May 2018. Results: The results indicated that the time spent on social media decreased, and self- efficacy a...
متن کاملThe Effect of Social Media on the Breast Cancer Knowledge and Health Beliefs of Women
Introduction: The present study aimed at determining the effect of social media on breast cancer knowledge and health behaviors of the women. Methods: The data were collected from 476 women who had willing to participate in the study, using Google forms on social media from February to May 2018. Results: The results indicated that the time spent on social media decreased, and self- efficacy a...
متن کامل"What does the Customer Want to Tell US?" an Automated Classification Approach for Social Media Posts at Small and Medium-Sized Enterprises
Social media posts created by customers capture a lot of business relevant information for decisionmakers, e.g., current consumer expectations on products and services. For that purpose, the social media posts need to be analyzed thoroughly. In this respect, a topic-related classification facilitates managerial decision-making because business relevant topics, social media users discuss about, ...
متن کاملHealth-Related Hot Topic Detection in Online Communities Using Text Clustering
Recently, health-related social media services, especially online health communities, have rapidly emerged. Patients with various health conditions participate in online health communities to share their experiences and exchange healthcare knowledge. Exploring hot topics in online health communities helps us better understand patients' needs and interest in health-related knowledge. However, th...
متن کاملTreatment and Predicting Life expectancy for women with breast cancer based on perception of the disease, perceived social support, and coping styles
Background: Today, cancer is a growing phenomenon that is recognized as one of the major problems for contemporary human health. Breast cancer is still the most common cancer among women in the world. Living with breast cancer presents women with significant challenges that interfere with their physical, social, psychological, economic and spiritual life of patients. These challenges are major ...
متن کامل