Cultural micro-blog Contextualization 2016 Workshop Overview: data and pilot tasks
Abstract
The CLEF Cultural micro-blog Contextualization Workshop aims to provide the research community with data sets for gathering, organizing and delivering relevant social data related to events that generate a large number of micro-blog posts and web documents. It is also devoted to discussing tasks that could be run on this data set and that could serve applications.
Similar resources
Tweet Data mining : the Cultural Microblog Contextualization Data Set
This paper presents an overview of the data set that was used for the Cultural Microblog Contextualization Workshop at CLEF 2016, and more specifically for Task 1: tweet contextualization. We first present a descriptive analysis of the data: we consider the variables or features associated with the tweets and analyse them. We then analyse the textual content of the tweets. The res...
Overview of the CLEF 2016 Cultural Micro-blog Contextualization Workshop
Many statistical studies have shown the importance of social media; they now seem to be the main Internet activity for Americans, even when compared to email, and most of the social media. Chinese users spend an average of almost 90 minutes per day on social networks. Social media is thus a key medium for any company or organization, specifically in Business Intelligence related activities. Co...
Overview of the NLPCC-ICCPOL 2016 Shared Task: Chinese Word Segmentation for Micro-Blog Texts
In this paper, we give an overview of the shared task at the 5th CCF Conference on Natural Language Processing & Chinese Computing (NLPCC 2016): Chinese word segmentation for micro-blog texts. Unlike the commonly used newswire datasets, the dataset of this shared task consists of relatively informal micro-texts. In addition, we use a new psychometric-inspired evaluation metric for ...
Named Entity Resources - Overview and Outlook
Recognition of real-world entities is crucial for most NLP applications. Since its introduction some twenty years ago, named entity processing has undergone a significant evolution with, among others, the definition of new tasks (e.g. entity linking) and the emergence of new types of data (e.g. speech transcriptions, micro-blogging). These certainly pose new challenges which affect not only met...
Building a Knowledge Base using Microblogs: the Case of Cultural MicroBlog Contextualization Collection
The Cultural MicroBlog Contextualization (CMC) Workshop provides a collection of tweets on cultural events related to festivals. Given the size of a tweet, the information obtained from a single post is often very partial. We develop the idea that a set of tweets about an event could give a more complete view of that event by combining all the information posted. In this paper, we prop...