نتایج جستجو برای: wikipedia mining

تعداد نتایج: 92181  

2017
Johannes Kiesel Martin Potthast Matthias Hagen Benno Stein

Little is known about what causes anti-social behavior online. The paper at hand analyzes vandalism and damage in Wikipedia with regard to the time it is conducted and the country it originates from. First, we identify vandalism and damaging edits via ex post facto evidence by mining Wikipedia’s revert graph. Second, we geolocate the cohort of edits from anonymous Wikipedia editors using their ...

2008
Simon E Overell

This thesis aims to augment the Geographic Information Retrieval process with information extracted from world knowledge. This aim is approached from three directions: classifying world knowledge, disambiguating placenames and modelling users. Geographic information is becoming ubiquitous across the Internet, with a significant proportion of web documents and web searches containing geographic ...

2008
Darren Hardy

While current GIS research has focused on technological issues of visualization and data organization, the emergence of new forms of collective authorship suggest we need new information frameworks and behaviors. How do individuals contribute place-based information to a digital commons? What are the authorship dynamics of such collective effort? For my research, I will use spatial data mining ...

2012
Guillermo Garrido Jean-Yves Delort Enrique Alfonseca Anselmo Peñas

In this paper, we describe the collection of a large structured dataset of temporally anchored relational data, obtained from the full revision history of the English Wikipedia. By mining (attribute, value) pairs from this revision history, we are able to collect a comprehensive, temporally-aware knowledge base that contains data on how attributes change over time. We discuss different characte...

2011
Toshiro Minami Eunja Kim

Seat Usage Data Analysis and Its Application for Library Marketing MDL: Metrics Definition Language p. 248 Natural Language Processing and Computational Linguistics A Statistical Global Feature Extraction Method for Optical Font Recognition p. 257 Domain N-Gram Construction and Its Application to Text Editor p. 268 Grounding Two Notions of Uncertainty in Modal Conditional Statements p. 278 Deve...

2014
Hao Ma Irwin King Michael R. Lyu

Barbier G (2012) Finding provenance data in social media. Doctoral dissertation, Arizona State University Facebook (2012) https://www.facebook.com/photo.php? fbid=268506716591158 & set=a.24724116871771349 889.247222755386221&type=3&theater. Accessed 2 Oct 2013 Leskovec J, Backstrom L, Kleinberg J (2009) Memetracking and the dynamics of the news cycle. In: Proceedings of the 15th ACM SIGKDD inte...

2011
Ling-Xiang Tang Daniel Cavanagh Andrew Trotman Shlomo Geva Yue Xu Laurianne Sitbon

At NTCIR-9, we participated in the cross-lingual link discovery (Crosslink) task. In this paper we describe our approaches to discovering Chinese, Japanese, and Korean (CJK) cross-lingual links for English documents in Wikipedia. Our experimental results show that a link mining approach that mines the existing link structure for anchor probabilities and relies on the “translation” using cross-l...

2010
Alexander Dekhtyar

• Data mining: the techniques, methods and algorithms for finding patterns in structured data. • Data warehousing: the methods and techniques for managing data and processing complex analytical decision-support queries in databases. • Information Retrieval: the techniques, methods, algorithms and data models for finding information in unstructured (primarily, but not always, textual) data. • Co...

Journal: :Journal of Machine Learning Research 2012
Tom De Smedt Walter Daelemans

Pattern is a package for Python 2.4+ with functionality for web mining (Google + Twitter + Wikipedia, web spider, HTML DOM parser), natural language processing (tagger/chunker, n-gram search, sentiment analysis, WordNet), machine learning (vector space model, k-means clustering, Naive Bayes + k-NN + SVM classifiers) and network analysis (graph centrality and visualization). It is well documente...

2012
Teemu Hynönen Sébastien Mahler Hannu Toivonen

We propose a method to mine novel, document-specific associations between terms in a collection of unstructured documents. We believe that documents are often best described by the relationships they establish. This is also evidenced by the popularity of conceptual maps, mind maps, and other similar methodologies to organize and summarize information. Our goal is to discover term relationships ...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید