wikipedia mining

Search results for: wikipedia mining

Number of results: 92181

Understanding the Query: THCIB and THUIS at NTCIR-10 Intent Task

2013

Junjun Wang, Guoyu Tang, Yunqing Xia, Qiang Zhou, Thomas Fang Zheng, Qinan Hu, Sen Na, Yaohai Huang

Understanding the intent underlying a search query has recently attracted enormous research interest. Two challenging issues are worth noting. First, the words within a query are usually ambiguous, while the query is in most cases too short to disambiguate them. Second, the ambiguity in some cases cannot be resolved from the limited query context alone. It is thus demanded that the ambiguity be resolved/analyzed w...


Crowdsourcing a Personalized Learning Environment for Mathematics

2010

Joseph Corneli

Can we build a semantically adaptive personal learning environment that helps people learn mathematics and that meets reasonable criteria for sustainable growth and development? This is a question that applies at the interface between participatory social media and interactive, adaptive “knowledge media”. Ten years ago, no one had heard of Wikipedia. Perhaps in another ten, P2PU will be as popul...


Hierarchical Triadic Context Analysis for Folksonomy-Based Web Applications

Journal: JDCTA 2008

Suk-hyung Hwang, Yu-Kyung Kang

Services such as Wikipedia, Flickr, Technorati, Yahoo!, YouTube, and del.icio.us appear in many web applications and employ a folksonomy as their social tagging mechanism, in which users assign tags to resources and share them with each other within their community. As the number of folksonomy-based systems increases, proper data mining approaches to folksonomies are necessary to better under...


Wikipedia-based Unsupervised Query Classification

2013

Milen Kouylekov, Luca Dini, Alessio Bosca, Marco Trevisan

In this paper we present an unsupervised approach to Query Classification. The approach exploits the Wikipedia encyclopedia as a corpus and the statistical distribution of terms, from both the category labels and the query, in order to select an appropriate category. We have created a classifier that works with 55 categories extracted from the search section of the Bridgeman Art Library website...
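The general idea in this abstract, choosing a category by comparing the distribution of terms in the query with the terms associated with each category label, can be illustrated with a minimal sketch. This is not the authors' classifier: the scoring function (cosine similarity over raw term counts), the three category labels, and the short texts standing in for Wikipedia-derived text are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def classify(query, category_texts):
    """Return the best-scoring category label, or None if no terms overlap."""
    query_vec = Counter(tokenize(query))
    scores = {
        label: cosine(query_vec, Counter(tokenize(text)))
        for label, text in category_texts.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


# Hypothetical toy data: in a real system the text for each label would be
# gathered from a corpus such as Wikipedia rather than written by hand.
CATEGORY_TEXTS = {
    "Portraits": "portrait painting person face sitter artist",
    "Landscapes": "landscape painting mountains river valley scenery",
    "Architecture": "architecture building cathedral design structure",
}

if __name__ == "__main__":
    for q in ["painting river valley", "gothic cathedral", "quantum physics"]:
        print(q, "->", classify(q, CATEGORY_TEXTS))
```

No labeled training queries are needed, which is what makes such an approach unsupervised; the quality of the result depends entirely on how well the text gathered for each label covers the vocabulary of real queries.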


Exploring Wikipedia's Category Graph for Query Classification

2011

Milad Alemzadeh, Richard Khoury, Fakhri Karray

Wikipedia’s category graph is a network of 400,000 interconnected category labels, and can be a powerful resource for many classification tasks. However, its size and the lack of order can make it difficult to navigate. In this paper, we present a new algorithm to efficiently explore this graph and discover accurate classification labels. We implement our algorithm as the core of a query classi...


Short Text Conceptualization Using a Probabilistic Knowledgebase

2011

Yangqiu Song, Haixun Wang, Zhongyuan Wang, Hongsong Li, Weizhu Chen

Most text mining tasks, including clustering and topic detection, are based on statistical methods that treat text as bags of words. Semantics in the text is largely ignored in the mining process, and mining results often have low interpretability. One particular challenge faced by such approaches lies in short text understanding, as short texts lack enough content from which statistical conclu...


Inducing Distant Supervision in Suggestion Mining through Part-of-Speech Embeddings

Journal: CoRR 2017

Sapna Negi, Paul Buitelaar

Mining suggestion-expressing sentences from a given text is a less investigated sentence classification task, and it therefore lacks hand-labeled benchmark datasets. In this work, we propose and evaluate two approaches for distant supervision in suggestion mining. The distant supervision is obtained through a large silver standard dataset, constructed using the text from wikiHow and Wikipedia. Bot...


Wikipedia

Journal: Television & New Media 2012


WikEye – Using Magic Lenses to Explore Georeferenced Wikipedia Content

2007

Brent Hecht, Michael Rohs, Johannes Schöning, Antonio Krüger

Traditional paper-based maps are still superior in several ways to their digital counterparts used on mobile devices. Namely, paper-based maps provide high-resolution, large-scale information with zero power consumption. Digital maps offer personalized and dynamic information, but suffer from small outer scales and low resolutions. In this paper, we present WikEye, an interdisciplinary project t...


Unsupervised comparable corpora preparation and exploration for bi-lingual translation equivalents

Journal: CoRR 2015

Krzysztof Wolk, Krzysztof Marasek

The multilingual nature of the world makes translation a crucial requirement today. Parallel dictionaries constructed by humans are a widely available resource, but they are limited and do not provide enough coverage for good-quality translation, due to out-of-vocabulary words and neologisms. This motivates the use of statistical translation systems, which are unfortunately dependent o...

