wikipedia mining

نتایج جستجو برای: wikipedia mining

تعداد نتایج: 92181 فیلتر نتایج به سال:

Mining a multilingual association dictionary from Wikipedia for cross-language information retrieval

Journal: :JASIST 2012

Zheng Ye Xiangji Huang Ben He Hongfei Lin

The Wikipedia is characterized by its dense link structure and a huge amount of articles in different languages, which make it a notable Web corpus for knowledge extraction and mining, in particular for mining the multilingual associations. In this paper, motivated by a psychological theory of word meaning, we propose a graphbased approach to constructing a cross-language association dictionary...

متن کامل

Wikipedia matters

Journal: :Journal of Economics and Management Strategy 2021

We document a causal impact of online user-generated information on real-world economic outcomes. In particular, we conduct randomized field experiment to test whether additional content Wikipedia pages about cities affects tourists' choices overnight visits. Our treatment adding increases stays in treated compared nontreated cities. The is largely driven by improvements shorter and relatively ...

متن کامل

Large SMT data-sets extracted from Wikipedia

2014

Dan Tufis

The article presents experiments on mining Wikipedia for extracting SMT useful sentence pairs in three language pairs. Each extracted sentence pair is associated with a cross-lingual lexical similarity score based on which, several evaluations have been conducted to estimate the similarity thresholds which allow the extraction of the most useful data for training three-language pairs SMT system...

متن کامل

Citing Wikipedia

Journal: :BMJ 2014

متن کامل

Wikipedia mining of hidden links between political leaders

Journal: :CoRR 2016

Klaus M. Frahm Katia Jaffrès-Runser Dima Shepelyansky

We describe a new method of reduced Google matrix which allows to establish direct and hidden links between a subset of nodes of a large directed network. This approach uses parallels with quantum scattering theory, developed for processes in nuclear and mesoscopic physics and quantum chaos. The method is applied to the Wikipedia networks in different language editions analyzing several groups ...

متن کامل

Mining for Practices in Community Collections: Finds From Simple Wikipedia

2008

Matthijs den Besten Alessandro Rossi Loris Gaio Max Loubser Jean-Michel Dalle

The challenges of commons based peer production are usually associated with the development of complex software projects such as Linux and Apache. But the case of open content production should not be treated as a trivial one. For instance, while the task of maintaining a collection of encyclopedic articles might seem negligible compared to the one of keeping together a software system with its...

متن کامل

Aisles through the Category Forest - Utilising the Wikipedia Category System for Corpus Building in Machine Learning

2007

Rüdiger Gleim Alexander Mehler Matthias Dehmer Olga Pustylnikov

The Word Wide Web is a continuous challenge to machine learning. Established approaches have to be enhanced and new methods be developed in order to tackle the problem of finding and organising relevant information. It has often been motivated that semantic classifications of input documents help solving this task. But while approaches of supervised text categorisation perform quite well on gen...

متن کامل

Morbus Wikipedia

Journal: :MMW - Fortschritte der Medizin 2010

متن کامل

3D Wikipedia

Journal: :ACM Transactions on Graphics 2013

متن کامل

Exploring Wikipedia and DMoz as Knowledge Bases for Engineering a User Interests Hierarchy for Social Network Applications

2009

Mandar Haridas Doina Caragea

The outgrowth of social networks in the recent years has resulted in opportunities for interesting data mining problems, such as interest or friendship recommendations. A global ontology over the interests specified by the users of a social network is essential for accurate recommendations. We propose, evaluate and compare three approaches to engineering a hierarchical ontology over user intere...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید