نتایج جستجو برای: crawler

تعداد نتایج: 1856  

Journal: :International Journal on Web Service Computing 2015

2017
Donglin Jiang DONGLIN JIANG

In order to develop a mechanical simulation experiment, a dynamic simulation of crawler excavator walking mechanism is designed. The performance of walking mechanism of excavator is tested by dynamic simulation. According to the data analysis, it is known that crawler walking mechanism is an important part of mine excavator and a walking device with better adaptability than wheeled walking mech...

Journal: :Computer Standards & Interfaces 2016
Ali Seyfi Ahmed Patel Joaquim Celestino

Indexing the Web is becoming a laborious task for search engines as the Web exponentially grows in size and distribution. Presently, the most effective known approach to overcome this problem is the use of focused crawlers. A focused crawler applies a proper algorithm in order to detect the pages on the Web that relate to its topic of interest. For this purpose we proposed a custom method that ...

Journal: :Data Knowl. Eng. 2009
Sotiris Batsakis Euripides G. M. Petrakis Evangelos E. Milios

This work addresses issues related to the design and implementation of focused crawlers. Several variants of state-of-the-art crawlers relying on web page content and link information for estimating the relevance of web pages to a given topic are proposed. Particular emphasis is given to crawlers capable of learning not only the content of relevant pages (as classic crawlers do) but also paths ...

2017
Soumick Chatterjee Asoke Nath

World Wide Web is an ever-growing public library with hundreds of millions of books without any central management system. Finding a piece of information without a proper directory is like finding a middle in a haystack. Various search engines solve this problem by indexing an amount of the complete content that is available in the internet. For accomplishing this job, search engines use an aut...

2008
Pabitra Mitra

The work describes the design of the focused crawler for Intinno, an intelligent web based content management system. Intinno system aims to circumvent the drawbacks of existing learning management systems in terms of scarcity of content which often leads to the cold start problem. The scarcity problem is solved by using a focused crawler to mine educational content from the web. Educational co...

2006
Leigh Dodds

This paper introduces “Slug” a web crawler (or “Scutter”) designed for harvesting semantic web content. Implemented in Java using the Jena API, Slug provides a configurable, modular framework that allows a great degree of flexibility in configuring the retrieval, processing and storage of harvested content. The framework provides an RDF vocabulary for describing crawler configurations and colle...

2004
Alexandros M. Grigoriadis Georgios Paliouras

This paper deals with the problem of constructing an intelligent Focused Crawler, i.e. a system that is able to retrieve documents of a specific topic from the Web. The crawler must contain a component which assigns visiting priorities to the links, by estimating the probability of leading to a relevant page in the future. Reinforcement Learning was chosen as a method that fits this task nicely...

2010
Debashis Hati Amritesh Kumar Lizashree Mishra

Vertical search engines use focused crawler as their key component and develop some specific algorithms to select web pages relevant to some pre-defined set of topics. Crawlers are software which can traverse the internet and retrieve web pages by hyperlinks. The focused crawler of a special-purpose search engine aims to selectively seek out pages that are relevant to a pre-defined set of topic...

2004
Martin Ester Hans-Peter Kriegel Matthias Schubert

Focused web crawlers have recently emerged as an alternative to the well-established web search engines. While the well-known focused crawlers retrieve relevant webpages, there are various applications which target whole websites instead of single webpages. For example, companies are represented by websites, not by individual webpages. To answer queries targeted at websites, web directories are...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید