نتایج جستجو برای: webcrawler

تعداد نتایج: 35  

Journal: :D-Lib Magazine 1995
Narayanan Shivakumar Hector Garcia-Molina

Scenario 1 Your local publishing company Books'R'Us decides to publish on the Internet its latest book in an effort to cut down on printing costs and book distribution expenses. Customers pay for the digital books using sophisticated electronic payment mechanisms such as DigiCash, First Virtual or InterPay. When the payment is received, the book distribution server at Books'R'Us sends a digital...

Journal: :BMJ 2002
Heinke Kunst Diederik Groot Pallavi M Latthe Manish Latthe Khalid S Khan

involved in the conception and design of the study, collection of data on links, interpretation of data, and drafting of the paper. NQM carried out the statistical analysis. KKH, FCA, MIR, HMK, and REP were involved in interpreting the data and revised the paper for intellectual content. MAM and SES advised on methods and revised the paper for intellectual content. FM and EVB are the guarantors...

2006
William Aiello Andrei Z. Broder Jeannette C. M. Janssen Evangelos E. Milios

The Web as a text corpus Pages close in word vector space tend to be related Cluster hypothesis (van Rijsbergen 1979) The WebCrawler (Pinkerton 1994) The whole first generation of search engines weapons mass destruction p 1 p 2 Enter the Web's link structure Broder & al. 2000 p(i) = α N + (1 − α) j:j→i p(j) Text Links Meaning Connection between semantic topology (topicality or relevance) and li...

2014
Linnea Passing

The Resource Description Framework (RDF) is the de facto standard for representing semantic data, employed e.g., in the Semantic Web or in data-intense domains such as the Life Sciences. Data in the RDF format can be handled efficiently using relational database systems (RDBMSs), because decades of research in RDBMSs led to mature techniques for storing and querying data. Previous work merely f...

1998
I. Fortanet J. C. Palmer S. Posteguillo

We have endeavored to define a new emerging digital genre in the world of Internet : netvertising. We present an analysis divided into two stages. An initial survey of 20 randomly selected netads reveals that texts in this genre are formed by very brief sentences and noun phrases, the systematic use of imperative and simple present tenses, second person personal pronouns, a higher use of punctu...

1998
Michelle Q Wang

We describe a set of techniques that allows users to interact with results at a higher level than the citation level, even when those results come from a variety of heterogeneous on-line search services. We believe that interactive result analysis allows users to “make sense” out of the potentially many results that may match the constraints they have supplied to the search services. The inspir...

2005
Elwin Chai Rick Jones Zachary Ives

In this paper, we explore the possibility of creating a product search engine that is able to dynamically find commercial sites, independent of merchant feeds and other human involvement in the management of internal databases. We evaluate briefly the constraints of current shopping search engines and the benefits of offering a fully automated version. In addition, we consider the application o...

1997
Alan F. Smeaton Francis Crimmins

A fully operational large scale digital library is likely to be based on a distributed architecture and because of this it is likely that a number of independent search engines may be used to index different overlapping portions of the entire contents of the library. In any case, different media, text, audio, image, etc., will be indexed for retrieval by different search engines so techniques w...

2009
Xiannong Meng

This chapter reports the results of a project attempting to assess the performance of a few major search engines from various perspectives. The search engines involved in the study include the Microsoft Search Engine (MSE) when it was in its beta test stage, AllTheWeb, and Yahoo. In a few comparisons, other search engines such as Google, Vivisimo are also included. The study collects statistics...

Journal: :Library Trends 1999
Bipin C. Desai Rajjan Shinghal Nader Shayan Youquan Zhou

THISARTICLE DPSCRIBES A SYSTEM CALLED CINDI for cataloging and searching documents in a distributed virtual library. Mihen putting a document in the library, the author provides and registers metadata in the form of a semantic header for the document. The semantic header contains information on both the syntactic and semantic content of the document. An expert system simulating the expertise of...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید