نتایج جستجو برای: web wrapper generation

تعداد نتایج: 567401  

Journal: :Data Knowl. Eng. 2007
Juan Raposo Alberto Pan Manuel Álvarez Justo Hidalgo

In order to let software programs gain full benefit from semi-structured web sources, wrapper programs must be built to provide a “machine-readable” view over them. A significant problem in this approach arises as Web sources may undergo changes that invalidate the current wrappers. In this paper, we present novel heuristics and algorithms to address this problem. In our approach the system col...

2002
Johan Petrini

A main memory object-relational database system, AMOS II, has been developed at Uppsala Database Laboratory (UDBL). The system provides common database facilities and a powerful query language but also, through it’s mediator-wrapper approach, features for the combination of data from heterogeneous data sources. The AMOS II query processor is extensible through a generalized foreign function mec...

2017
Waleed Ali

The problem of Web phishing attacks has grown considerably in recent years and phishing is considered as one of the most dangerous Web crimes, which may cause tremendous and negative effects on online business. In a Web phishing attack, the phisher creates a forged or phishing website to deceive Web users in order to obtain their sensitive financial and personal information. Several conventiona...

2012
Julien Wollbrett Pierre Larmande Manuel Ruiz

In recent years, a large amount of “-omics” data has been produced. However, these data are stored in many different species-specific databases that are managed by different institutes and laboratories. Biologists often need to find and assemble data from disparate sources to perform certain analyses. Searching for these data and assembling it is a time-consuming task. The Semantic Web helps to...

2005
Juan Raposo Alberto Pan Manuel Álvarez Justo Hidalgo

A substantial subset of the web data follows some kind of underlying structure. Nevertheless, HTML does not contain any schema or semantic information about the data it represents. A program able to provide software applications with a structured view of those semi-structured web sources is usually called a wrapper. Wrappers are able to accept a query against the source and return a set of stru...

2004
Theodore W. Hong Keith L. Clark

The wealth of information contained in the world-wide web has created much interest in systems for integrating information from multiple sites. We describe a universal wrapper machine that can learn to extract information from the web given only a set of general rules describing the data domain. It cleanly separates out site-independent and site-specific knowledge from execution implementation....

2010
Remi Senjaya

The number of data source on internet has increased in volume and type since the last decade, causing problems to query the data or information because of the diversity, dynamic and heterogeneity of the data source or information. Therefore, to simplify the task of obtaining information, several tools have been created for extracting the data from multiple web sources, including Wrapper. Wrappe...

Journal: :JASIST 2005
Chun-Nan Hsu Chia-Hui Chang Chang-Huain Hsieh Jiann-Jyh Lu Chien-Chi Chang

A variety of biological data is transferred and exchanged in overwhelming volumes on the World Wide Web. How to rapidly capture, utilize and integrate the information on the Internet to discover valuable biological knowledge is one of the most critical issues in bioinformatics. Many information integration systems have been proposed for integrating biological data. These systems usually rely on...

1998
Chun-Nan Hsu

This paper presents SoftMealy, a novel Web wrapper representation formalism. This representation is based on a finite-state transducer (FST) and contextual rules, which allow a wrapper to wrap semistructured Web pages containing missing attributes, multiple attribute values, variant attribute permutations, exceptions and typos, the features that no previous work can handle. A SoftMealy wrapper ...

2000
Aykut Firat Stuart E. Madnick Michael Siegel

The web is rapidly becoming the universal repository of information. A major challenge is the ability to support the effective flow of information among the sources and services on the web and their interconnection with legacy systems that were designed to operate with traditional relational databases. This paper describes a technology and infrastructure to address these needs, based on the des...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید