نتایج جستجو برای: wrapper approach

تعداد نتایج: 1291639  

Journal: :CoRR 2011
Emilio Ferrara Robert Baumgartner

Information distributed through the Web keeps growing faster day by day, and for this reason, several techniques for extracting Web data have been suggested during last years. Often, extraction tasks are performed through so called wrappers, procedures extracting information from Web pages, e.g. implementing logic-based techniques. Many fields of application today require a strong degree of rob...

1998
Ion Muslea Steve Minton Craig Knoblock

Information mediators are systems capable of providing a unified view of several information sources. Central to any mediator that accesses Web-based sources is a set of wrappers that can extract relevant information from Web pages. In this paper, we present a wrapper-induction algorithm that generates extraction rules for Web-based information sources. We introduce landmark automata, a formali...

Journal: :IEEE Trans. Computers 2002
Vikram Iyengar Krishnendu Chakrabarty

ÐSystem-on-a-chip (SOC) designs present a number of unique testability challenges to system integrators. Test access to embedded cores often requires dedicated test access mechanisms (TAMs). We present an improved approach for designing efficient TAMs and investigate the problems of improved deserialization of test data in the core wrapper, optimal test bus sizing, and optimal assignment of cor...

2011
Lena Scheubert Rainer Schmidt Dirk Repsilber Mitja Luštrek Georg Fuellen

Pluripotent stem cells are able to self-renew, and to differentiate into all adult cell types. Many studies report data describing these cells, and characterize them in molecular terms. Machine learning yields classifiers that can accurately identify pluripotent stem cells, but there is a lack of studies yielding minimal sets of best biomarkers (genes/features). We assembled gene expression dat...

2013
Abdolreza Rashno Hossein SadeghianNejad Abed Heshmati

Automatic speaker verification (ASV) systems are among the biometric systems used in security and telephone-based remote control applications. Recent years have witnessed an increasing trend in research on such systems. These systems usually use high dimension feature vectors and therefore involve high complexity. However, there is a general belief that many of the features used in such systems...

Journal: :JASIST 2005
Chun-Nan Hsu Chia-Hui Chang Chang-Huain Hsieh Jiann-Jyh Lu Chien-Chi Chang

A variety of biological data is transferred and exchanged in overwhelming volumes on the World Wide Web. How to rapidly capture, utilize and integrate the information on the Internet to discover valuable biological knowledge is one of the most critical issues in bioinformatics. Many information integration systems have been proposed for integrating biological data. These systems usually rely on...

2001
Peter Popov Steve Riddle Alexander Romanovsky Lorenzo Strigini

Off-the-shelf (OTS) components are increasingly used in application areas with stringent dependability requirements. Component wrapping is a well known structuring technique used in many areas. We propose a general approach to developing protective wrappers that assist in integrating OTS items with a focus on the overall system dependability. The wrappers are viewed as redundant software employ...

2002
Chia-Hui Chang Shih-Chien Kuo Kuo-Yu Huang Tsung-Hsin Ho Chih-Lung Lin

TheWorld WideWeb is now undeniably the richest and most dense source of information, yet its structure makes it diÆcult to make use of that information in a systematic way. This paper extends a pattern discovery approach called IEPAD to the rapid generation of information extractors that can extract structured data from semi-structured Web documents. IEPAD is proposed to automate wrapper genera...

2003
Georgios Sigletos Georgios Paliouras Constantine D. Spyropoulos Michael Hatzopoulos

This paper presents a novel method for extracting information from collections of Web pages across different sites. Our method uses a standard wrapper induction algorithm and exploits named entity information. We introduce the idea of post-processing the extraction results for resolving ambiguous facts and improve the overall extraction performance. Postprocessing involves the exploitation of t...

2009
Charalampos E. Tsourakakis Georgios Paliouras

Web wrappers play an important role in extracting information from distributed web sources and subsequently in the integration of heterogeneous data. Changes in the layout of web sources typically break the wrapper, leading to erroneous extraction of infomation. Monitoring and repairing broken wrappers is an important hurdle for data integration, since it is an expensive and painful procedure. ...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید