Visual Architecture based Web Information Extraction
Similar resources
Visual Architecture based Web Information Extraction
ISSN 2250-107X | © 2011 Bonfring
Abstract: The World Wide Web hosts a growing number of online Web databases that can be searched through their Web query interfaces. Deep Web content is accessed through queries submitted to Web databases, and the returned data records are wrapped in dynamically generated Web pages. Extracting structured data from deep Web pages is a challenging task due to the underlying complic...
Visual Web Information Extraction with Lixto
We present new techniques for supervised wrapper generation and automated web information extraction, and a system called Lixto implementing these techniques. Our system can generate wrappers which translate relevant pieces of HTML pages into XML. Lixto, of which a working prototype has been implemented, assists the user to semi-automatically create wrapper programs by providing a fully visual ...
Semantic Based Information Extraction from Web
Extracting information from the Web is a challenging task. Information stored on the Web may be structured or unstructured. Structured information provides enhanced knowledge that helps retrieve relevant documents and helps the user understand a particular domain. This paper explores the importance of information extraction using semantics. It enables the users to discover...
Information Architecture and Web-Based Instruction
This poster session presents the application of principles of information architecture to Web-based instruction. Information architecture is the structuring of data to meet the informational and management requirements of an organization or group of people. The discipline of information architecture is informed by principles of information theory and communication theory, information design, an...
Extraction of Web Image Information: Semantic or Visual Cues?
Text-based approaches to web image information retrieval have been exploited for many years; however, the noisy textual content of web pages makes their task challenging. Moreover, text-based systems that retrieve information from textual sources such as image file names, anchor texts, existing keywords and, of course, surrounding text often share the inability to correctly assign all relev...
Journal
Journal title: Bonfring International Journal of Data Mining
Year: 2011
ISSN: 2250-107X, 2277-5048
DOI: 10.9756/bijdm.i1002