Web Information Extraction Based on Visual Characteristics
Related resources
Visual Architecture based Web Information Extraction
ISSN 2250-107X | © 2011 Bonfring
Abstract: The World Wide Web has many online databases that can be searched through their web query interfaces. Deep Web content is accessed by submitting queries to Web databases, and the returned data records are wrapped in dynamically generated Web pages. Extracting structured data from deep Web pages is a challenging task due to the underlying complic...
Visual Web Information Extraction with Lixto
We present new techniques for supervised wrapper generation and automated web information extraction, and a system called Lixto implementing these techniques. Our system can generate wrappers which translate relevant pieces of HTML pages into XML. Lixto, of which a working prototype has been implemented, assists the user to semi-automatically create wrapper programs by providing a fully visual ...
Web-based Multimedia Information Extraction Based on Social Redundancy
Social networking sites are among the most frequently visited on the web (Cha et al. 2007) and their use has expanded into professional contexts for expertise sharing and knowledge discovery (Millen, Feinberg and Kerr 2006). These virtual communities can be enormous, with millions of users and shared resources. Social multimedia websites, such as YouTube, are particularly popular. Network traff...
Record-Level Information Extraction from a Web Page based on Visual Features
Web databases contain a huge amount of structured data that can be easily obtained, but only via their query interfaces. Query results are presented in dynamically generated web pages, usually in the form of data records, for human use. A decisive problem for web data integration applications is automatically extracting data records from query result pages, such as comparison shopping sites, meta...
Semantic Based Information Extraction from Web
Extracting information from the web is a challenging task. The information stored on the web may be structured or unstructured. Structured information provides enhanced knowledge that helps to retrieve relevant documents and helps the user to understand a particular domain. This paper explores the importance of information extraction using semantics. It enables the users to discover...
Journal
Journal title: Information Technology Journal
Year: 2012
ISSN: 1812-5638
DOI: 10.3923/itj.2012.408.413