Querying Semantically Tagged Documents on the World-Wide Web

Authors

  • Ziv Bar-Yossef
  • Yaron Kanza
  • Yakov A. Kogan
  • Werner Nutt
  • Yehoshua Sagiv

Abstract

QUEST is a system for Querying Semantically Tagged documents on the World-Wide Web. The advent of new markup languages, such as XML, facilitates the authoring of Web documents that contain not only HTML tags instructing a browser how to display a document, but also objects that represent the document's semantic structure. When such documents become widely available, more powerful methods for accessing and querying information on the Web will become possible. The QUEST system was designed and implemented for querying and manipulating documents written in the markup language OHTML, which combines HTML with objects of the OEM data model. QUEST has several new features. First, it can query a combination of hypertext and object structures. Second, the results of queries are OHTML pages and are thus of the same type as the data being queried. Third, QUEST implements a new approach to querying semistructured data that produces meaningful answers even when the input data is incomplete, i.e., when some variables of the query cannot be bound to database values. Finally, the experience of developing and using QUEST for querying semantic documents on the Web can be useful for the design and implementation of query languages for XML. This paper provides an overview of the QUEST system and its components.
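
The idea of returning answers even when some query variables cannot be bound can be illustrated with a small sketch. This is not QUEST's query language or algorithm, which the abstract does not specify; it is a minimal Python illustration under stated assumptions. The nested-dictionary data, the helper names `follow` and `run_query`, and the `required` parameter are all hypothetical.

```python
# Illustrative only: a toy stand-in for semistructured (OEM-like) data.
# All names and values here are hypothetical and not taken from QUEST.
books = [
    {"title": "Book A", "author": {"name": "Author X"}},
    {"title": "Book B"},  # incomplete: no author recorded
]


def follow(obj, path):
    """Follow a list of edge labels from obj; return None if a label is missing."""
    current = obj
    for label in path:
        if not isinstance(current, dict) or label not in current:
            return None
        current = current[label]
    return current


def run_query(objects, variables, required):
    """Bind each variable to the value found at its path.

    A result is kept as long as every variable in `required` is bound;
    other variables may stay unbound (None) instead of discarding the result.
    """
    results = []
    for obj in objects:
        binding = {var: follow(obj, path) for var, path in variables.items()}
        if all(binding[var] is not None for var in required):
            results.append(binding)
    return results


variables = {"t": ["title"], "a": ["author", "name"]}

# Tolerant evaluation: only the title must be bound.
partial = run_query(books, variables, required={"t"})

# Strict evaluation: every variable must be bound.
strict = run_query(books, variables, required={"t", "a"})

print(partial)
print(strict)
```

Under the strict evaluation the incomplete record is dropped, whereas the tolerant evaluation keeps it with the author variable left unbound. That contrast is the behaviour the abstract describes as producing meaningful answers from incomplete input.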


Related Resources

Towards Semantic Web Engineering: WEESA - Mapping XML Schema to Ontologies

The existence of semantically tagged Web pages is crucial to bring the Semantic Web to life. But it is still costly to develop and maintain Web applications that offer data and meta-data. Several standard Web engineering methodologies exist for designing and implementing Web applications. In this paper we introduce a technique to extend existing Web engineering techniques to develop semanticall...


A Type System for Querying XML Documents

In the last few years, the trend of publishing and sharing information on the World Wide Web caused much of the existing electronic data to lie outside of database management systems in the form of so-called Web documents. This process was further eased by the introduction of the eXtensible Markup Language (XML) by the World Wide Web Consortium (W3C) [1], which provided a standard format for We...


Automatic Acquisition of Named Entity Tagged Corpus from World Wide Web

In this paper, we present a method that automatically constructs a Named Entity (NE) tagged corpus from the web to be used for training Named Entity Recognition systems. We use an NE list and a web search engine to collect web documents which contain the NE instances. The documents are refined through sentence separation and text refinement procedures and NE instances are finally tagged wit...


Persistent Storage and Querying of Compressed XML Documents on the Web

We describe the design and implementation of a Web-based distributed system called TREESTORE, intended for storing compressed XML documents in a relational database. The use of a database is fully portable, requiring minimal changes to application code to substitute one database management system for another. In TREESTORE, compressed XML documents are shredded into a fixed number of relational ...


Annotation Semantique de Documents Semi-Structurés pour la recherche d'information. (Semantic Annotation of Semi-structured Documents for Information Retrieval)

The semantic web is defined by a set of methods and technologies enabling software agents to reason about the contents of Web resources. This vision of the Web depends on the construction of ontologies and the use of metadata to represent these resources. The objective of our thesis is to annotate semantically tagged documents related to a domain of interest. These documents may...



Publication year: 1999