Interpreting Xml Keyword Query Using Hidden Markov Model

نویسندگان

  • Xiping Liu
  • Changxuan Wan
  • Dexi Liu
چکیده

Original scientific paper Keyword search on XML database has attracted a lot of research interests. As XML documents are very different from flat documents, effective search of XML documents needs special considerations. Traditional bag-of-words model does not take the roles of keywords and the relationship between keywords into consideration, and thus is not suited for XML keyword search. In this paper, we present a novel model, called semi-structured keyword query (SSQ), which understands a keyword query in a different way: a keyword query is composed of several query units, where each unit represents query condition. To interpret a keyword query under this model, we take two steps. First, we propose a probabilistic approach based on a Hidden Markov Model for computing the best mapping of the query keywords into the database terms, i.e., elements, attributes and values. Second, we generate SSQs based on the mapping. Experimental results verify the effectiveness of our methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Hidden Markov Model Approach to Keyword-Based Search over Relational Databases

We present a novel method for translating keyword queries over relational databases into SQL queries with the same intended semantic meaning. In contrast to the majority of the existing keyword-based techniques, our approach does not require any a-priori knowledge of the data instance. It follows a probabilistic approach based on a Hidden Markov Model for computing the top-K best mappings of th...

متن کامل

An Efficient Query Mining Framework Using Spatial Hidden Markov Models for Automatic Annotation of Images

A novel method for automatic annotation of images is used with keywords from a generic vocabulary of concepts or objects combined with annotation-based retrieval of images. This can be done by using spatial hidden Markov model, in which states represent concepts. The parameters of this model are estimated from a set of manually annotated training images. An image in a large test collection is t...

متن کامل

Applying Clinical Ontology for Biomedical Information Retrieval

Information retrieval (IR) is a technology to help people find information, however it is hard to formulate users’ information need into just a few words. In the biomedical area, if people are not familiar with specific domain knowledge, they may use inappropriate query keywords and then get non-relevant query results. To solve this problem we adopted knowledge ontology in the IR process. In th...

متن کامل

Query Segmentation and Resource Disambiguation Leveraging Background Knowledge

Accessing the wealth of structured data available on the Data Web is still a key challenge for lay users. Keyword search is the most convenient way for users to access information (e.g., from data repositories). In this paper we introduce a novel approach for determining the correct resources for user-supplied keyword queries based on a hidden Markov model. In our approach the user-supplied que...

متن کامل

Spoken Web Search using an Ergodic Hidden Markov Model of Speech

An ergodic hidden Markov model (EHMM) of speech can be trained in an unsupervised manner using unlabeled speech. A keyword spotting system has been developed where the queries and test observations are represented as sequences of states of the EHMM. A graphical keyword model is built by aggregating multiple instances of a query or by using mappings between phonemes and states of the EHMM. A mod...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016