A Qualitative Representation and Similarity Measurement Method in Geographic Information Retrieval
Abstract
Modern geographic information retrieval technology is based on quantitative models and methods. The semantic information in web documents and queries cannot be effectively represented, which leads to information loss or misunderstanding, so the results are either unreliable or inconsistent. A new qualitative approach is thus proposed for supporting geographic information retrieval, based on qualitative representation, semantic matching, and qualitative reasoning. A qualitative representation model and the corresponding similarity measurement method are defined. Information in documents and user queries is represented using propositional logic, which considers thematic and geographic semantics together. Thematic information is represented as thematic propositions on the basis of a domain ontology. Similarly, spatial information is represented as geo-spatial propositions with the support of a geographic knowledge base. The similarity is then divided into thematic similarity and spatial similarity. The former is calculated from the weighted distance between proposition keywords in the domain ontology, and the latter is further divided into conceptual similarity and spatial similarity. Because documents and queries are represented by propositions and information units, the similarity measurement can use evidence theory and fuzzy logic to combine all sub-similarities into the final similarity between documents and queries. This retrieval method is mainly used to retrieve qualitative geographic information to support semantic matching and result ranking. It does not deal with geometric computation and is consistent with human commonsense cognition, and thus can improve the efficiency of geographic information retrieval technology.
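To make the thematic-similarity step more concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes a toy ontology with hand-picked edge weights, a simple `1 / (1 + distance)` mapping from weighted distance to similarity, and a plain fuzzy minimum as a stand-in for the evidence-theory and fuzzy-logic combination mentioned in the abstract. All names (`ONTOLOGY`, `weighted_distance`, `thematic_similarity`, `combine_fuzzy_and`) and all weights are hypothetical.

```python
# Hypothetical toy ontology: child -> (parent, edge weight).
# Terms and weights are illustrative assumptions, not taken from the paper.
ONTOLOGY = {
    "river": ("water_body", 1.0),
    "lake": ("water_body", 1.0),
    "water_body": ("natural_feature", 2.0),
    "forest": ("natural_feature", 2.0),
}


def path_to_root(term):
    """Return {node: accumulated edge weight} for term and all its ancestors."""
    costs = {term: 0.0}
    total = 0.0
    while term in ONTOLOGY:
        parent, weight = ONTOLOGY[term]
        total += weight
        costs[parent] = total
        term = parent
    return costs


def weighted_distance(a, b):
    """Weighted path length between a and b via their closest common ancestor."""
    costs_a, costs_b = path_to_root(a), path_to_root(b)
    common = set(costs_a) & set(costs_b)
    if not common:
        return float("inf")  # unrelated terms
    return min(costs_a[n] + costs_b[n] for n in common)


def thematic_similarity(a, b):
    """Assumed mapping: 1 for identical terms, approaching 0 as distance grows."""
    return 1.0 / (1.0 + weighted_distance(a, b))


def combine_fuzzy_and(thematic, spatial):
    """Fuzzy conjunction (minimum t-norm) of two sub-similarities."""
    return min(thematic, spatial)


if __name__ == "__main__":
    t = thematic_similarity("river", "lake")
    print(t, combine_fuzzy_and(t, 0.8))
```

The minimum t-norm is only one possible combination rule; a weighted average or Dempster's rule of combination would slot into `combine_fuzzy_and` in the same way.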
Similar resources
Approaches to Semantic Similarity Measurement for Geo-Spatial Data: A Survey
Semantic similarity is central for the functioning of semantically enabled processing of geospatial data. It is used to measure the degree of potential semantic interoperability between data or different geographic information systems (GIS). Similarity is essential for dealing with vague data queries, vague concepts or natural language and is the basis for semantic information retrieval and int...
Semantic Similarity Measurement and Geospatial Applications
With the increasing amount of geographic information available on the Internet, searching, browsing, and organizing such information has become a major challenge within the field of Geographic Information Science (GIScience). As all information is ultimately for and from human beings, the methodologies applied to retrieve and organize this information should correlate with human similarity judg...
A Hybrid Semantic Similarity Measure for Spatial Information Retrieval
Semantic similarity is central to many cognitive processes and plays an important role in the way humans process and reason about information. In particular, the retrieval of knowledge from memory hinges crucially on similarity. Likewise, information retrieval systems use similarity to detect relevant information for a given query. Current information retrieval systems apply mainly syntactic te...
A New Similarity Measure Based on Item Proximity and Closeness for Collaborative Filtering Recommendation
Recommender systems utilize information retrieval and machine learning techniques for filtering information and can predict whether a user would like an unseen item. User similarity measurement plays an important role in collaborative filtering based recommender systems. In order to improve accuracy of traditional user based collaborative filtering techniques under new user cold-start problem a...
Learning Ranking Functions for Geographic Information Retrieval Using Genetic Programming
Geographic Information Retrieval (GIR) has emerged as a new and promising tool for representation, storage, organisation of and access to geographic information. One of the current issues in GIR research is ranking of retrieved documents by both textual and geographic similarity measures. This paper describes an approach that learns GIR ranking functions using Genetic Programming (GP) methods b...
Journal title:
- CoRR
Volume abs/1311.4644
Publication date 2013