Explorations in disambiguation using XML text representation
نویسنده
چکیده
In SENSEVAL-3, CL Research participated in four tasks: English all-words, English lexical sample, disambiguation of WordNet glosses, and automatic labeling of semantic roles. This participation was performed within the development of CL Research’s Knowledge Management System, which massively tags texts with syntactic, semantic, and discourse characterizations and attributes. This System is fully integrated with CL Research’s DIMAP dictionary maintenance software, which provides access to one or more dictionaries for disambiguation and representation. Our core disambiguation functionality, unchanged since SENSEVAL-2, performed at a level comparable to our previous performance. Our participation in the SENSEVAL-3 tasks was concerned primarily with text processing and representation issues and did not advance our disambiguation capabilities.
منابع مشابه
Word Sense Disambiguation Using Semantic Graph
This work describes a method of word sense disambiguation by finding similar words in a text. We have used some characteristic properties of the text and its constituent words for the disambiguation task. Using the WordNet, the algorithm constructs a semantic structure on the text illustrating the relations among the words of the text. This structure is then used for disambiguating the constitu...
متن کاملText Representation with WordNet Synsets using Soft Sense Disambiguation
Text information processing depends critically on the proper representation of texts. A common and naive way of representing a text is as a bag of its component words. This representation suffers primarily from two drawbacks, viz., polysemy and synonymy which arise because of the ambiguity of the words and the lack of information about the relations between the words. This paper presents a mode...
متن کاملخوشهبندی فراابتکاری اسناد فارسی اِکساِماِل مبتنی بر شباهت ساختاری و محتوایی
Due to the increasing number of documents, XML, effectively organize these documents in order to retrieve useful information from them is essential. A possible solution is performed on the clustering of XML documents in order to discover knowledge. Clustering XML documents is a key issue of how to measure the similarity between XML documents. Conventional clustering of text documents using a do...
متن کاملBuilding semantic trees from XML documents
The distributed nature of the Web, as a decentralized system exchanging information between heterogeneous sources, has underlined the need to manage interoperability, i.e., the ability to automatically interpret information in Web documents exchanged between different sources, necessary for efficient information management and search applications. In this context, XML was introduced as a data r...
متن کاملFeatures for Web Person Disambiguation
Entity disambiguation resolves the many to many correspondence between mentions of entities in text and unique real-world entities. Our entity disambiguation uses language-independent entity context to agglomeratively resolve mentions with similar names to unique entities. This paper describes our automatic entity disambiguation capability and assesses its performance on the second Web People S...
متن کامل