Deriving Semantic Knowledge from Descriptive Texts Using an MT System
نویسندگان
چکیده
This paper describes the results of a feasibility study which focused on deriving semantic networks from descriptive texts using controlled language. The KANT system 3, 6] was used to analyze input paragraphs, producing sentence-level interlingua representations. The in-terlinguas were merged to construct a paragraph-level representation, which was used to create a semantic network in Conceptual Graph (CG) 1] format. The interlinguas are also translated (using the KANTOO generator) into OWL statements for entry into the Ontology Works electrical power factbase 9]. The system was extended to allow simple querying in natural language.
منابع مشابه
Presenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
متن کاملContextual Bitext-Derived Paraphrases in Automatic MT Evaluation
In this paper we present a novel method for deriving paraphrases during automatic MT evaluation using only the source and reference texts, which are necessary for the evaluation, and word and phrase alignment software. Using target language paraphrases produced through word and phrase alignment a number of alternative reference sentences are constructed automatically for each candidate translat...
متن کاملThe experimental MT System of the Project KIT-FAST
Within the project KIT-FAST an experimental machine translation (MT) system has been developed and implemented, which translates written German texts into English. For that reason a syntactic, semantic and aspects of a conceptual level of representation have been realized. In general each level has three dimensions, which are a sentence and a text representation, which are constructed during tr...
متن کاملWord Sense Disambiguation: Why Statistics When We Have These Numbers?
Word sense disambiguation continues to be a di cult problem in machine translation (MT). Current methods either demand large amounts of corpus data and training or rely on knowledge of hard selectional constraints. In either case, the methods have been demonstrated only on a small scale and mostly in isolation, where disambiguation is a task by itself. It is not clear that the methods can be sc...
متن کاملImproved Statistical Machine Translation Using Monolingually-Derived Paraphrases
Untranslated words still constitute a major problem for Statistical Machine Translation (SMT), and current SMT systems are limited by the quantity of parallel training texts. Augmenting the training data with paraphrases generated by pivoting through other languages alleviates this problem, especially for the so-called “low density” languages. But pivoting requires additional parallel texts. We...
متن کامل