Learning Translation Rules for a Bidirectional English-Filipino Machine Translator

نویسندگان

  • Michelle Wendy Tan
  • Bryan Anthony Hong
  • Danniel Liwanag Alcantara
  • Amiel Perez
  • Lawrence Tan
چکیده

Filipino is a changing language that poses several challenges. Our goal is to develop a bidirectional English-Filipino Machine Translation (MT) system using a hybrid approach to learn rules from examples. The first phase was an English to Filipino MT system that required several language resources. The problem lies on its dependency over the annotated grammar which is currently unavailable for Filipino, which makes reverse translation impossible. Phase 2 addresses this limitation by using information taken from English and Filipino POS Taggers. The seed rules are generated by aligning the POS tags from English and Filipino examples, including their constraints. To perform compositionality, the system deduces the constituent labels by using the longest adjacent POS tags found in both the English and Filipino rule. The system groups together similar rules and generalizes it to encompass a wider range of unseen examples.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Template Extraction for a Bidirectional English-Filipino Machine Translation System

A bidirectional English-Filipino Example-based Machine Translation System that learns and uses templates is presented. The system uses machine learning techniques to initially extract templates from a given bilingual corpus. These templates are subsequently used for translating English input text into Filipino and vice versa. The system implements the similarity template learning algorithm perf...

متن کامل

TExt Translation: Template Extraction for a Bidirectional English-Filipino Example-Based Machine Translation

In this paper, we present TExt Translation, a bidirectional English-Filipino Example-based Machine Translation System that learns and uses templates. These templates are used for translating English input text into Filipino and vice versa. Minimal language resources and information are used since these resources are few and may contain errors. The system uses an untagged bilingual corpus, lexic...

متن کامل

Rule Extraction Applied in Language Translation

Machine translation (MT) has been used to address inherent problems from human translators. However, the quality of machine translations are usually unacceptable. Researches have focused on improving quality by incorporating machine learning for translation. An example of which is TWiRL which translates English to Filipino sentences. However, TWiRL’s approach presented a strict requirement of a...

متن کامل

Learning Translation Rules from Bilingual English - Filipino Corpus

Most machine translators are implemented using example based, rule based, and statistical approaches. However, each of these paradigms has its drawbacks. Example based and statistical based approaches are domain specific and requires a large database of examples to produce accurate translation results. Although rule based approach is known to produce high quality translations, a linguist is nec...

متن کامل

ESANA: Hybrid Machine Translation Approach for English-to-Sinhala Language Translation

In modern society Internet has become the most popular and efficient communication media. Most of the Internet resources are available on English language. However, English fluency rate in majority of the countries is not up to a satisfactory level. In Sri Lanka, it was observed that the English fluency rate has been reduced over past 30 years. Therefore the neediness of an English-to-Sinhala t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006