Search results for: trilingual

Number of results: 263

Journal: Silk Road: A Journal of Eurasian Development 2020

2016
Utsab Barman Joachim Wagner Jennifer Foster

Multilingual users of social media sometimes use multiple languages during conversation. Mixing multiple languages in content is known as code-mixing. We annotate a subset of a trilingual code-mixed corpus (Barman et al., 2014) with part-of-speech (POS) tags. We investigate two state-of-the-art POS tagging techniques for code-mixed content and combine the features of the two systems to build a ...

2006
Alicia Pérez M. Inés Torres V. Guijarrubia Francisco Casacuberta

The goals of this work are, on the one hand, to develop a trilingual corpus suitable for example-based text and speech-input machine translation between English, Spanish and Basque; and on the other hand, to compare the same translation method for two very different pairs of languages: Spanish-Basque and Spanish-English. The stochastic finite-state transducers have been trained from raw bil...

2017
Dmitrijs Milajevs

Twitter has become a rich source of linguistic data. Here, the possibility of building a trilingual Latvian-Russian-English corpus of tweets from Riga, Latvia is investigated. Such a corpus, once constructed, might be of great use for multiple purposes including training machine translation models, examining cross-lingual phenomena and studying the population of Riga. This pilot study shows that...

2004
Maja Popovic Hermann Ney

In this paper we present methods for improving the quality of translation from an inflected language into English by making use of part-of-speech tags and word stems and suffixes in the source language. Results for translations from Spanish and Catalan into English are presented on the LC-STAR trilingual corpus which consists of spontaneously spoken dialogues in the domain of travelling and app...

2008
Rafael E. Banchs Haizhou Li

This paper presents some statistical machine translation results among English, Spanish and Chinese, and focuses on exploring Spanish-morphology effects on the Chinese to Spanish translation task. Although not strictly comparable, it is observed that by reducing Spanish morphology the accuracy achieved in the Chinese to Spanish translation task becomes comparable to the one achieved in the Chin...

2009
M C Padma

In a multi-script, multi-lingual environment, a document may contain text lines in more than one script or language. It is necessary to identify the different script regions of the document in order to feed the document to the OCR of each individual language. In this context, this paper proposes to develop a model to identify and separate text lines of Telugu, Devanagari and English scripts from a ...

Journal: Balkan Thesaurus: Communication in complex-cultural Balkan societies 2019

2017
Arda Çelebi Arzucan Özgür

This paper describes the BOUN system’s participation in the Trilingual Entity Detection and Linking (EDL) track in the 2017 TAC Knowledge Base Population (KBP) challenge. EDL is an important aspect of text analysis, and various tasks like sentiment analysis and opinion mining can benefit from discovering which named entities are mentioned in the context. For this challenge, we built a simple candidate ...

2007
Sebastian Padó

Annotation projection is a strategy for the cross-lingual transfer of annotations which can be used to bootstrap linguistic resources for low-density languages, such as role-semantic databases similar to FrameNet. In this paper, we investigate the main assumption underlying annotation projection, cross-lingual parallelism, which states that annotation is parallel across languages. Concentrating...
