The Tel Aviv University System for the Code-Switching Workshop Shared Task
Abstract
We describe our entry in the EMNLP 2014 code-switching shared task. Our system is based on a sequential classifier, trained on the shared training set using various character- and word-level features, some calculated using large monolingual corpora. We participated in the Twitter-genre Spanish-English track, obtaining an accuracy of 0.868 when measured on the tweet level and 0.858 on the word level.
Similar resources
The George Washington University System for the Code-Switching Workshop Shared Task 2016
We describe our work in the EMNLP 2016 second code-switching shared task; a generic language independent framework for linguistic code switch point detection (LCSPD). The system uses character-level 5-grams and word-level unigram language models to train a conditional random fields (CRF) model for classifying input words into various languages. We participated in the Modern Standard Arabic (MS...
The Howard University System Submission for the Shared Task in Language Identification in Spanish-English Codeswitching
This paper describes the Howard University system for the language identification shared task of the Second Workshop on Computational Approaches to Code Switching. Our system is based on prior work on Swahili-English token-level language identification. Our system primarily uses character n-gram, prefix and suffix features, letter case and special character features along with previously existin...
Improved Orientations of Physical Networks
The orientation of physical networks is a prime task in deciphering the signaling-regulatory circuitry of the cell. One manifestation of this computational task is as a maximum graph orientation problem, where given an undirected graph with n vertices and a collection of vertex pairs, the goal is to orient the edges of the graph so that a maximum number of pairs are connected by a directed path...
Columbia-Jadavpur submission for EMNLP 2016 Code-Switching Workshop Shared Task: System description
We describe our present system for language identification as a part of the EMNLP 2016 Shared Task. We were provided with the Spanish-English corpus composed of tweets. We have employed a predictor-corrector algorithm to accomplish the goals of this shared task and analyzed the results obtained.
Finite-buffer polling systems with threshold-based switching policy
We consider a system of two separate finite-buffer M/M/1 queues served by a single server, where the switching mechanism between the queues is threshold-based, determined by the queue which is not being served. Applications may be found in data centers, smart traffic-light control and human behavior. Specifically, whenever the server attends queue i (Qi) and the number of customers in the other...