Integrated Linguistic Resources for Language Exploitation Technologies
نویسندگان
چکیده
Linguistic Data Consortium has recently embarked on an effort to create integrated linguistic resources and related infrastructure for language exploitation technologies within the DARPA GALE (Global Autonomous Language Exploitation) Program. GALE targets an end-to-end system consisting of three major engines: Transcription, Translation and Distillation. Multilingual speech or text from a variety of genres is taken as input and English text is given as output, with information of interest presented in an integrated and consolidated fashion to the end user. GALE's goals requires a quantum leap in the performance of human language technology, while also demanding solutions that are more intelligent, more robust, more adaptable, more efficient and more integrated. LDC has responded to this challenge with a comprehensive approach to linguistic resource development designed to support GALE's research and evaluation needs and to provide lasting resources for the larger Human Language Technology community.
منابع مشابه
Multipurpose Design of Greek Sign Language Resources: a Factor towards Universal Access
In this paper we present the methodology of data collection and implementation of databases for the creation of extensive lexical and terminological resources for the Greek Sign Language (GSL) in order to introduce the major issue of dynamic sign representation. In respect to electronic linguistic resources of GSL, the focus is on issues of validation of linguistic content, multipurpose design ...
متن کاملIntegrated Environment for Management and Exploitation of Linguistic Resources
act — In this paper we describe two tools that form an ed environment which can be successfully used for ment and exploitation of linguistic resources. Both the d the resources were developed within the University of e Human Language Technology Group. The tools we are WS4LR, a software tool that has been developed d for solving different tasks within the Group, and a lication named WS4QE, accom...
متن کاملSelection of Foreign Language Teaching Content in Russian Master of Laws (LLM) Graduate Programs
Master`s degree was integrated into the system of Russian Higher Education several decades ago, however, teaching foreign languages at this level still needs further analysis including the postgraduate law students training. The article investigates the principal components of foreign language teaching in Master of laws Graduate Programs (considering the case of the English language) on the bas...
متن کاملIntegrated Language Technologies for Multilingual Information Services in the MEMPHIS Project
The MEMPHIS project integrates a large set of NLP technologies. An overview of components, their underlying technologies and resources will be presented: language identification, document classification, linguistic analysis, summarization, information extraction, machine translation, knowledge management and crosslingual retrieval.
متن کاملSign Language & Linguistics
The work reported in this study is based on research that has been carried out while developing a sign synthesis system for Greek Sign Language (GSL): theoretical linguistic analysis as well as lexicon and grammar resources derived from this analysis. We focus on the organisation of linguistic knowledge that initiates the multi-functional processing required to achieve sign generation performed...
متن کامل