The Creation, Distribution and Use of Linguistic Data: the Case of the Linguistic Data Consortium
نویسندگان
چکیده
The Linguistic Data Consortium (LDC) is an open consortium of universities, companies and government research laboratories. It creates and distributes speech and text databases, lexicons and other resources. The University of Pennsylvania is the LDC’s host institution. The LDC was founded in 1992 with a grant from the Defense Advanced Research Projects Agency (DARPA). Currently, all LDC publication and distribution activities are self-supporting, while new data creation is partly supported by grant IRI 9528587 from the Information, Robotics and Intelligent Systems division of the National Science Foundation (NSF). The LDC’s core mission remains the support of pre-competitive research and development in speech and language technology, but support of other language-related research is also an important focus.
منابع مشابه
A Progress Report from the Linguistic Data Consortium: Recent Activities in Resource Creation and Distribution and the Development of Tools and Standards
This paper described recent activities of the Linguistic Data Consortium in the collection, annotation and distribution of language data the developments of tools and standards for using that data, the creation of metadata to facilitate the search for linguistic resources.
متن کاملA Contrastive Study of Metadiscourse in English and Persian Editorials
The original impetus for this cross-linguistic study came from a need to explore the effect of cultural factors and generic conventions on the use and distribution of metadiscourse within a single genre. To this end, the study as a contrastive rhetoric research, examined a corpus of 60 newspaper editorials (written in English and Persian) culled from 10 elite newspapers in America and Iran. Bas...
متن کاملLanguage Resource Creation and Distribution at the Linguistic Data Consortium: A Progress Report
Changes in the supply of and demand for language resources continues to affect the role of large data centers such as the Linguistic Data Consortium (LDC) and European Language Resource Center (ELRA) within the research communities they serve. The past few years have seen increased demand for: intensively multi-modal resources, larger data sets in high-density languages and new data in low dens...
متن کاملNeuropsychological Double Dissociation between Linguistic Levels: Clinical Linguistic Evidence from Iranian Aphasic Patients
Introduction: In this paper we report on clinical linguistic applications of several versions of the Bilingual Aphasia Test (BAT) and the Persian Aphasia Battery (PAB) developed to assess patterns of recovery and language impairments in monolingual and bilingual aphasics with different clinical histories living in Iran. Methods: The participants are adult monolingual native speakers of Persian ...
متن کاملEffectiveness of the Linguistic Plays on Improving the Reading Skills of Educable Mental Retarded Preliminary School Students
Abstract The present study has been conducted with the purpose of exploring linguistic plays in increasing reading skill among retarded students. The kind of the study is quasi-experimental with pre-test and post-test, being conducted among all retarded students studying in second grade of elementary schools at Mashhad. The sample included 30 subjects, randomly selected and assigned as experim...
متن کامل