Survey: Finite-state technology in natural language processing
Similar resources
Survey: Finite-state technology in natural language processing
In this survey, we will discuss current uses of finite-state information in several statistical natural language processing tasks. To this end, we will review standard approaches in tokenization, part-of-speech tagging, and parsing, and illustrate the utility of finite-state information and technology in these areas. The particular problems were chosen to allow a natural progression from simple...
Finite-State Technology in Natural Language Processing
Finite-state technology is at the core of many standard approaches in natural language processing [1, 2]. However, the terminology and the notations differ significantly between theoretical computer science (TCS) [3] and natural language processing (NLP) [4]. In this lecture, inspired by [2, 4], we plan to illustrate the close ties between formal language theory as discussed in TCS and its use ...
Using Finite State Technology in Natural Language Processing of Basque
This paper describes the components used in the design and implementation of NLP tools for Basque. These components are based on finite state technology and are devoted to the morphological analysis of Basque, an agglutinative pre-Indo-European language. We think that our design can be interesting for the treatment of other languages. The main components developed are a general and robust morph...
Introduction to Finite-State Devices in Natural Language Processing
The theory of finite-state automata (FSA) is rich, and finite-state automata techniques have been used in a wide range of domains, such as switching theory, pattern matching, pattern recognition, speech processing, handwriting recognition, optical character recognition, encryption algorithms, data compression, indexing and operating system analysis (Petri nets). In this chapter, we describe the b...
Finite-state methods and models in natural language processing
Anssi Yli-Jyrä, András Kornai and Jacques Sakarovitch. 1 Department of Modern Languages, PO Box 24, 00014 University of Helsinki, Finland, email: [email protected]; 2 Computer and Automation Research Institute, Hungarian Academy of Sciences, Kende u 13-17, Budapest 1111, Hungary and Harvard University, Institute for Quantitative Social Science, 1737 Cambri...
Journal
Journal title: Theoretical Computer Science
Year: 2017
ISSN: 0304-3975
DOI: 10.1016/j.tcs.2016.05.030