نتایج جستجو برای: historical texts

تعداد نتایج: 141368  

Journal: :Journal of Indian and Buddhist Studies (Indogaku Bukkyogaku Kenkyu) 2012

Journal: :Meta: Journal des traducteurs 1999

Journal: :The Proceedings of the Annual Convention of the Japanese Psychological Association 2017

Journal: :International Journal Of Turkish Literature Culture Education 2012

2013
Alina Maria Ciobanu Anca Dinu Liviu P. Dinu Vlad Niculae Octavia-Maria Sulea

In this paper we look at a task at border of natural language processing, historical linguistics and the study of language development, namely that of identifying the time when a text was written. We use machine learning classification using lexical, word ending and dictionary-based features, with linear support vector machines and random forests. We find that lexical features are the most help...

2008
Claire Grover Sharon Givon Richard Tobin Julian Ball

We describe and evaluate a prototype system for recognising person and place names in digitised records of British parliamentary proceedings from the late 17th and early 19th centuries. The output of an OCR engine is the input for our system and we describe certain issues and errors in this data and discuss the methods we have used to overcome the problems. We describe our rule-based named enti...

2011
Eva Pettersson Joakim Nivre

Even though historical texts reveal a lot of interesting information on culture and social structure in the past, information access is limited and in most cases the only way to find the information you are looking for is to manually go through large volumes of text, searching for interesting text segments. In this paper we will explore the idea of facilitating this timeconsuming manual effort,...

2016
Yi Yang Jacob Eisenstein

As more historical texts are digitized, there is interest in applying natural language processing tools to these archives. However, the performance of these tools is often unsatisfactory, due to language change and genre differences. Spelling normalization heuristics are the dominant solution for dealing with historical texts, but this approach fails to account for changes in usage and vocabula...

2011
Marcel Bollmann Florian Petran Stefanie Dipper

This paper deals with normalization of language data from Early New High German. We describe an unsupervised, rulebased approach which maps historical wordforms to modern wordforms. Rules are specified in the form of context-aware rewrite rules that apply to sequences of characters. They are derived from two aligned versions of the Luther bible and weighted according to their frequency. The eva...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید