نتایج جستجو برای: the historical

تعداد نتایج: 16064877  

2015
Rafael C. Carrasco Isabel Martínez-Sempere Enrique Mollá-Gandía Felipe Sánchez-Martínez Gustavo Candela Romero Maria Pilar Escobar Esteban

The BVC section of the impact-es diachronic corpus of historical Spanish compiles 86 books —containing approximately 2 million words. About 27% of the words —providing a representative coverage of the most frequent word forms— have been annotated with their lemma, part of speech, and modern equivalent following the Text Encoding Initiative guidelines. We describe how this type of annotation can...

2010
Sai-Ming Li Mohammad Mahdian R. Preston McAfee

The standard business model in the sponsored search marketplace is to sell click-throughs to the advertisers. This involves running an auction that allocates advertisement opportunities based on the value the advertiser is willing to pay per click, times the click-through rate of the advertiser. The click-through rate of an advertiser is the probability that if their ad is shown, it would be cl...

2011
Sokratis Vavilis Ergina Kavallieratou Roberto Paredes Kostas Sotiropoulos

In this chapter, a binarization technique specifically designed for historical document images is presented. Existing binarization techniques focus either on finding an appropriate global threshold or adapting a local threshold for each area in order to remove smear, strains, uneven illumination etc. Here, a hybrid approach is presented that first applies a global thresholding technique and, th...

2016
Mathias Coeckelbergs Seth van Hooland

Providing useful and efficient semantic annotations is a major challenge for knowledge design of any body of text, especially historical documents. In this article, we propose Topic Modeling as an important first step to gather semantic information beyond the lexicon which can be added as annotations in the SHEBANQ. By laying out a case study, we discuss both noise and structure found in compar...

2017
Christoph Dann Emma Brunskill René Kizilcec

In this project, we aim to estimate the effect of different teaching strategies in a tutoring system on student learning and how that effect varies across different groups of students. More specifically, we want to shed light on whether choosing exercise problems adaptively based on prior student performance is more effective at teaching elementary school students about fractions than non-adapt...

2008
Jyi-Shane Liu

In this paper, we report a databank development project in which structured textual data from historical documents are extracted to provide information access of higher data granularity. The availability of the databank opens up tremendous opportunities for research topics in government personnel systems that were limited by data acquisition difficulty in the past. The project demonstrates the ...

2005
DIANE E. BAILEY STEPHEN R. BARLEY

Industrial engineering was originally founded as a discipline that focused on the study and design of work. Yet, today the field has largely distanced itself from this early concern. This paper tracks the decline of work studies in industrial engineering and explores the question of why the discipline lost its concern for work and, ultimately, its ability to speak to the kinds of social and eco...

2015
Zahrul Islam Natia Dundua

Gospels are one type of translated historical document. There are many versions of the same Gospel that have been translated from the original, or from another Gospel that has already been translated into a different language. Nowadays, it is difficult to determine the language of the original Gospel from where these Gospels were translated. In this paper we use a supervised machine learning te...

2017
Mika Koistinen Kimmo Kettunen Tuula Pääkkönen

In this paper we describe a method for improving the optical character recognition (OCR) toolkit Tesseract for Finnish historical documents. First we create a model for Finnish Fraktur fonts. Second we test Tesseract with the created Fraktur model and Antiqua model on single images and combinations of images with different image preprocessing methods. Against commercial ABBYY FineReader toolkit...

2014
Hannah Pileggi Briana Morrison Amy Bruckman

This descriptive study explores deliberate barriers to user participation on the long-lived discussion site Metafilter.com. Metafilter has been in continuous operation since its founding in 1999, and at the time of this writing has around 12,000 active users. While many newer online sites appear eager to eliminate barriers to participation and recruit as many new members as possible, Metafilter...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید