written texts film

Search results for: written texts film

Number of results: 208079

Decoding Anagrammed Texts Written in an Unknown Language and Script

Journal: TACL 2016

Bradley Hauer Grzegorz Kondrak

Algorithmic decipherment is a prime example of a truly unsupervised problem. The first step in the decipherment process is the identification of the encrypted language. We propose three methods for determining the source language of a document enciphered with a monoalphabetic substitution cipher. The best method achieves 97% accuracy on 380 languages. We then present an approach to decoding ana...

Automatic Language Identification from Written Texts – An Overview

2014

H L Shashirekha

Language Identification is the task of automatically identifying the language(s) in which the content is written in a document (web page, text document). Due to the widespread use of internet, identification of languages has become an important preprocessing step for a number of applications such as machine translation, Part-of-Speech tagging, linguistic corpus creation, supporting low-density ...

Gender, Genre, and Writing Style in Formal Written Texts

2003

Shlomo Argamon Moshe Koppel Jonathan Fine Anat Rachel Shimoni

This paper explores differences between male and female writing in a large subset of the British National Corpus covering a range of genres. Several classes of simple lexical and syntactic features that differ substantially according to author gender are identified, both in fiction and in non-fiction documents. In particular, we find significant differences between male- and female-authored docum...

Word-length entropies and correlations of natural language written texts

Journal: Journal of Quantitative Linguistics 2015

Maria Kalimeri Vassilios Constantoudis Constantinos Papadimitriou Konstantinos Karamanos Fotis K. Diakonos Harris Papageorgiou

We study the frequency distributions and correlations of the word lengths of ten European languages. Our findings indicate that a) the word-length distribution of short words quantified by the mean value and the entropy distinguishes the Uralic (Finnish) corpus from the others, b) the tails at long words, manifested in the high-order moments of the distributions, differentiate the Germanic lang...

Linguistic complexity: English vs. Polish, text vs. corpus

Journal: CoRR 2010

Jaroslaw Kwapien Stanislaw Drozdz Adam Orczyk

We analyze the rank-frequency distributions of words in selected English and Polish texts. We show that for the lemmatized (basic) word forms the scale-invariant regime breaks after about two decades, while it might be consistent for the whole range of ranks for the inflected word forms. We also find that for a corpus consisting of texts written by different authors the basic scale-invariant re...

Automatic Transcription of Lecture Speech using Language Model Based on Speaking-Style Transformation of Proceeding Texts

2012

Yuya Akita Makoto Watanabe Tatsuya Kawahara

For language modeling of spontaneous speech recognition, we propose a style transformation approach, which transforms written texts to a spoken-style language model. Since these two styles are largely different and thus direct transformation is difficult, we cascade two transformation methods; rule-based transformation to rewrite written-style texts to intermediate “verbatim” texts, and statist...

Dynamic Semantics at Work

2004

Rolf Schwitter Marc Tilbrook

In this case study we show how an unambiguous semantic representation can be constructed dynamically in left-to-right order while a text is written in PENG, a controlled natural language designed for knowledge representation. PENG can be used in contexts where precise texts (e.g. software specifications, axioms for formal ontologies, legal documents) need to be composed. Texts written in PENG l...

Syntactic annotation of spoken utterances: A case study on the Czech Academic Corpus

2009

Barbora Hladká Zdenka Uresová

Corpus annotation plays an important role in linguistic analysis and computational processing of both written and spoken language. Syntactic annotation of spoken texts becomes clearly a topic of considerable interest nowadays, driven by the desire to improve automatic speech recognition systems by incorporating syntax in the language models, or to build language understanding applications. Synt...

Collaboration, creativity and the co-construction of oral and written texts

Journal: Thinking Skills and Creativity 2008

Probing the Topological Properties of Complex Networks Modeling Short Written Texts

Journal: PLOS ONE 2015
