نتایج جستجو برای: unanalyzed chunks

تعداد نتایج: 2557  

2010
Olivier Blanc Matthieu Constant Anne Dister Patrick Watrin

This paper describes the process and the resources used to automatically annotate a French corpus of spontaneous speech transcriptions in super-chunks. Super-chunks are enhanced chunks that can contain lexical multiword units. This partial parsing is based on a preprocessing stage of the spoken data that consists in reformatting and tagging utterances that break the syntactic structure of the t...

2015
Michael Hirsch Shmuel Tomi Klein Dana Shapira Yair Toaff

A special case of data compression in which repeated chunks of data are stored only once is known as deduplication. The input data is cut into chunks and a cryptographically strong hash value of each (different) chunk is stored. To restrict the influence of small inserts and deletes to local perturbations, the chunk boundaries are usually defined in a data dependent way, which implies that the ...

2008
Kristina Vuckovic Marko Tadic Zdravko Dovedan Han

In this paper we discuss a rule-based approach to chunking sentences in Croatian, implemented using local regular grammars within the NooJ development environment. We describe the rules and their implementation by regular grammars and at the same time show that in NooJ environment it is extremely easy to fine tune their different sub-rules. Since Croatian has strong morphosyntactic features tha...

2008
Petr Homola Vladislav Kubon

This paper describes a shallow parsing formalism aiming at machine translation between closely related languages. The formalism allows to write grammar rules helping to (partially) disambiguate chunks in input sentences. The chunks are then translatred into the target language without any deep syntactic or semantic processing. A stochastic ranker then selects the best translation according to t...

2006
Gabriel G. Bès Lionel Lamadon François Trouilleux

A way of extracting French verbal chunks, inflected and infinitive, is explored and tested on effective corpus. Declarative morphological and local grammar rules specifying chunks and some simple contextual structures are used, relying on limited lexical information and some simple heuristic/statistic properties obtained from restricted corpora. The specific goals, the architecture and the form...

2014
Atsushi Tagami Chikara Sasaki Katsunori Yamaoka

Download Consumption Consumer s ta r ts conten t download using mobile network (e.g., LTE/3G) When Consumer enters the mi l l iwave network, content chunks required in the feture are downloaded using the network. Consumer cont inue to service using the pre-downloaded content chunks. Help from the spot networks improves the content download time and offloads come portion of the mobile network tr...

2007

In this paper we describe language recognition algorithms for monoand multi-lingual documents that are based on mixed-order n-grams, Markov chains, maximum likelihood, and dynamic programming. We compare the monolingual algorithm to those suggested by other researchers. This comparison suggests that this algorithm significantly outperforms commonly used language recognition algorithms. We then ...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید