نتایج جستجو برای: word lexical units
تعداد نتایج: 292572 فیلتر نتایج به سال:
the aim of this paper is to report on an attempt to design and implement an intelligent system capable of generating the correct part of speech for a given sentence while the sentence is totally new to the system and not stored in any database available to the system. it follows the same steps a normal individual does to provide the correct parts of speech using a natural language processor. it...
This paper describes a new method of word model generation based on acoustically derived segment units (henceforth ASUs). An ASU-based approach has the advantages of growing out of human pre-determined phonemes and of consistently generating acoustic units by using the maximum likelihood (ML) criterion. The former advantage is e ective when it is di cult to map acoustics to a phone such as with...
For languages with limited training resources, out-ofvocabulary (OOV) words are a significant problem, both for transcription and keyword spotting. This paper investigates the use of subword lexical units for keyword spotting. Three strategies for using the sub-word units are explored: 1) converting word-based lattices to subword lattices after decoding, 2) performing a separate decoding for ea...
Two experiments are reported in which the processing units involved in the reading of French polysyllabic words are examined. A comparison was made between units following the maximal onset principle (i.e., the spoken syllable) and units following the maximal coda principle (i.e., the basic orthographic syllabic structure [BOSS]). In the first experiment, it took longer to recognize that a syll...
Despite numerous studies, the aspect of word formation nouns in modern literary German language Germany and Austria is yet to be fully covered.
 The purpose this study determine patterns word-formation substantives Austrian Standard using example lexeme “Kraut” (cabbage).
 We used continuous sampling method lexicographical literature for German.
 As a result study, universal uniq...
This paper describes the representation of Basque Multiword Lexical Units and the automatic processing of Multiword Expressions. After discussing and stating which kind of multiword expressions we consider to be processed at the current stage of the work, we present the representation schema of the corresponding lexical units in a generalpurpose lexical database. Due to its expressive power, th...
NooJ associates each text with a Text Annotation Structure, in which each recognized linguistic unit is represented by an annotation. Annotations store the position of the text units to be represented, their length, and linguistic information. NooJ can represent and process complex annotations, such as those that represent units inside word forms, as well as those that are discontinuous. We dem...
In this paper, we present an application of Genetic Algorithms to extract Multiword Units (i.e. complex lexical units such as compound nouns, idiomatic expressions or phrase templates). For that purpose, a fitness function will be defined whose maximization will serve as a basis for the identification of pertinent word -grams (i.e ordered vectors of words) based on different similarity measures...
Slovar slovenskega knjižnega jezika 2022 ('Dictionary of the Slovenian Standard Language 2022') contains a modern linguistic description multifaceted and complex reality language based on use as reflected in diverse resources, primarily corpora. A total 489 dictionary entries introduced by single-word headwords offers wholesome systematic semantic, grammatical, pragmatic other characteristics m...
When NooJ performs an automatic lexical analysis of corpora, it recognizes five types of atomic linguistic units (ALUs) and represents them as annotations stored inside each text’s annotation structure (TAS). Unfortunately, the massive level of ambiguities generated by each of the five corresponding parsers produces a TAS far too heavy for most corpus linguistics applications. In consequence, m...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید