Search results for: word lexical units

Number of results: 292572

Journal: Journal of English Language Teaching and Learning 2012
Vahid Reza Mirzaeian

The aim of this paper is to report on an attempt to design and implement an intelligent system capable of generating the correct parts of speech for a given sentence when the sentence is entirely new to the system and not stored in any database available to it. It follows the same steps a normal individual does to provide the correct parts of speech using a natural language processor. It...

1996
Toshiaki Fukada, Michiel Bacchiani, Kuldip K. Paliwal, Yoshinori Sagisaka

This paper describes a new method of word model generation based on acoustically derived segment units (henceforth ASUs). An ASU-based approach has the advantages of growing out of human pre-determined phonemes and of consistently generating acoustic units by using the maximum likelihood (ML) criterion. The former advantage is effective when it is difficult to map acoustics to a phone such as with...

2014
William Hartmann, Viet Bac Le, Abdelkhalek Messaoudi, Lori Lamel, Jean-Luc Gauvain

For languages with limited training resources, out-of-vocabulary (OOV) words are a significant problem, both for transcription and keyword spotting. This paper investigates the use of subword lexical units for keyword spotting. Three strategies for using the subword units are explored: 1) converting word-based lattices to subword lattices after decoding, 2) performing a separate decoding for ea...

Journal: Memory & Cognition 2001
A Rouibah, M Taft

Two experiments are reported in which the processing units involved in the reading of French polysyllabic words are examined. A comparison was made between units following the maximal onset principle (i.e., the spoken syllable) and units following the maximal coda principle (i.e., the basic orthographic syllabic structure [BOSS]). In the first experiment, it took longer to recognize that a syll...

2023

Despite numerous studies, the word formation of nouns in the modern literary German language of Germany and Austria is yet to be fully covered.
The purpose of this study is to determine patterns of word formation of substantives in Austrian Standard German using the example of the lexeme “Kraut” (cabbage).
We used the continuous sampling method on lexicographical literature for German.
As a result of the study, universal uniq...

2004
Iñaki Alegria, Olatz Ansa, Xabier Artola, Nerea Ezeiza, Koldo Gojenola, Ruben Urizar

This paper describes the representation of Basque Multiword Lexical Units and the automatic processing of Multiword Expressions. After discussing and stating which kind of multiword expressions we consider to be processed at the current stage of the work, we present the representation schema of the corresponding lexical units in a general-purpose lexical database. Due to its expressive power, th...

2010
Max Silberztein

NooJ associates each text with a Text Annotation Structure, in which each recognized linguistic unit is represented by an annotation. Annotations store the position of the text units to be represented, their length, and linguistic information. NooJ can represent and process complex annotations, such as those that represent units inside word forms, as well as those that are discontinuous. We dem...
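The annotation model this abstract describes (a position, a length, and linguistic information per unit, including units inside word forms and discontinuous units) can be pictured with a minimal sketch. This is not NooJ's actual data format or API; the `Annotation` class, its field names, and the example sentence are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Annotation:
    """Hypothetical annotation: one or more (start, length) text segments
    plus free-form linguistic information."""
    # More than one segment models a discontinuous unit.
    segments: tuple[tuple[int, int], ...]
    info: dict = field(default_factory=dict)

    @property
    def is_discontinuous(self) -> bool:
        return len(self.segments) > 1

    def covered_text(self, text: str) -> str:
        # Join the text covered by each segment, in order.
        return " ".join(text[s:s + n] for s, n in self.segments)


text = "She turned the light off"

# A toy annotation structure: a plain list of annotations over `text`.
tas = [
    Annotation(((4, 6),), {"lemma": "turn", "pos": "V"}),           # "turned"
    Annotation(((8, 2),), {"morph": "past"}),                       # "ed", inside a word form
    Annotation(((4, 6), (21, 3)), {"lemma": "turn off", "pos": "V"}),  # discontinuous
]

for a in tas:
    print(a.covered_text(text), a.info, a.is_discontinuous)
```

Storing segments as offset and length pairs keeps the text itself untouched, so several competing annotations can cover the same characters.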

2004
Gaël Dias, Sérgio Nunes

In this paper, we present an application of Genetic Algorithms to extract Multiword Units (i.e. complex lexical units such as compound nouns, idiomatic expressions or phrase templates). For that purpose, a fitness function will be defined whose maximization will serve as a basis for the identification of pertinent word n-grams (i.e. ordered vectors of words) based on different similarity measures...
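As a small illustration of the candidate units mentioned in this abstract, the sketch below enumerates word n-grams as ordered tuples of words and counts them. It does not implement the paper's genetic algorithm, fitness function, or similarity measures, none of which are specified in the truncated snippet; the function name `word_ngrams` and the sample sentence are assumptions made for the example.

```python
from collections import Counter


def word_ngrams(tokens, n):
    """Return all contiguous word n-grams of `tokens` as ordered tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


tokens = "the prime minister met the prime minister of spain".split()

# Count bigram candidates; repeated n-grams are typical multiword-unit candidates.
counts = Counter(word_ngrams(tokens, 2))
print(counts.most_common(3))
```

Any scoring scheme for multiword units would take candidates like these as input; which scores the authors actually use is left open here.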

2023

Slovar slovenskega knjižnega jezika 2022 ('Dictionary of the Slovenian Standard Language 2022') contains a modern linguistic description of the multifaceted and complex reality of language, based on use as reflected in diverse resources, primarily corpora. A total of 489 dictionary entries introduced by single-word headwords offer wholesome, systematic semantic, grammatical, pragmatic and other characteristics m...

2010
Max Silberztein

When NooJ performs an automatic lexical analysis of corpora, it recognizes five types of atomic linguistic units (ALUs) and represents them as annotations stored inside each text’s annotation structure (TAS). Unfortunately, the massive level of ambiguities generated by each of the five corresponding parsers produces a TAS far too heavy for most corpus linguistics applications. In consequence, m...
