Learning grammatical categories using paradigmatic representations: Substitute words for language acquisition
نویسندگان
چکیده
Learning word categories is a fundamental task in language acquisition. Previous studies show that co-occurrence patterns of preceding and following words are essential to group words into categories. However, the neighboring words, or frames, are rarely repeated exactly in the data. This creates data sparsity and hampers learning for frame based models. In this work, we propose a paradigmatic representation of word context which uses probable substitutes instead of frames. Our experiments on child-directed speech show that models based on probable substitutes learn more accurate categories with fewer examples compared to models based on frames.
منابع مشابه
Learning Syntactic Categories Using Paradigmatic Representations of Word Context
We investigate paradigmatic representations of word context in the domain of unsupervised syntactic category acquisition. Paradigmatic representations of word context are based on potential substitutes of a word in contrast to syntagmatic representations based on properties of neighboring words. We compare a bigram based baseline model with several paradigmatic models and demonstrate significan...
متن کاملWord Context and Token Representations from Paradigmatic Relations and Their Application to Part-of-Speech Induction
Representation of words as dense real vectors in the Euclidean space provides an intuitive definition of relatedness in terms of the distance or the angle between one another. Regions occupied by these word representations reveal syntactic and semantic traits of the words. On top of that, word representations can be incorporated in other natural language processing algorithms as features. In th...
متن کاملAcquisition and Representation of Grammatical Categories: Grammatical Gender in a Connectionist Network
In traditional models of language production grammatical categories are represented as abstract features independent of semantics and phonology. An alternative view is proposed where syntactic categories emerge as a higher-order regularity from semantic and phonological properties of words. The proposal was tested using grammatical gender in Serbian, a south Slavic language with rich morphology...
متن کاملThe Effect of Zipfian Frequency Variations on Category Formation in Adult Artificial Language Learning
Successful language acquisition hinges on organizing individual words into grammatical categories and learning the relationships between them, but the method by which children accomplish this task has been debated in the literature. One proposal is that learners use the shared distributional contexts in which words appear as a cue to their underlying category structure. Indeed, recent research ...
متن کاملSemantic Regularities in Grammatical Categories: Learning Grammatical Gender in an Artificial Language
The knowledge of grammatical categories such as nouns and verbs is considered to lie at the foundations of human language comprehension and production abilities. Words’ distributional and phonological properties contribute to both adult and infant learning of grammatical categories. Here we investigate the contribution of semantic cues to the acquisition of grammatical categories using grammati...
متن کامل