Learning grammatical categories using paradigmatic representations: Substitute words for language acquisition

Authors

  • Mehmet Ali Yatbaz
  • Volkan Cirik
  • Aylin Küntay
  • Deniz Yuret
Abstract

Learning word categories is a fundamental task in language acquisition. Previous studies show that co-occurrence patterns of preceding and following words are essential for grouping words into categories. However, the neighboring words, or frames, are rarely repeated exactly in the data. This creates data sparsity and hampers learning for frame-based models. In this work, we propose a paradigmatic representation of word context which uses probable substitutes instead of frames. Our experiments on child-directed speech show that models based on probable substitutes learn more accurate categories with fewer examples than models based on frames.
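
The contrast between frames and probable substitutes can be illustrated with a small sketch. Everything below is an assumption made for illustration: the toy corpus, the smoothed bigram model, and the function names `frame`, `substitutes`, and `top` are hypothetical and are not taken from the paper, which does not specify its language model in this abstract. The sketch only shows that two tokens can have different frames while sharing the same most probable substitutes.

```python
from collections import Counter

# Toy corpus standing in for child-directed speech (invented for illustration).
corpus = [
    "you want the ball".split(),
    "you want the cup".split(),
    "you see the ball".split(),
    "i see a cup".split(),
    "i want a ball".split(),
]

def frame(sentence, i):
    """Syntagmatic context: the (preceding, following) word pair at position i."""
    prev = sentence[i - 1] if i > 0 else "<s>"
    nxt = sentence[i + 1] if i + 1 < len(sentence) else "</s>"
    return (prev, nxt)

# Bigram and predecessor counts over sentences padded with boundary symbols.
bigrams = Counter()
first = Counter()
vocab = set()
for s in corpus:
    padded = ["<s>"] + s + ["</s>"]
    vocab.update(s)
    for a, b in zip(padded, padded[1:]):
        bigrams[(a, b)] += 1
        first[a] += 1

def substitutes(sentence, i, alpha=0.1):
    """Paradigmatic context: a distribution over words that could fill position i.

    Assumed scoring: P(w | prev) * P(next | w) with add-alpha smoothing,
    renormalized over the vocabulary. This is a stand-in model only.
    """
    prev, nxt = frame(sentence, i)
    n_next = len(vocab) + 1  # possible successors: every word plus "</s>"
    scores = {}
    for w in vocab:
        p_w = (bigrams[(prev, w)] + alpha) / (first[prev] + alpha * n_next)
        p_next = (bigrams[(w, nxt)] + alpha) / (first[w] + alpha * n_next)
        scores[w] = p_w * p_next
    total = sum(scores.values())
    return {w: v / total for w, v in scores.items()}

def top(dist, k=2):
    """Set of the k most probable substitutes."""
    return set(sorted(dist, key=dist.get, reverse=True)[:k])

# "ball" in sentence 0 and "cup" in sentence 3 occur in different frames,
# so a frame-based model sees no shared context between them.
frame_ball = frame(corpus[0], 3)
frame_cup = frame(corpus[3], 3)

# Their most probable substitutes, however, can be compared directly.
subs_ball = top(substitutes(corpus[0], 3))
subs_cup = top(substitutes(corpus[3], 3))
print(frame_ball, frame_cup)
print(subs_ball, subs_cup)
```

In this toy setting the two noun tokens never share a frame, yet their substitute sets overlap, which is the kind of generalization across unseen contexts that the abstract attributes to the paradigmatic representation.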

Similar resources

Learning Syntactic Categories Using Paradigmatic Representations of Word Context

We investigate paradigmatic representations of word context in the domain of unsupervised syntactic category acquisition. Paradigmatic representations of word context are based on potential substitutes of a word, in contrast to syntagmatic representations based on properties of neighboring words. We compare a bigram-based baseline model with several paradigmatic models and demonstrate significan...

Word Context and Token Representations from Paradigmatic Relations and Their Application to Part-of-Speech Induction

Representation of words as dense real vectors in the Euclidean space provides an intuitive definition of relatedness in terms of the distance or the angle between one another. Regions occupied by these word representations reveal syntactic and semantic traits of the words. On top of that, word representations can be incorporated in other natural language processing algorithms as features. In th...

Acquisition and Representation of Grammatical Categories: Grammatical Gender in a Connectionist Network

In traditional models of language production grammatical categories are represented as abstract features independent of semantics and phonology. An alternative view is proposed where syntactic categories emerge as a higher-order regularity from semantic and phonological properties of words. The proposal was tested using grammatical gender in Serbian, a south Slavic language with rich morphology...

The Effect of Zipfian Frequency Variations on Category Formation in Adult Artificial Language Learning

Successful language acquisition hinges on organizing individual words into grammatical categories and learning the relationships between them, but the method by which children accomplish this task has been debated in the literature. One proposal is that learners use the shared distributional contexts in which words appear as a cue to their underlying category structure. Indeed, recent research ...

Semantic Regularities in Grammatical Categories: Learning Grammatical Gender in an Artificial Language

The knowledge of grammatical categories such as nouns and verbs is considered to lie at the foundations of human language comprehension and production abilities. Words’ distributional and phonological properties contribute to both adult and infant learning of grammatical categories. Here we investigate the contribution of semantic cues to the acquisition of grammatical categories using grammati...

Publication date: 2016