Discriminative Analysis of Linguistic Features for Typological Study
Abstract
We address the task of automatically estimating missing values of linguistic features, exploiting the fact that some features in typological databases are informative about one another. We ask two questions: (i) how much predictive power do features have for the value of another feature, and (ii) to what extent can this predictive power be attributed to genealogical or areal factors rather than to tendencies or implicational universals? To answer them, we conduct a discriminative (predictive) analysis of the typological database. Specifically, we use a machine-learning classifier to estimate the value of each feature of each language from the values of its other features, under two choices of training data: all other languages, or all other languages except those that share a genealogical origin or area with the target language.
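The two training regimes described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not name the classifier or the database, so a small naive Bayes model with add-one smoothing stands in for the classifier, and the `LANGS` table, its feature names, and the helpers `training_pool`, `predict`, and `accuracy` are all hypothetical.

```python
import math
from collections import Counter

# Toy, invented data for illustration only; not taken from any real database.
LANGS = {
    "A1": {"family": "FamA", "area": "North", "feats": {"order": "SOV", "adp": "post"}},
    "A2": {"family": "FamA", "area": "North", "feats": {"order": "SOV", "adp": "post"}},
    "B1": {"family": "FamB", "area": "South", "feats": {"order": "SVO", "adp": "prep"}},
    "B2": {"family": "FamB", "area": "South", "feats": {"order": "SVO", "adp": "prep"}},
    "C1": {"family": "FamC", "area": "East", "feats": {"order": "SOV", "adp": "post"}},
    "C2": {"family": "FamC", "area": "West", "feats": {"order": "SVO", "adp": "prep"}},
}


def training_pool(target, langs, exclude_related):
    """Return the languages allowed as training data for `target`."""
    t = langs[target]
    pool = []
    for name, lang in langs.items():
        if name == target:
            continue  # never train on the language being predicted
        related = lang["family"] == t["family"] or lang["area"] == t["area"]
        if exclude_related and related:
            continue  # second regime: drop same-family or same-area languages
        pool.append(lang)
    return pool


def predict(target, feature, langs, exclude_related):
    """Predict `feature` of `target` from its other features (naive Bayes stand-in)."""
    pool = [l for l in training_pool(target, langs, exclude_related)
            if feature in l["feats"]]
    if not pool:
        return None
    evidence = {f: v for f, v in langs[target]["feats"].items() if f != feature}
    class_counts = Counter(l["feats"][feature] for l in pool)
    best, best_score = None, -math.inf
    for label in sorted(class_counts):
        members = [l for l in pool if l["feats"][feature] == label]
        score = math.log(class_counts[label] / len(pool))  # class prior
        for f, v in evidence.items():
            known = [l for l in members if f in l["feats"]]
            match = sum(1 for l in known if l["feats"][f] == v)
            n_values = len({l["feats"][f] for l in pool if f in l["feats"]} | {v})
            score += math.log((match + 1) / (len(known) + n_values))  # add-one smoothing
        if score > best_score:
            best, best_score = label, score
    return best


def accuracy(langs, exclude_related):
    """Fraction of (language, feature) pairs whose held-out value is recovered."""
    hits = total = 0
    for name, lang in langs.items():
        for feature, gold in lang["feats"].items():
            guess = predict(name, feature, langs, exclude_related)
            if guess is None:
                continue
            total += 1
            hits += guess == gold
    return hits / total if total else float("nan")
```

Comparing `accuracy(LANGS, False)` with `accuracy(LANGS, True)` mirrors the comparison in the abstract: under these assumptions, the drop in accuracy when related languages are withheld gives a rough indication of how much predictive power comes from genealogical or areal relatedness rather than from cross-linguistic tendencies.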
Similar resources
Constitutive Features of the Russian Political Discourse in Ecolinguistic Aspect
The article offers a comparative description of typological mechanisms used in political communicative practice and methods of verbal explication of its axiological and symbolic constituents determining universal mental features of individual/collective consciousness. The research position based on a systemic multilevel analysis of the component structure of discourse facilitates the identifica...
Linguistic Watersheds: a Model for Understanding Variation among the Tibetic Languages
This study applies the observation of alignment between geographical watersheds and linguistic groupings to the Tibetan Plateau and the Himalayas. Tournadre (2014) estimates 220 Tibetic language varieties in 25 major groupings, sharing a common linguistic ancestry. Typological groupings can be readily identified through mapping human settlements to watersheds. For areas that have yet to be rese...
Modeling the Relationship among Linguistic Typological Features with Hierarchical Dirichlet Process
We propose that topic models can be used to represent the relationship among linguistic typological features. Typological features are typically analyzed in terms of universal implications. We argue that topic models can better capture some phenomena, such as universal tendencies, which are hard to explain by implications. We conduct experiments to evaluate the predictive accuracy of our H...
Target Language Adaptation of Discriminative Transfer Parsers
We study multi-source transfer parsing for resource-poor target languages; specifically methods for target language adaptation of delexicalized discriminative graph-based dependency parsers. We first show how recent insights on selective parameter sharing, based on typological and language-family features, can be applied to a discriminative parser by carefully decomposing its model features. We...
The Effects of Task Complexity on Input-Driven Uptake of Salient Linguistic Features
The present study investigated the effects of cognitive complexity of pedagogical tasks on the learners’ uptake of salient features in the input. For the purpose of data collection, three versions of a decision-making task (simple, mid, and complex) were employed. Three intact classes (each 20 language learners) were randomly assigned to three groups. Each group transacted a version of a decis...