Latent Semantic Analysis for German Literature Investigation
Abstract
This paper presents the results of experiments applying Latent Semantic Analysis (LSA) to textual data. The method is explained briefly, with special attention to its potential for comparing and investigating German literary texts. Two hypotheses are tested: 1) texts by the same author are alike and can be distinguished from texts by other authors; 2) prose and poetry can be told apart automatically.
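As a rough illustration of the kind of comparison the abstract describes, the following is a minimal sketch of a generic LSA pipeline: build a term-document count matrix, truncate its singular value decomposition, and compare documents by cosine similarity in the reduced space. It assumes NumPy, whitespace tokenization, and raw counts without weighting; the function name `lsa_similarity`, the parameter `k`, and the toy sentences are hypothetical and are not taken from the paper, whose actual preprocessing and settings are not stated here.

```python
import numpy as np


def lsa_similarity(docs, k=2):
    """Cosine similarities between documents in a k-dimensional LSA space.

    Illustrative sketch only: whitespace tokens and raw counts are assumptions.
    """
    # Tokenize naively and collect the vocabulary.
    tokenized = [d.lower().split() for d in docs]
    vocab = sorted({w for toks in tokenized for w in toks})
    index = {w: i for i, w in enumerate(vocab)}

    # Term-document matrix: one row per term, one column per document.
    A = np.zeros((len(vocab), len(docs)))
    for j, toks in enumerate(tokenized):
        for w in toks:
            A[index[w], j] += 1.0

    # Thin SVD; keep only the k largest singular values (the "latent" space).
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    k = min(k, len(s))
    doc_vecs = (np.diag(s[:k]) @ Vt[:k, :]).T  # one row per document

    # Normalize rows, guarding against all-zero vectors, then take dot products.
    norms = np.linalg.norm(doc_vecs, axis=1, keepdims=True)
    norms[norms == 0] = 1.0
    unit = doc_vecs / norms
    return unit @ unit.T


# Toy usage: two sentences about the moon, two about taxes (hypothetical data).
docs = [
    "der mond ist aufgegangen",
    "der mond scheint hell",
    "die steuer wird erhoben",
    "die steuer steigt",
]
sim = lsa_similarity(docs, k=2)
```

In this toy setup, documents that share vocabulary should end up closer to each other than to the unrelated ones. Testing hypotheses such as authorship or prose versus poetry would need real corpora and a choice of `k` that the abstract does not specify.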
Similar resources
Latent Semantic Analysis for Russian Literature Investigation
This paper presents the results of experiments applying Latent Semantic Analysis to textual data. The method is explained briefly, with special attention to its potential for comparing and investigating Russian literary texts. Two hypotheses are tested: • Texts by the same author are alike and can be distinguished from texts by other authors; • The prose...
Spontaneous semantic associations of German verbs: Giving ontological and functional structure to speakers’ elicited concepts
This work is concerned with an investigation of spontaneous semantic associations. We performed a web experiment where linguistic experts and non-experts were asked to spontaneously list semantic associations for German verbs. The elicited conceptual knowledge was then given ontological structure based on codes from the psycholinguistic ontology GermaNet as well as linguistic functions obtained...
Query expansion based on relevance feedback and latent semantic analysis
Web search engines are among the most popular tools on the Internet and are widely used by expert and novice users alike. Constructing an adequate query that best represents the user's information need is an important concern for web users. Query expansion is a way to reduce this concern and increase user satisfaction. In this paper, a new method of query expa...
Latent Semantic Clustering of German Verbs with Treebank Data
Treebank data have been utilized as data sources for a wide range of tasks in computational linguistics, including statistical parsing, anaphora resolution, induction of valence lexica, etc. More recently, researchers have experimented with extracting semantic information from syntactically annotated data. Here, treebank data have been used for the purposes of identifying selectional preference...
Exploring the value space of attributes: Unsupervised bidirectional clustering of adjectives in German
The paper presents an iterative bidirectional clustering of adjectives and nouns based on a co-occurrence matrix. The clustering method combines a Vector Space Model (VSM) with the results of a Latent Dirichlet Allocation (LDA), which are merged in each iterative step. The aim is to derive a clustering of German adjectives that reflects latent semantic classes of adjectives, and that can...