lexical clusters

نتایج جستجو برای: lexical clusters

تعداد نتایج: 143359 فیلتر نتایج به سال:

Automatic Acquisition of Adjacent Information and Its Effectiveness in Extraction of Bilingual Word Pairs from Parallel Corpora

2005

Hiroshi Echizen-ya Kenji Araki Yoshio Momouchi

Wednesday, June 15 8:00 Conference Registration (Registration desk) 8:45 Session 1: Large-Scale Online Linguistic Resources (I) Chair: Rafa Muñoz "Text Categorization Based on Subtopic Clusters" Francis Chik, Robert Luk, Korris Chung "Automatic Filtering of Bilingual Corpora for Statistical Machine Translation" Shahram Khadivi, Hermann Ney "The Role of Word Sense Disambiguation in Automated Tex...

متن کامل

Clustering Adjectives for Class Discovery

2003

Gemma Boleda Laura Alonso Alemany

This paper presents an exploratory data analysis in lexical acquisition for adjective classes using clustering techniques. From a theoretical point of view, this approach provides large-scale empirical evidence for a sound classification. From a computational point of view, it helps develop a reliable automatic subclassification method. Results show that the features used in theoretical work ca...

متن کامل

Phonological variation: epenthesis and deletion of schwa in Dutch

1996

Cecile T. L. Kuijpers Wilma van Donselaar Anne Cutler

Two types of phonological variation in Dutch, resulting from optional rules, are schwa epenthesis and schwa deletion. In a lexical decision experiment it was investigated whether the phonological variants were processed similarly to the standard forms. It was found that the two types of variation patterned differently. Words with schwa epenthesis were processed faster and more accurately than t...

متن کامل

Improving a Pipeline Architecture for Shallow Discourse Parsing

2015

Yangqiu Song Haoruo Peng Parisa Kordjamshidi Mark Sammons Dan Roth

We present a system that implements an end-to-end discourse parser. The system uses a pipeline architecture with seven stages: preprocessing, recognizing explicit connectives, identifying argument positions, identifying and labeling arguments, classifying explicit and implicit connectives, and identifying attribution structures. The discourse structure of a document is inferred based on these c...

متن کامل

Semantics-based Multiword Expression Extraction

2007

Tim Van de Cruys Begoña Villada Moirón

This paper describes a fully unsupervised and automated method for large-scale extraction of multiword expressions (MWEs) from large corpora. The method aims at capturing the non-compositionality of MWEs; the intuition is that a noun within a MWE cannot easily be replaced by a semantically similar noun. To implement this intuition, a noun clustering is automatically extracted (using distributio...

متن کامل

Semantic Clustering of Search Engine Results

2015

Sara Saad Soliman Maged F. El-Sayed Yasser F. Hassan

This paper presents a novel approach for search engine results clustering that relies on the semantics of the retrieved documents rather than the terms in those documents. The proposed approach takes into consideration both lexical and semantics similarities among documents and applies activation spreading technique in order to generate semantically meaningful clusters. This approach allows doc...

متن کامل

Cross-Domain Bootstrapping for Named Entity Recognition

2011

Ang Sun Ralph Grishman

We propose a general cross-domain bootstrapping algorithm for domain adaptation in the task of named entity recognition. We first generalize the lexical features of the source domain model with word clusters generated from a joint corpus. We then select target domain instances based on multiple criteria during the bootstrapping process. Without using annotated data from the target domain and wi...

متن کامل

Two-year-olds’ acquisition of the possessive morpheme: An acoustic analysis

2012

Kiri Mealings Katherine Demuth

Previous research shows that 2-year-olds’ production of third person singular -s, but not plural -s, is affected by coda complexity, though both are more accurately produced in durationally longer utterance-final compared to utterancemedial position. This study explores these effects with possessive -s. Acoustic analysis of 10 two-years-olds’ elicited imitations examined children’s use of simpl...

متن کامل

Author Diarization Using Cluster-Distance Approach

2016

Abdul Sittar Hafiz Rizwan Iqbal Rao Muhammad Adeel Nawab

Author Diarization is a new task introduced in PAN’16, to identify portion(s) of text with in a document written by multiple authors. This paper presents, our proposed approach for author diarization task. Various types of stylistic features which include lexical features, used to uniquely identify an author. Furthermore, to find anomalous text with in a single document, ClustDist method used. ...

متن کامل

L’opposition massif/comptable au niveau lexical et supra-lexical

Journal: :SHS Web of Conferences 2014

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید