Multi-Syllable Phonotactic Modelling

نویسنده

  • Anja Belz
چکیده

This paper describes a novel approach to constructing phonotactic models. The underlying theoretical approach to phonological description is the multi-syllable approach in which multiple syllable classes are deened that reeect phonotactically idiosyncratic syllable subcategories. A new nite-state formalism, ofs Modelling, is used as a tool for encoding, automatically constructing and generalising phonotac-tic descriptions. Language-independent prototype models are constructed which are instantiated on the basis of data sets of phonological strings, and gener-alised with a clustering algorithm. The resulting approach enables the automatic construction of phono-tactic models that encode arbitrarily close approximations of a language's set of attested phonological forms. The approach is applied to the construction of multi-syllable word-level phonotactic models for German, English and Dutch.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Language Independent Approach To Acquiring Phonotactic Resources for Speech Recognition

Building and developing linguistic resources for languages is of prime importance with many areas of application. This paper focusses on a fully automatic approach to the aquisition of a syllable phonotactics for a particular language. In this approach the phonotactic constraints for a language are encoded in a finite-state phonotactic automaton the structure of which can be automatically deriv...

متن کامل

Improving Syllabification Models with Phonotactic Knowledge

We report on a series of experiments with probabilistic context-free grammars predicting English and German syllable structure. The treebank-trained grammars are evaluated on a syllabification task. The grammar used by Müller (2002) serves as point of comparison. As she evaluates the grammar only for German, we reimplement the grammar and experiment with additional phonotactic features. Using b...

متن کامل

Phonotactic and prosodic effects on word segmentation in infants.

This research examines the issue of speech segmentation in 9-month-old infants. Two cues known to carry probabilistic information about word boundaries were investigated: Phonotactic regularity and prosodic pattern. The stimuli used in four head turn preference experiments were bisyllabic CVC.CVC nonwords bearing primary stress in either the first or the second syllable (strong/weak vs. weak/st...

متن کامل

On the syllable structures of Chinese relating to speech recognition

It is well known that Chinese is a tone language with multi-tone system, but the distinctive syllable structures relating to speech recognition have not brought to phoneticians' attention yet. The syllable structures, the phonotactic rules were discussed and the joint probability of the initials and the finals were given in this paper. A comparative study of the relative information transmitted...

متن کامل

Acquiring Reusable Multilingual Phonotactic Resources

This paper presents a fully automatic procedure for acquiring reusable phonotactic resources from syllable annotated data. The procedure makes use of a regular inference algorithm and the acquired resources are stored in a specialised XML representation. The technique is then extended to support acquisition from phoneme labelled data while providing a semi-automatic annotation system assisting ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره cs.CL/0102020  شماره 

صفحات  -

تاریخ انتشار 2000