Word level precision of the NALIGN automatic segmen- tation algorithm
نویسندگان
چکیده
This work presents an evaluation of the word level precision of an automatic segmentation algorithm, NALIGN. Measurements of the portion of temporal overlap between automatic and manual word level segmentations show that a precision of 90 to 95% can be achieved, partly depending on the type of speech material. These precision figures are furthermore only marginally lower than the gold standard obtained comparing independent manual segmentations.
منابع مشابه
Unsupervised Segmentation of Chinese Text by Use of Branching Entropy
We propose an unsupervised segmen-tation method based on an assumption about language data: that the increasing point of entropy of successive characters is the location of a word boundary. A large-scale experiment was conducted by using 200 MB of unseg-mented training data and 1 MB of test data, and precision of 90% was attained with recall being around 80%. Moreover , we found that the precis...
متن کاملAutomatic Segmentation of Brain Tissues and MR Bias Field Correction Using a Cigital Brain Atlas
This paper 1 proposes a method for fully automatic segmen-tation of brain tissues and MR bias eld correction using a digital brain atlas. We have extended the EM segmentation algorithm, including an explicit parametric model of the bias eld. The algorithm interleaves classiication with parameter estimation, yielding better results at every iteration. The method can handle multi-channel data and...
متن کاملMining Web data for Chinese segmentation
within documents as indexing terms for search of relevant documents. As Chinese is an ideographic character-based language, the words in the texts are not delimited by white spaces. Indexing of Chinese documents is impossible without a proper segmentation algorithm. Many Chinese segmentation algorithms have been proposed in the past. Traditional segmentation algorithms cannot operate without a ...
متن کاملUnsupervised Learning of Word SegmentationRules with
This article presents a combination of unsupervised and supervised learning techniques for the generation of word segmentation rules from a raw list of words. First, a language bias for word segmen-tation is introduced and a simple genetic algorithm is used in the search for segmentation that corresponds to the best bias value. In the second phase, the words segmented by the genetic algorithm a...
متن کاملSensor-oriented feature usability evaluation in fingerprint segmentation
Existing fingerprint segmentation methods usually process fingerprint images captured by different sensors with the same feature or feature set. We propose to improve the fingerprint segmentation result in view of an important fact that images from different sensors have different characteristics for segmentation. Feature usability evaluation, which means to evaluate the usability of features t...
متن کامل