Search results for: text segmentation

Number of results: 227918

Journal: Journal of Computer and Robotics
Ali Ghafari-Beranghar (Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran); Ehsanollah Kabir (Department of Electrical and Computer Engineering, Tarbiat Modarres University, Tehran, Iran); Kaveh Kangarloo (Department of Electrical Engineering, Central Tehran Branch, Islamic Azad University, Tehran, Iran)

City maps are among the complex documents found in the real world. In these maps, text labels overlap with graphics and appear in a variety of fonts, styles, and orientations. Text and graphic colours are usually not predefined, because they vary between map publishers. In most city maps, text and graphic lines form a single connected component. Moreover, the common regions of text and graphic lin...

Journal: TAL 2006
Olivier Ferret

Topic segmentation was addressed by a large amount of work from which it is not easy to draw conclusions, especially about the need for knowledge. In this article, we propose in the same framework two methods for improving the results of a topic segmenter based on lexical reiteration. The first one is endogenous and exploits the distributional similarity of the words of a document for discoveri...

2009
Olivier Ferret

Topic segmentation was addressed by a large amount of work from which it is not easy to draw conclusions, especially about the need for knowledge. In this article, we propose to combine in the same framework two methods for improving the results of a topic segmenter based on lexical reiteration. The first one is endogenous and exploits the distributional similarity of words in a document for di...

Journal: International Journal of Computer Applications 2013

Journal: Sistemas y Telemática 2016

In this paper, we present a document analysis and classification system that segments and classifies the contents of Arabic document images. The system includes preprocessing, document segmentation, feature extraction, and document classification. In preprocessing, a document image is enhanced by removing noise, binarizing it, and detecting and correcting image skew. In document segmentation, an algorith...

Journal: CoRR 1994
Pim van der Eijk

A quantitative representation of discourse structure can be computed by measuring lexical cohesion relations among adjacent text elements. These representations have previously been proposed to deal with sub-topic text segmentation. In a parallel corpus, similar representations can be derived for versions of a text in various languages. These can be used for parallel segmentation and as an alte...

2013
Paula Christina Figueira Cardoso, Maite Taboada, Thiago Alexandre Salgueiro Pardo

In this paper, we describe novel methods for topic segmentation based on patterns of discourse organization. Using a corpus of news texts, our results show that it is possible to use discourse features (based on Rhetorical Structure Theory) for topic segmentation and that we outperform some well-known methods.

2005
Guohong Fu, Kang-Kwong Luke, Percy Ping-Wai Wong

In this paper, we briefly describe our system for the Second International Chinese Word Segmentation Bakeoff sponsored by the ACL-SIGHAN. We participated in all tracks at the bakeoff. The evaluation results show that our system can achieve an F-measure of 0.940-0.967 for different testing corpora.
