Search results for: post text

Number of results: 564440

2016
Maria Simi Giuseppe Attardi

TANL is a suite of tools for text analytics based on the software architecture paradigm of data-driven pipelines. The strategies for upgrading TANL to the use of Universal Dependencies range from a minimalistic approach consisting of introducing pre/post-processing steps into the native pipeline to revising the whole pipeline. We explore the issue in the context of the Italian Treebank, conside...

2002
G S Lehal Chandan Singh

A post-processing system for OCR of Gurmukhi script has been developed. Statistical information on Punjabi language syllable combinations, corpora look-up and certain heuristics based on Punjabi grammar rules have been combined to design the post-processor. An improvement of about 3 percentage points in recognition rate, from 94.35% to 97.34%, has been reported on clean images using the post-processing techniques.

2009
Marcos Garcia Pablo Gamallo

Many of the errors produced by up-to-date POS-taggers could be considered morphological, syntactic or semantic. Since statistical tagging does not deal with semantic ambiguity, the correction of (morpho)syntactic errors emerges as one of the possibilities to improve the accuracy of this task. This work describes a method that applies a robust parser with correction rules over a POS-tagging outp...

Journal: J. Economic Theory 2009
Alex Gershkov Balázs Szentes

A group of individuals with identical preferences must make a decision under uncertainty about which decision is best. Before the decision is made, each agent can privately acquire a costly and imperfect signal. We discuss how to design a mechanism for eliciting and aggregating the collected information so as to maximize ex-ante social welfare. We first show that, of all mechanisms, a sequentia...

2016
Elvis Koci Maik Thiele Oscar Romero Wolfgang Lehner

Spreadsheet applications are one of the most used tools for content generation and presentation in industry and the Web. In spite of this success, there does not exist a comprehensive approach to automatically extract and reuse the richness of data maintained in this format. The biggest obstacle is the lack of awareness about the structure of the data in spreadsheets, which otherwise could prov...

2005
Motoko Ueyama

A particularly promising approach to the use of the Web for linguistic research is to build corpora via automated queries to search engines, retrieving and post-processing the pages found in this way (Ghani et al. 2003, Baroni and Bernardini 2004, Sharoff to appear). This approach differs from the traditional method of corpus construction, where one needs to spend considerable time finding and ...

2016
Peng Wang Lifeng Sun Shiqiang Yang Alan F. Smeaton

Indexing of visual media based on content analysis has now moved beyond using individual concept detectors and there is now a focus on combining concepts or post-processing the outputs of individual concept detection. Due to the limitations and availability of training corpora which are usually sparsely and imprecisely labeled, training-based refinement methods for semantic indexing of visual m...

2012
Bei Shi Xianpei Han Le Sun

State-of-the-art Chinese word segmentation systems have achieved high performance on well-formed long documents. However, segmentation of microblog text is difficult due to the noise problem and the OOV problem. In this paper, we present a Chinese Micro-Blog Segmentation system for the CIPS-SIGHAN Word Segmentation Bakeoff 2012 track. The proposed system adopts a cascaded approach which conta...

2010
Xiaoming Xu Muhua Zhu Xiaoxu Fei Jingbo Zhu

For the Chinese word segmentation competition held at the first CIPS-SIGHAN joint conference, we applied a subword-based word segmenter using CRFs and extended the segmenter with OOV words recognized by Accessor Variety. Moreover, we proposed several post-processing rules to improve the performance. Our system achieved promising OOV recall among all the participants.
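
As a rough illustration of the Accessor Variety criterion mentioned in this abstract, the sketch below scores each substring by the smaller of its counts of distinct left and right neighbouring characters. It is not the authors' system: the function name `accessor_variety`, the `max_len` cutoff, and the choice to count each sentence boundary as its own distinct neighbour are assumptions made for the example.

```python
from collections import defaultdict


def accessor_variety(corpus, max_len=4):
    """Score every substring of up to max_len characters.

    AV(s) = min(distinct left neighbours, distinct right neighbours).
    Assumption: a sentence start or end counts as a distinct neighbour
    for each sentence in which it occurs.
    """
    left = defaultdict(set)
    right = defaultdict(set)
    for sent_id, sent in enumerate(corpus):
        n = len(sent)
        for i in range(n):
            for j in range(i + 1, min(i + max_len, n) + 1):
                s = sent[i:j]
                # Character before the substring, or a per-sentence start marker.
                left[s].add(sent[i - 1] if i > 0 else ("<s>", sent_id))
                # Character after the substring, or a per-sentence end marker.
                right[s].add(sent[j] if j < n else ("</s>", sent_id))
    return {s: min(len(left[s]), len(right[s])) for s in left}


scores = accessor_variety(["abc", "xbcy", "bc"])
```

Substrings that appear in many different contexts receive high scores and are candidate words; a threshold on the score (not shown, and not specified in the abstract) would decide which candidates are added as OOV words.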

2015
Salha Alzahrani

This report explains our Arabic plagiarism detection system, which we used to submit our run to the AraPlagDetect competition at FIRE 2015. The system was constructed through four main stages. First is pre-processing, which includes tokenisation and stop-word removal. Second is retrieving a list of candidate documents for each suspicious document using K-gram fingerprinting and Jaccard coefficient....
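
The first two stages named in this abstract can be sketched as follows. This is a minimal illustration, not the submitted system: whitespace tokenisation, the default `k=3`, keeping every k-gram instead of a hashed or sampled fingerprint, and the names `kgrams`, `jaccard` and `rank_candidates` are all assumptions.

```python
def kgrams(tokens, k=3):
    """Return the set of contiguous k-token windows in a token list."""
    return {tuple(tokens[i:i + k]) for i in range(len(tokens) - k + 1)}


def jaccard(a, b):
    """Jaccard coefficient |A ∩ B| / |A ∪ B|; defined as 0.0 for two empty sets."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def rank_candidates(suspicious, sources, k=3, stop_words=frozenset()):
    """Rank source documents by k-gram Jaccard similarity to a suspicious text."""
    def prep(text):
        # Assumed pre-processing: whitespace tokenisation plus stop-word removal.
        return [t for t in text.lower().split() if t not in stop_words]

    sus = kgrams(prep(suspicious), k)
    scored = [(name, jaccard(sus, kgrams(prep(text), k)))
              for name, text in sources.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


ranking = rank_candidates(
    "the cat sat on the mat",
    {"d1": "the cat sat on the mat", "d2": "dogs bark loudly at night"},
    k=2,
)
```

The highest-scoring documents would then be passed on as candidates to the later stages, which the truncated abstract does not describe.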
