نتایج جستجو برای: text documents

تعداد نتایج: 222232  

Journal: :International Journal of Computers Communications & Control 2013

Journal: :Eastern-European Journal of Enterprise Technologies 2014

Journal: :Proceedings of the American Society for Information Science and Technology 2013

1998
Kamal Nigam Andrew McCallum Sebastian Thrun Tom M. Mitchell

In many important text classification problems, acquiring class labels for training documents is costly, while gathering large quantities of unlabeled data is cheap. This paper shows that the accuracy of text classifiers trained with a small number of labeled documents can be improved by augmenting this small training set with a large pool of unlabeled documents. We present a theoretical argume...

2001
Karsten Winkler Myra Spiliopoulou

Public archives contain large and continuously growing volumes of electronically available text documents. In many countries, public authorities are required by law to publish certain data to satisfy the information needs of the general public. In contrast to plain text documents, semantically tagged XML documents along with appropriate query languages largely facilitate searching and browsing ...

2006
Mikaela Keller Samy Bengio

Text categorization is intrinsically a supervised learning task, which aims at relating a given text document to one or more predefined categories. Unfortunately, labeling such databases of documents is a painful task. We present in this paper a method that takes advantage of huge amounts of unlabeled text documents available in digital format, to counter balance the relatively smaller availabl...

2001
MohammadTaghi Hajiaghayi

Documents can be represented as structures with a hierarchial arrangement of text and non-text nodes, where nodes are labeled by category names such as paragraph and section. Representing documents this way is a natural consequence of using the Standard Generalized Markup Language(SGML) to encode text documents which has many applications in different areas. There are many circumstances in whic...

2014
M S Patil

Due to increasing use of internet and online technologies or online data, there is vast increase in the electronic documents. When a data is being retrieved from such a huge collection of electronic documents, hundreds and thousands of documents are retrieved. Hence, for user, it is not possible to read all the retrieved documents. Also, these documents contain redundant information. In such si...

Journal: :Bulletin of the National Technical University «KhPI» Series: New solutions in modern technologies 2017

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید