Search results for: web page classification

Number of results: 749611

Journal: Studies in health technology and informatics 2015
Célia Boyer Ljiljana Dolamic Natalia Grabar

The authors evaluated supervised automatic classification algorithms for determining whether health-related web pages comply with individual HONcode criteria of conduct, representing healthcare web page documents as character n-gram vectors of varying length. The training/testing collection comprised web page fragments extracted by HONcode experts during the manual certification process. The authors...
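As an illustration of the varying-length character n-gram representation mentioned in the abstract above, here is a minimal sketch. The function name `char_ngrams` and the length range (`n_min`, `n_max`) are assumptions made for this example; the abstract does not state which lengths, weighting, or classifiers the authors used.

```python
from collections import Counter


def char_ngrams(text, n_min=2, n_max=4):
    """Count every character n-gram of length n_min..n_max in text.

    Hypothetical helper: the length range is an assumption, not a value
    taken from the paper.
    """
    counts = Counter()
    for n in range(n_min, n_max + 1):
        # Slide a window of width n across the text.
        for i in range(len(text) - n + 1):
            counts[text[i:i + n]] += 1
    return counts


# Sparse count vector for a short page fragment.
vector = char_ngrams("privacy policy", n_min=2, n_max=3)
```

The resulting counts could be passed to any supervised classifier that accepts sparse feature vectors.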

2011
P. Malarvizhi Ramachandra V. Pujeri

The web is a large repository of information, and categorizing web documents is essential to facilitate the search and retrieval of its pages. An effective means of handling the complexity of information retrieval from the internet is the automatic classification of web pages. Although many automatic classification algorithms and systems have been presented, most of the existing a...

Journal: New Review of Hypermedia and Multimedia 2002

2009
Jane E. Mason Michael A. Shepherd Jack Duffy

The research reported in this paper is part of a larger project on the automatic classification of Web pages by their genres, using a distance function classification model. In this paper, we investigate the effect of several commonly used data preprocessing steps, explore the use of byte and word n-grams, and test our classification model on three Web page data sets. Our approach is to represe...

2006
Benjamin N. Waber John J. Magee Margrit Betke

We present a highly accurate method for classifying web pages based on link percentage, which is the percentage of text characters that are parts of links normalized by the number of all text characters on a web page. K-means clustering is used to create unique thresholds to differentiate index pages and article pages on individual web sites. Index pages contain mostly links to articles and ot...
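The link-percentage definition in the abstract above can be sketched with Python's standard-library HTML parser. This is an illustration under stated assumptions, not the authors' implementation: the names `LinkTextCounter` and `link_percentage` are hypothetical, whitespace is excluded from the character counts, and text inside `<script>` or `<style>` elements is not filtered out. The K-means thresholding step is not covered.

```python
from html.parser import HTMLParser


class LinkTextCounter(HTMLParser):
    """Tally text characters inside and outside <a> elements."""

    def __init__(self):
        super().__init__()
        self.depth = 0        # number of currently open <a> tags
        self.link_chars = 0   # text characters that are part of links
        self.total_chars = 0  # all text characters on the page

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == "a" and self.depth > 0:
            self.depth -= 1

    def handle_data(self, data):
        # Assumption: only non-whitespace characters are counted.
        n = len("".join(data.split()))
        self.total_chars += n
        if self.depth > 0:
            self.link_chars += n


def link_percentage(html):
    """Return link characters as a percentage of all text characters."""
    parser = LinkTextCounter()
    parser.feed(html)
    parser.close()
    if parser.total_chars == 0:
        return 0.0
    return 100.0 * parser.link_chars / parser.total_chars


page = '<p>abcd<a href="x">efgh</a></p>'
print(link_percentage(page))
```

Under this definition, a page consisting mostly of link text yields a high value and a page consisting mostly of body text yields a low one.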

2015
Xiao-Yuan Jing Qian Liu Fei Wu Baowen Xu Yang-Ping Zhu Songcan Chen

Web page classification has attracted increasing research interest. It is intrinsically a multi-view and semi-supervised application, since web pages usually contain two or more types of data, such as text, hyperlinks and images, and unlabeled pages generally far outnumber labeled ones. Web page data is commonly high-dimensional. Thus, how to extract useful features from this kind of data ...

2006
Benjamin N. Waber

We present a highly accurate method for classifying web pages based on link percentage, which is the percentage of text characters that are parts of links normalized by the number of all text characters on a web page. K-means clustering is used to create unique thresholds to differentiate index pages and article pages on individual web sites. Index pages contain mostly links to articles and ot...

Journal: Transactions of the Japanese Society for Artificial Intelligence 2010

Journal: Journal of Japan Society for Fuzzy Theory and Intelligent Informatics 2006
