Search results for: web page classification

Number of results: 749611

2003
Paul Larson, Masaru Kitsuregawa

Because much of the information on the web is presented in some sort of regular, repeated format, “understanding” web pages often requires recognizing and using structure, where structure is typically defined by hyperlinks between pages and HTML formatting commands within a page. We survey some of the ways in which structure within a web page can be used to help machines understand pages. Speci...

Journal: IEEE Data Eng. Bull. 2003
William W. Cohen

Because much of the information on the web is presented in some sort of regular, repeated format, “understanding” web pages often requires recognizing and using structure, where structure is typically defined by hyperlinks between pages and HTML formatting commands within a page. We survey some of the ways in which structure within a web page can be used to help machines understand pages. Speci...

Journal: Procesamiento del Lenguaje Natural 2011
Arkaitz Zubiaga, Raquel Martínez-Unanue, Víctor Fresno-Fernández

The lack of representative textual content in many web documents suggests the study of additional metadata to improve web page classification tasks. Social bookmarking sites provide an accessible way to increase available metadata in large amounts with user-provided annotations. This field remains relatively unexplored. In this work, we analyze the usefulness of social annotations for web page ...

Journal: International Journal of Advanced Engineering, Management and Science 2017

2013
B. Leeladevi, A. Sankar

Web page classification is achieved using text classification techniques. Web page classification differs from traditional text classification because of the additional information provided by web page structure, which says much about content importance. HTML tags provide visual web page representation and can be considered a parameter to highlight content importance. Textual keywords...

Journal: Int. J. of Asian Lang. Proc. 2010
Ji-bin Zhang, Zhi-ming Xu, Kun-li Xiu, Qi-shu Pan

Automatic web site classification has wide application prospects; however, there has been little research on it. Unlike pure texts, web sites are combinations of a large number of web pages connected via hyperlinks, so text classification methods are not suitable for classifying them directly. This paper proposes a web site classification approach based on its topological structure. Given a web site, f...
