Using Spam Farm to Boost PageRank

Abstract

People have become increasingly dependent on search engines such as Google, Yahoo, and MSN for their information needs. Web spamming has emerged to take economic advantage of high search rankings, and it threatens the accuracy and fairness of those rankings. Understanding spamming techniques is essential for evaluating the strengths and weaknesses of a ranking algorithm and for fighting web spamming. In this paper, we identify the optimal spam farm structure under some realistic assumptions in the single-target spam farm model. Our result contradicts the optimal spam farm claimed in [7], the proof of which is fundamentally flawed. We also characterize the optimal spam farms under additional realistic constraints, which a spammer may adopt to disguise the spam farm by deviating from the unconstrained optimal structure. In simulations, we show that the optimal spam farm can significantly boost the PageRank score of a target page. In particular, the boosting effect is more significant for target pages with low PageRank scores. Furthermore, by using web pages with higher PageRank scores as boosting pages, the spammer can obtain a stronger boosting effect.
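As a rough illustration of the boosting effect described in the abstract, the following Python sketch runs a plain power-iteration PageRank on a tiny graph before and after adding boosting pages that link to a target. It is only a sketch under stated assumptions: the damping factor of 0.85, the four-page graph, the page names, and the helper `with_spam_farm` are all hypothetical, and the farm shown (every boosting page links only to the target) is a simple example, not the optimal structure the paper identifies.

```python
def pagerank(out_links, damping=0.85, iterations=100):
    """Power-iteration PageRank.

    out_links maps each page to the list of pages it links to.
    The damping factor 0.85 is an assumed, commonly used value.
    """
    pages = sorted(out_links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Rank held by pages without out-links is spread uniformly.
        dangling = sum(rank[p] for p in pages if not out_links[p])
        new_rank = {p: (1.0 - damping) / n + damping * dangling / n for p in pages}
        for p in pages:
            for q in out_links[p]:
                new_rank[q] += damping * rank[p] / len(out_links[p])
        rank = new_rank
    return rank


def with_spam_farm(out_links, target, k):
    """Return a copy of the graph with k hypothetical boosting pages.

    Each boosting page links only to the target (an illustrative farm,
    not the optimal structure from the paper).
    """
    farm = {p: list(qs) for p, qs in out_links.items()}
    for i in range(k):
        farm[f"boost{i}"] = [target]
    return farm


# Hypothetical toy web: the target "t" has no incoming links.
web = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "t": ["a"]}

before = pagerank(web)["t"]
after = pagerank(with_spam_farm(web, "t", 5))["t"]
print(f"target score before: {before:.4f}, after: {after:.4f}")
```

In this toy setting the target starts with no incoming links, so it holds only the baseline score; adding boosting pages raises its score even though the total rank is now shared among more pages.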

Similar resources

Boosting Strategies of Link Spamming

People rely more and more on search engines such as Google, Yahoo, and MSN for their information needs. Web spamming has emerged to take economic advantage of high search engine rankings. Among the various web spamming techniques, link spamming boosts the target page by manipulating the links in the web graph. In this paper, we first point out some weak ...

Mining Page Farms and Its Application in Link Spam Detection

Understanding the general relations of Web pages and their environments is important with a few interesting applications such as Web spam detection. In this thesis, we study the novel problem of page farm mining and its application in link spam detection. A page farm is the set of Web pages contributing to (a major portion of) the PageRank score of a target page. We show that extracting page fa...

Closure Operators and Spam Resistance for PageRank

We study the spammability of ranking functions on the web. Although graph-theoretic ranking functions such as Hubs and Authorities and PageRank exist, there is no graph-theoretic notion of how spammable such functions are. We introduce a very general cost model that only depends on the observation that changing the links of a page that you own is free, whereas changing the links on pages owne...

SpamRank -- Fully Automatic Link Spam Detection

Spammers intend to increase the PageRank of certain spam pages by creating a large number of links pointing to them. We propose a novel method based on the concept of personalized PageRank that detects pages with an undeserved high PageRank value without the need of any kind of white or blacklists or other means of human intervention. We assume that spammed pages have a biased distribution of p...

DirichletRank: Ranking Web Pages Against Link Spams

Anti-spamming has become one of the most important challenges for web search engines and has recently attracted increasing attention in both industry and academia. Since most search engines now use link-based ranking algorithms, link-based spamming has become a major threat. In this paper, we show that the popular link-based ranking algorithm PageRank, while being successfully used in the Google s...

Publication date: 2006