Yahoo fighting spam Combating Web Spam with TrustRank

Web spam pages use various techniques to achieve higher-than-deserved rankings in a search engine’s results. While human experts can identify spam, it is too expensive to manually evaluate a large number of pages …. read on

full text .pdf

Link-based spam detection

A computer implemented method of ranking search hits in a search result
set. The computer-implemented method includes receiving a query from a
user and generating a list of hits related to the query, where each of
the hits has a relevance to the query, where the hits have one or more
boosting linked documents pointing to the hits, and where the boosting
linked documents affect the relevance of the hits to the query. The
method associates a metric to each of at least a subset of the hits, the
metric being representative of the number of boosting linked documents
that point to each of at least a subset of the hits and which
artificially inflate the relevance of the hits. The method then compares
the metric, which is representative of the size of a spam farm pointing
to the hit, with a threshold value, processes the list of hits to form a
modified list based in part on the comparison, and transmits the modified
list to the user.

read on

Technorati Tags: , , , ,

Advertisements

Leave a Reply

Please log in using one of these methods to post your comment:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: