Yahoo fighting spam

Combating Web Spam with TrustRank

Web spam pages use various techniques to achieve higher-than-deserved rankings in a search engine's results. While human experts can identify spam, it is too expensive to manually evaluate a large number of pages.

Link-based spam detection

A computer-implemented method of ranking search hits in a search result
set. The computer-implemented method includes receiving a query from a
user and generating a list of hits related to the query, where each of
the hits has a relevance to the query, where the hits have one or more
boosting linked documents pointing to the hits, and where the boosting
linked documents affect the relevance of the hits to the query. The
method associates a metric to each of at least a subset of the hits, the
metric being representative of the number of boosting linked documents
that point to each of at least a subset of the hits and which
artificially inflate the relevance of the hits. The method then compares
the metric, which is representative of the size of a spam farm pointing
to the hit, with a threshold value, processes the list of hits to form a
modified list based in part on the comparison, and transmits the modified
list to the user.
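
The compare-and-modify step described above can be sketched in a few lines
of Python. This is only an illustration under stated assumptions, not the
patented implementation: the Hit record, the boosting_estimate field, the
process_hits function, and the demotion factor are all hypothetical names,
and the abstract does not say how the metric is computed or whether
flagged hits are demoted or removed. Here a hit whose metric exceeds the
threshold has its relevance scaled down, and the list is then re-sorted.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    """One search hit (hypothetical record, not from the source)."""
    url: str
    relevance: float          # relevance of the hit to the query
    boosting_estimate: float  # assumed metric: estimated number of boosting documents


def process_hits(hits, threshold, demotion=0.5):
    """Form a modified hit list based on a metric/threshold comparison.

    Hits whose boosting_estimate exceeds `threshold` are treated as likely
    spam-farm targets and have their relevance multiplied by `demotion`
    (an assumed policy; the source only says the list is processed
    "based in part on the comparison").
    """
    modified = []
    for hit in hits:
        if hit.boosting_estimate > threshold:
            # Demote the hit instead of trusting its inflated relevance.
            hit = Hit(hit.url, hit.relevance * demotion, hit.boosting_estimate)
        modified.append(hit)
    # Re-rank by the adjusted relevance, highest first.
    return sorted(modified, key=lambda h: h.relevance, reverse=True)


if __name__ == "__main__":
    hits = [
        Hit("spam.example", 0.9, 500.0),
        Hit("good.example", 0.8, 3.0),
        Hit("other.example", 0.5, 0.0),
    ]
    for hit in process_hits(hits, threshold=100.0):
        print(hit.url, round(hit.relevance, 2))
```

Setting demotion to 0 pushes every flagged hit to the bottom of the list;
dropping flagged hits entirely would be an equally plausible reading of
the abstract.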

