Detecting Spam Web Pages Through Content Analysis
Overview: This paper, from the International World Wide Web Conference Committee, investigates "web spam": the injection of artificially created pages into the web in order to influence search engine results and drive traffic to certain pages for fun or profit. It considers some previously undescribed techniques for automatically detecting spam pages and examines the effectiveness of these techniques both in isolation and when aggregated using classification algorithms.