Research » Web Spam Detection » Datasets » UK-2007 » Credits

WEBSPAM-UK2007 Credits

Assessments

A group of volunteers contributed their time and work during the assessment phase, labeling hundreds of hosts each:

Organization

This task was organized by:

UK crawl data

The base data is a set of 105,896,555 pages in 114,529 hosts in the .UK domain. The data was downloaded in May 2007 by the Laboratory of Web Algorithmics, Università degli Studi di Milano, with the support of the DELIS EU - FET research project.

For inquiries contact Carlos Castillo