This is the homepage for the Mia Crawler.

Course Project

Implementing the Mia Search Engine is a project in the course Algorithms for Web Indexing and Searching, Fall 2002, at the Department of Computer Science, University of Aarhus.

Crawler implementation

The Mia Crawler is restricted to .dk domains. It reads robots.txt and is intended to respect the Robots Exclusion Protocol. It does not, however, respect the Robots META tag. The crawler implements politeness by not revisiting the same domain within 5 seconds.
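
The three rules above (the .dk restriction, the robots.txt check, and the 5-second politeness delay) can be illustrated with a short Python sketch. This is not the Mia Crawler's actual code. The user-agent string, the function and class names, and the choice to treat each host name as a "domain" are assumptions made only for illustration.

```python
import time
from urllib.parse import urlsplit
from urllib.robotparser import RobotFileParser

USER_AGENT = "MiaCrawler"   # hypothetical user-agent string
POLITENESS_DELAY = 5.0      # seconds between visits, as stated above


def host_of(url):
    """Return the lower-cased host name of a URL ('' if it has none)."""
    return (urlsplit(url).hostname or "").lower()


def in_dk_domain(url):
    """True if the URL's host lies under the .dk top-level domain."""
    return host_of(url).endswith(".dk")


def allowed_by_robots(robots_lines, url, user_agent=USER_AGENT):
    """Check a URL against the lines of an already downloaded robots.txt."""
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return parser.can_fetch(user_agent, url)


class PolitenessTracker:
    """Remembers when each host was last visited.

    Assumption: a 'domain' is identified by its full host name.
    """

    def __init__(self, delay=POLITENESS_DELAY, clock=time.monotonic):
        self.delay = delay
        self.clock = clock
        self.last_visit = {}

    def wait_time(self, url):
        """Seconds still to wait before the URL's host may be revisited."""
        last = self.last_visit.get(host_of(url))
        if last is None:
            return 0.0
        return max(0.0, self.delay - (self.clock() - last))

    def record_visit(self, url):
        """Note that the URL's host was fetched just now."""
        self.last_visit[host_of(url)] = self.clock()
```

In a crawl loop, a URL would be fetched only if in_dk_domain and allowed_by_robots both return True and wait_time returns 0. After the fetch, record_visit is called. Passing the clock in as a parameter keeps the tracker testable without real waiting.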

Running search engine

A running copy of the Mia search engine is now available at http://mia.lir.dk/.

Please note that the "local copy" feature is only available from within the DAIMI network because of firewall restrictions.

Contact

You can contact the Mia Crawler Project Group by mail: thal+mia@daimi.au.dk