The project of implementing the Mia Search Engine is part of the course Algorithms for Web Indexing and Searching, Fall 2002 at the Department of Computer Science at the University of Aarhus.
The Mia Crawler is restricted to .dk domains. The crawler will read robots.txt and hopefully respect The Robots Exclusion Protocol.The crawler however does not respect The Robots META tag. The crawler implements politeness by not revisiting the same domain within 5 seconds.
A running copy of the Mia search engine is now available at http://mia.lir.dk/.
Please note that the "local copy" feature is only available from within the daimi network, due to firewalling.
You can contact the Mia Crawler Project Group by mail: thal+mia@daimi.au.dk