Practical Research Projects - Project Description

Project Description

This is a description of a Practical Research Project associated with the PREP course

Project Title

Survival rate of HTML elements

Quarter

Q2

Responsible

Jakob G. Thomsen.

Second level advisor: Erik Ernst.

Aims

Most web pages are not constant in that the layout, the structure or the content often changes over time. These changes have the effect that automatically retrieving data from web pages are not at all trivial, as it often breaks down, when the web page changes. The aim of this prep project is therefore to better understand these changes, by quantifying the change rate of elements of a given web page.

The expected outcome is an experiment, that will be able for a given set of webpage versions to quantify the survival rate of the HTML tags on the webpage.

Learning Outcome

The student will gain insight into the process of research by setting up his/her own experiment that will test the given thesis. Furthermore the student will learn to read and understand technical scientific papers as a part of the project is to read how other researchers have investigated the area.

Requirements

dWebTek, AWT or SWP/CWP (recommended)