Clustering exercise

Read the problem description.


All the data here (except the GO categories) has been fetched from Ulitsky et al. However, the data files are not the original ones but a subsample that has been processed into a form that is easy to read in Matlab. These files do not even contain the names of the genes or of the GO classes, so doing any biology with them is impossible. I will later add versions that can be used in the project work.

Use "save as" from the browser to save these to your own folder.

Matlab-related stuff


You can take a look at run_cluster_solution.m if you couldn't finish the exercise. It is by no means a comprehensive treatment of the problem, but it has the commands for running the clusterings as well as an attempt at some external validation using GO classes.

