T-61.6030 Special Course in Computer and Information Science III P: Multimedia Retrieval

Project work

New: Added ground truth files for test set 2. Available here (4.4 MB).

The project work involves semantic image classification based on a set of precomputed low-level features. The selection of the used classification algorithm is free, but the provided examples apply support vector machines using the LIBSVM implementation. The deadline for the project work is May 19th, 2008.

Data

We use the image data from The PASCAL Visual Object Classes Challenge 2005, datasets 1 and 2. The data sets contain a total of 2329 images of motorbikes, bicycles, people, and cars in arbitrary pose. The set is divided into a training set of 684 images, and two test sets containing 689 and 956 images, respectively. You may download the original images from the challenge website for reference, although it is possible to do the project work entirely based on the provided low-level features.

The set of pre-computed low-level features are:

Name	Dim.	Description
MPEG-7 Color Layout	12	Spatial distribution of dominant colors in YCbCr color system.
MPEG-7 Scalable Color	256	256-bin color histogram in HSV color space, encoded by a Haar transform.
MPEG-7 Edge Histogram	80	Histogram of five different edge types.
IPLD histogram	256	Histogram of interest point features, detected using a combined Harris-Laplace and Difference-of-Gaussian detector.

The feature data can be found here in LIBSVM format (6.8 MB). The classwise ground truth for test set 2 is also available (4.4 MB). See the LIBSVM README file for a description of the format.

The file names of the images in correct order are given in the files trainset and testset1 for the training set and test set 1, respectively. The file testset2 contains the image labels given for the randomized test set 2.

For more details about the features, see the MPEG-7 overview and this ICCV 2003 paper by Gy. Dorkó and C. Schmid.

Experiments

The task is to train classifiers for the four concepts (motorbikes, bicycles, people, and cars) using the training set and then test the classifiers using the test sets. The examples shown in this section use the command-line version of LIBSVM, but you are free to use any other implementation or algorithm. In particular, there is a Matlab interface for LIBSVM available if you prefer to use Matlab. Also, you can use the included read_sparse function to read in LIBSVM format data files to Matlab and then use some other algorithm for the classification.

An example SVM classifier

In the following, we assume that you have downloaded (and compiled) LIBSVM, and have all the low-level feature files in a subdirectory voc2005 of the LIBSVM main directory. We assume here that you are using some variant of Unix/Linux that has Python and Gnuplot installed. Windows binaries for LIBSVM can be found in the windows subdirectory.

First, let us scale the training and test sets into the range [-1,1]:

$ cd voc2005 $ ../svm-scale -l -1 -u 1 -s EdgeHistogram-bicycles.train.range EdgeHistogram-bicycles.train > EdgeHistogram-bicycles.train.scale $ ../svm-scale -r EdgeHistogram-bicycles.train.range EdgeHistogram-bicycles.test1 > EdgeHistogram-bicycles.test1.scale

LIBSVM provides a tool for SVM parameter estimation using grid search and cross-validation:

$ ../tools/grid.py EdgeHistogram-bicycles.train.scale which outputs the best values it found for C and gamma. You may now focus the grid search to the neighborhood of the best values with options -log2c and -log2g.

Once we have the parameter values, we are ready to train the SVM classifier:

$ ../svm-train -c C -g gamma -b 1 EdgeHistogram-bicycles.train.scale EdgeHistogram-bicycles.train.model where C and gamma are the values found by grid.py. The option -b 1 is needed to get the probability estimates for the test images.

The classifier is tested using the images in the test set 1:

$ ../svm-predict -b 1 EdgeHistogram-bicycles.test1.scale EdgeHistogram-bicycles.train.model EdgeHistogram-bicycles.test1.output which prints out the overall accuracy and the individual probability estimates for the test images in EdgeHistogram-bicycles.test1.output.

Fundamentals

Train models for all four concepts using the given features. Report the results of test set 1 with the concept-wise and overall classification accuracy and average precision (see Chapter 13 in the course book). You may also use some other suitable information retrieval measures such as recall-precision curves or receiver operating characteristic (ROC) curves. For these measures, you need to look at the individual probability estimates in the .output files and sort the test set images accordingly.

For test set 2, return an ordered list of the images for each concept as separate text files, one image per line. The lists should contain the images that your method finds most probable for being relevant to the given concept, in decreasing order. The lists may or may not contain all 956 images, but the performance may suffer if you return only partial lists. We will then calculate the mean average precision for all submitted experiments and publish the results on the course web page. Submit the lists for only your best method or methods to Mats Sjöberg (mats.sjoberg (at) tkk.fi) before April, 21st, 2008.

Improvements

There are a number of ways you can extend your experiments:

The lack of positive examples. Adjust the classifier due to the greater number of negative examples, e.g. by removing some negative images from the training data or using the -w-1 option (see LIBSVM documentation).
Early fusion. Concatenate multiple features together before training the SVMs.
Late fusion. Combine results from multiple features together using a suitable function for the probability estimates or the retrieval ranks of the images.
Fault analysis. Look at the original images and try to analyse why certain images are classified incorrectly.
Four-class classification. Classify each test image into only the most probable class and compare the results.
Try other SVM kernels instead of the default RBF kernel.
Try other classification algorithms.
Something else.

Select and include at least two items from the list to your project.

Test set 2 results

The average precision results for submitted test set 2 runs are as follows:

id	Author	Brief comments	Bicycles	Motorbikes	People	Cars	Mean
0	Markus Koskela	Random permutation of `testset2`	0.2606	0.2201	0.3272	0.2118	0.2549
1	Mats Sjöberg	Best single feature according to test set 1	0.5119	0.4075	0.5709	0.5201	0.5026
2a	Sami Virpioja	Log-linear interpolation of 5 single-class SVMs	0.5452	0.4961	0.6529	0.5205	0.5537
2b	Sami Virpioja	Log-linear interpolation of 5 multi-class SVMs	0.5281	0.5149	0.6435	0.4507	0.5343
3a	Anne Ylinen	EH, IPLD and SC combined, one-class classification	0.4896	0.5012	0.6426	0.5362	0.5424
3b	Anne Ylinen	EH, IPLD and SC combined, 4-class classification	0.4341	0.5046	0.6248	0.4604	0.5060
4a	Lazlo Kozma		0.4966	0.5074	0.6620	0.5331	0.5498
4b	Lazlo Kozma		0.4769	0.4815	0.6283	0.4772	0.5160
5	Luis De Alba	Equal number of positive and negative samples, weighted average of features	0.3880	0.3478	0.5231	0.3503	0.4023
6	Pauli Ruonala	Results marked with ^a reversed	0.4040	0.2934^a	0.5648^a	0.3732^a	0.4089
7	Stevan Keraudy	IPLD feature, `-w1` optimization	0.5075	0.4200	0.6445	0.5195	0.5229
8a	Ville Turunen	Early fusion: For each concept, the best combination of features according to test set 1	0.5474	0.5068	0.6549	0.5092	0.5546
8b	Ville Turunen	Ensemble: Early fusion + punishing the probablity for classes that do not have the highest probability	0.5181	0.5090	0.6736	0.4545	0.5388
9a	Dusan Sovilj	OP-ELM, CL and EH features	0.4603	0.3619	0.5673	0.4687	0.4646
9b	Dusan Sovilj	OP-ELM with class priors, CL and EH features	0.4601	0.3608	0.5670	0.4775	0.4664

Interpolated precision-recall curves for bicycles, motorbikes, people, and cars. Gnuplot and data files for making these curves are also available.

Report

The results of the project work should be reported in a scientific conference style paper, which should be understandable as such, i.e. without these instructions, and complete with a title, abstract, introduction, methods, results and discussion sections, and references.

Send the final report in PDF format to Mats Sjöberg (mats.sjoberg (at) tkk.fi). The deadline for the project work is May 19th, 2008. The report can be written in English, Finnish, or Swedish.