ICA applied to feature extraction from colour and stereo images

Previous work has shown that independent component features from natural image data yields features resembling Gabor functions and V1 simple-cell receptive fields. (See our introductory page.) This was first shown for static monochrome images (Olshausen and Field, 1996; Bell and Sejnowski, 1997), and subsequently it has been shown that the features learned from dynamic image data (video) also strongly resemble simple-cell receptive fields (van Hateren and Ruderman, 1998). The goal of this project was to extend the analysis to consider the effect of chromatic and stereo information.

Independent component features from colour images

As data, we used a set of natural images in full colour. Three images from this dataset are shown below:

As with the standard ICA on image data approach, we model our data (small image patches from the images) by a linear model, and estimate the basis vectors that give sparse, independent stochastic coefficients. The only difference is that we have three channels (RGB) instead of one (brightness):

_{=
s1+
s2+
... + sk}

Of course, these can still be displayed normally by superimposing the channels, and that is how we display them subsequently.

Having sampled a large number of image patches from our natural scene data, we estimate the linear ICA model given above, and visualize the basis vectors:

Examining the basis closely reveals that the features found are very similar to earlier results on monochrome image data, i.e. the basis patches resemble Gabor functions. The decomposition clearly separates 3 different channels: red-green, blue-yellow, and monochrome features.

How do these features compare with V1 receptive fields, in terms of colour coding? This is a difficult question, as there are quite conflicting results on the chromatic dimension of simple-cell receptive fields. However, the ICA representation does seem to be in agreement with many physiological findings. See our paper for some discussion on this issue.

Independent component features from stereo images

To see if the ICA model can account for binocular properties of simple-cells in V1, we estimated the model from a set of stereo images of natural scenes. One such image is shown below:

The left image should be seen with the left eye, and the right image with the right eye (i.e. uncrossed viewing). Note the subtle but important differences in the images due to the three-dimensional viewing geometry.

We simulate fixations (by finding matching points), and then sample corresponding image patches in the two images. This gives a distribution of disparities (centred on zero) in the data. We then model this data by the linear ICA model (here, the top patch is the patch from the left-eye image, and the bottom patch is the corresponding one from the right-eye image):

_{=
s1+
s2+
... + sk}

Estimating the model leads to the following kinds of features:

Each pair of patches (horizontal neighbours) corresponds to one basis vector. It is readily seen that the features exhibit varying degrees of 'ocular dominance': some basis vectors code for features present in one eye only, whereas others code for features present in both views. Also, our features have inter-ocularly matched preferred spatial frequency and orientation. This is quite similar to the representation in the visual cortex. Finally, our features show some selectivity to disparity as well: some represent zero disparities, some positive disparities, and some negative ones.

For more details, see our paper:

P.O. Hoyer and A. Hyvärinen. Independent Component Analysis Applied to Feature Extraction from Colour and Stereo Images. Network: Computation in Neural Systems, 11(3):191-210, 2000.
Postscript gzipped PostScript

Patrik Hoyer & Aapo Hyvarinen
December 2001