A Model-Based Technique for the Classification of Textured Surfaces with Illuminant Direction Invariance

G. McGunningle & M.J. Chantler

Dept. of Computing and Electrical Engineering

Heriot-Watt University

Edinburgh

gmg@cee.hw.ac.uk

In this paper, the authors present a novel scheme for the reduction of texture misclassification due to variation of the illuminant direction between the training and classification stages. The approach is model-based, using photometric techniques to form a surface description. This description is rendered under the illumination conditions at which classification will take place to simulate training data which is appropriate to the classification task. This paper uses a feature set of twelve Gabor filters and the similarity of the simulated and actual features will be assessed. The classification accuracy of the model-based technique is assessed against that of a classifier trained under only one illumination condition and another classifier retrained for each illumination condition.

  1. Introduction
  2. Research into the perception of texture is fundamental to the understanding of biological visual systems and the development of general machine vision systems. However, even in the much more constrained area of inspection there are a wide range of tasks which require textures to be identified and categorised. This paper will use the exemplar of a hypothetical texture inspection task, where a textured surface will be imaged by a camera whose axis is normal to the texture plane. The textures present in the image will be classified as belonging to an a priori known class.

    Perceived textures may be due to a projected illumination pattern, surface markings, the interaction of illumination with a rough surface or a combination of all three. This work is primarily concerned with the automated classification of textures belonging to the third group. While this type of texture is formed by the interaction of illumination and surface, most applications require the classification of surface type, regardless of illumination conditions. Consequently, work has been carried out on texture classification which is invariant to changes in the illuminant intensity [Thau90] and colour [Healey95]. However, the effect of variations in illuminant direction has been less frequently considered [Chantler94].

    The direction of the illuminant with respect to the texture may defined by two polar co-ordinates: slant and tilt. Slant is the angle between the camera axis and the illuminant vector, in this work, it shall be held constant at 50° . Tilt refers to the polar angle of the illuminant on a plane normal to the camera axis. The aim of this work is to maintain consistently accurate classification of textures under changes in the illuminant tilt angle.

  3. The Model-Based Approach
  4. It has been shown that changes in illuminant tilt can induce classifier failure[Chantler95a]. For isotropic surfaces a change in illuminant tilt is effectively an instance of rotation, with the texture properties being preserved. However, for directional surfaces, the information content of the image will be altered by a change in the illuminant direction. While Chantler [Chantler95b] used a frequency domain technique, based on Kube's model [Kube89], to reduce the effect of tilt, non-linearities and artefacts in both the reflectance function and the imaging process present limitations to the practical application of this technique. In this section we propose a simulation-based technique which is defined in the spatial domain and circumvents many of the constraints of the frequency domain.

    The simulation-based approach is designed to anticipate the feature space distributions by modelling the underlying physical and analytical processes of imaging and feature extraction (Figure 2.1). This technique forms a spatial model of the training surfaces in the recovery stage. The classification process proper, begins with secondary training, where the synthetic surface is rendered under the experimental conditions, and the resulting image forms the basis for training. If the model’s components, the reflectance function and the surface description, are sufficiently accurate we should be able to obtain classification rates approaching those of the ‘best case’ classification, i.e. classification based on training surfaces illuminated at the appropriate tilt angle.






























    Figure .1 Model Based Classification

    The model consists of two components, a reflectance function, which we consider to be Lambertian, and a surface description. In order to obtain this description, we must adopt a method of recovery. Several cues to surface recovery have been investigated: focus [Noguchi94], binocular stereo [Papadimitriou95], and laser based approaches [Gross95] have been used. Let us assume that we will be able to invest more effort and exert more control in the recovery stage than in the classification stages. Ideally, we seek a technique which requires no additional hardware beyond that required for the classification. This immediately suggests the field of shape from shading (SFS).

    The single image SFS problem is ill-posed and requires additional constraints, the smoothness constraint being almost universal. This clearly prevents the vast majority of single image algorithms being applied to the rough surfaces considered in this paper. Two algorithms which do not use this constraint are Knill's adaptive technique [Knill90] and Pentland's frequency-based technique [Pentland90]. Knill's technique uses prior knowledge of the surface and is therefore unsuitable for classification tasks, while Pentland's scheme is subject to the same problems of non-linearity as Chantler's tilt compensation scheme. The authors note that, if a single image scheme did form a consistent estimate of the surface, it would be possible to classify on the basis of the surface, rather than on the image, removing the need to resort to a model-based technique.

    The technique of photometric stereo allows the formation of a surface description from several images of the same surface imaged under various illumination directions. The technique is entirely localised and does not require a smoothness constraint. It therefore seems ideally suited to our purposes and is intuitively satisfying since our problem is itself caused by an illuminant of variable direction. In fact the concept of recovering a surface for subsequent image prediction is not original, Russell used photometric techniques to acquire depth maps which could then be synthetically illuminated to simulate aerial images [Russell91], though the authors are unaware of any application of this approach to texture classification.

    Since it was first advocated by Woodham [Woodham80], photometric stereo has been further developed to deal with specularities [Coleman82], accommodate more complex reflectance maps [Tagare91], use fewer images [Onn90], recover shape and the reflectance function simultaneously [Tagare90], estimate the degree of subpixel roughness [Solomon96] and has been implemented in various fashions [Woodham94], [Iwahori95]. In this paper, where the accuracy of surface recovery is less important than that of image prediction, and where the surfaces are near-Lambertian, we adopt the following simple and sub-optimal scheme.

    Consider a facet with derivatives p,q, illuminated from a tilt of t and a slant of s . If we assume the reflectance function to be Lambertian, the perceived intensity will be given by

    Equation 1

    Now, if we obtain three images with s held constant and t =0° ,90° and 180° , we may proceed as follows:

    Equation 2

    Now, dividing the t =0° and 90° facet images by (2) we can remove any albedo variation, and the non-linear denominator to form expressions for the surface derivatives.

    The authors accept that the surface recovery performance, and generality of this technique, is inferior to any modern photometric technique. However, in the context of this application it offers an adequate prediction of the correct training image at the appropriate tilt angle for a minimum of implementation complexity, memory requirements and computational effort.

  5. Classification
    1. The Classification Task and Approach
    2. We have created two montages: Anaglypta and Rock (Figure 3.1), as classification tasks for the classifier. Anaglypta is composed of highly directional and consistent surfaces and represents a relatively easy classification task, given appropriate training data. The Rock montage consists of more natural surfaces, only one of which is directional, these surfaces all have spectra of a similar form, and lack characteristic spectral peaks, this montage consequently is more difficult to classify. Both montages are torroidal.

      Figure .1 The Anaglypta and Rock texture montages, illuminated from t =50° , s =50°

      The classifier we adopt is based on a set of 12 Gabor filters following the definition of Jain [Jain91], followed by a non-linearity and averaging, classified using a quadratic discriminant and post-processed using a mode filter. The filters have frequencies of 32, 64 and 128 cycles per image, are oriented at 0,45,90 and 135° with an angular bandwidth of 45° . The filter outputs are squared and then averaged using a Guassian low pass filter. Finally the square root of the images is taken, effectively acting as a power transform, to force the data into a more Gaussian form before being passed to a statistical classifier.

       

    3. Comparison with a Normal Classifier.
    4. The complete classifiers are now applied to both test montages. The classifiers use the same feature images and differ only in the training data. The normal classifier uses training data obtained at t =0° while the model-based technique uses simulated training data.

       

      Figure .2 Accuracy of classification of anaglypta textures.

       

      Figure .3 Classification accuracy of Rock montage.

      In both montages the model-based technique clearly offers a dramatic improvement, in both the quality and stability of tilt variant classification, over the normal classifier. It is interesting to compare the rapidity of classification failure of the normal classifier for the Anaglypta montage, with the more gradual failure associated with the Rock montage. We suggest this may be a consequence of the extremely directional nature of the anaglypta textures. In both cases, the model-based scheme has arrested the catastrophic degradation in classification accuracy associated with changes in illuminant tilt.

      Classified Anaglypta Montage Classified Rock Montage

      Figure .4 Classification montages of Anaglypta and Rock at Tau 50°

    5. Comparison with the 'Best Case' Classifier.

    The simulation-based scheme seeks to reduce classification error by simulating the input image at a given tilt value. It is therefore natural that the system's classification accuracy should be compared with the accuracy of a classifier which has been retrained at each orientation on the actual data, and which consequently forms the ‘best case’ case classification for a given texture set and feature set.

    Figure .5 Comparison of Model Based and Best Case Classifiers for Anaglypta Montage

    Figure .6 Comparison of Model Based and Best Case Classifiers for Rock Montage

    With the exception of the two aberrations in the anaglypta classification at the extremes of the tilt range, all the simulation-based classifications approach the accuracy of the best case classification. The anomaly in our results for the anaglypta montage at Tau 0° and 180° , is particularly puzzling since these images are two of the recovery images and we would expect any artefacts in this frame to be transmitted throughout the tilt range. The high misclassification rates in the model-based scheme coincides with an increase in the best-case classification, suggesting that the technique is more sensitive to the difficulty of classification than a classifier based on real data. However, overall, the results show that in all the other cases the technique has successfully stabilised the misclassification rate at a low level which approaches the best rate possible with the feature set used.

     

  6. Discussion
  7.  

    While the experimental results of the simulation-based scheme are extremely promising, there are some points which require discussion.

    · Shadowing. Shadowing in the recovery images presents a problem for photometric techniques in general, however, given the formulaic nature of the approach used here it is a particular problem with this scheme. We note that several researchers such as [Solomon96] use schemes which use more images to reduce the impact of shadowing on surface estimation. Shadowing also presents a problem at the rendering stage. Since the surface description is expressed in terms of surface derivatives, rather than absolute height, it is not possible to incorporate shadowing into the rendering algorithm.

    · Reflectance assumptions. In the experimental work carried out here, all surfaces were of approximately Lambertian reflectance and uniform albedo. Suspension of the former condition will require more complex surface recovery and rendering algorithms, but will not of itself affect the validity of the simulation/training concept. The uniformity condition on the other hand is more serious: although the albedo variation may be isolated from the training data, it will be difficult to generalise the demodulation process to the classification data.

    · Rotation-Invariance. This is probably the most fundamental and intractable limitation of the model-based scheme. The model-based system as discussed in this chapter is, almost by definition, unable to deal with the rotation of directional surfaces. However, simulation-based techniques may still have a significant role as a means of economically generating large volumes training data for the design of classifiers which are invariant both to the rotation of textured surfaces and the associated variation in relative illuminant tilt.

    · Slant-variation. This paper has only considered the performance of the scheme for variations in illuminant tilt with the slant angle constant for recovery and classification images. It could therefore be argued that the scheme can only be described as an interpolation scheme since the generalisation properties of the scheme have not been assessed. If the recovery and rendering algorithms are sufficiently accurate, the scheme should be able to deal with slant variation and will form an extremely powerful support tool for texture analysis. The effect of slant variation will also have a significant effect on shape from texture techniques which are applied to rough surfaces, however, it is worth noting that the increased degree of shadowing associated with this task may require recovery of the actual height map, rather than just the surface derivatives. The required accuracy for height recovery is beyond the capabilities of the recovery technique discussed here.

     

  8. Conclusions
  9. The first conclusion of this paper is that even the simple shape recovery system described here is able to form the basis of feature prediction to a good degree of accuracy for the textures considered in this paper. This not withstanding, we do express reservations as to the accuracy of derivative recovery. While inaccuracies at the observed levels do not appear to seriously affect image simulation, it is the belief of the authors that the accuracy of estimated derivatives is inadequate for the process of surface height recovery where errors are cumulative. However, the level of accuracy obtained is adequate for the purposes of training a classifier to discriminate between the members of our data sets. More sophisticated models are available and are an area of continuing research within the machine vision community; application of these techniques will allow the relaxation Lambertian requirement and more reliable surface recovery.

    The second conclusion is that a simulation-based system forms an effective approach to maintaining consistently good classification regardless of illuminant tilt. The misclassification rate is consistent and in most cases approaches the ‘best case’ level. The authors conclude that the concept of simulating training data does represent a novel and powerful tool for the development of systems robust to illuminant changes for relatively little extra effort at the training stage.

     

  10. References

Chantler94 M.Chantler, "The effect of variation in illuminant direction on texture classification", PhD Thesis, Heriot- Watt University, August 1994.

Chantler95a M. Chantler, "Why illuminant direction is fundamental to texture analysis." IEE Proceedings in Vision, Image and Signal Processing, Vol.142, No.4 August 1995

Chantler95b M. Chantler, G. McGunnigle, "Compensation of Illuminant Tilt Variation for Texture Classification", IEE Conference on Image Processing And Its Applications, 4-6 July 1995

Coleman82 E. Coleman & R. Jain, "Obtaining 3-Dimensional Shape of Textured and Specular Surfaces Using Four-Source Photometry", Computer Graphics and Image Processing Vol.18 1982, pp.309-328

Gross95 M. Gross & R. Koch, "Visualization of Multidimensional Shape and Texture Features in Laser Range Data Using Complex-Valued Gabor-Wavelets", IEEE Transactions on Visualization and Computer Graphics, 1995, Vol.1 No.1 pp. 44-59.

Healey95 G. Healey, L. Wang, "Illuminant-Invariant Recognition of Texture in Color Images", Journal of the Optical Society of America, 1995, Vol.12, No.9, pp.1877-1883.

Iwahori95 Y. Iwahori, R. Woodham & A. Bagheri, "Principal Components Analysis and Neural Network Implementation of Photometric Stereo" 1995 IEEE pp117-125

Jain91 A. Jain & F. Farrokhnia, "Unsupervised texture segmentation using Gabor filters", Pattern Recognition, 1991, 24, pp1167-1186

Kube88 P. Kube, & A. Pentland, "On the Imaging of Fractal Surfaces", IEEE Transactions Pattern Analysis and Machine Intelligence, Vol.10, No.5, pp704-707, Sept. 1988

Knill90 D. Knill & D. Kersten, "Learning a Near-Optimal Estimator for Surface Shape from Shading", Computer Vision, Graphics and Image Processing, Vol.50, 1990 pp75-100

Papadimitriou95 D. Papadimitriou & T. Dennis, "A stereo disparity algorithm for 3D model construction", IEE Conference on Image Processing And Its Applications, 4-6 July 1995

Pentland90 A. Pentland, "Linear Shape from Shading", International Journal of Computer Vision, Vol.4, 1990 pp153-162

Russell91 K. Russell, J. McFee & M. Ito, "Simulation of images by photometric stereo modelling" Optical Engineering, Vol.30 No.9, September 1991

Solomon96 F. Solomon, K. Ikeuchi, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.18, No.4, April 1996.

Tagare90 H. Tagare, R. deFigueiredo, "Simultaneous Estimation of Shape and Reflectance Maps from Photometric Stereo" International Conference on Computer Vision. 1990 pp340-343

Tagare91 H. Tagare, R. deFigueiredo, "A Theory of Photometric Stereo for a Class of Diffuse Non-Lambertian Surfaces" IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.13, No.2, February 1991.

Thau90 R.S. Thau, "Illuminant Precompensation for Texture Discrimination using Filters", Image Understanding Workshop (DARPA) Pittsburg PA, Sept.1990 pp 179-184 Publ. Morgan Kaufmann ISBN 1-55860- 140-6

Woodham80 R. Woodham, "Photometric method for determining surface orientation from multiple images", Optical Engineering Jan./Feb. 1980, Vol.19, No.1

Woodham94 R.Woodham, "Gradient and curvature from photometric-stereo method, including local confidence estimation" Journal of Optical Society of America" Vol.11 No.11 November 1994,pp3050-3068