A Unified System For Object Detection, Texture Recognition, and Context Analysis Based on the Standard Model Feature Set

S. M. Bileschi and L. Wolf (Massachusetts Institute of Technology).

Recently, a neuroscience inspired set of visual features was introduced. It was shown that this representation facilitates better performance than state-of-the-art vision systems for object recognition in cluttered and unsegmented images. In this paper, we investigate the utility of these features in other common scene-understanding tasks. We show that this outstanding performance extends to shape-based object detection in the usual windowing framework, to amorphous object detection as a texture classification task, and finally to context understanding These tasks are performed on a large set of images which were collected as a benchmark for the problem of scene understanding. The final system is able to reliably identify cars, pedestrians, bicycles, sky, road, buildings and trees in a diverse set of images.