Maths encyclopedia and lessons  
Search

Mathematics Encyclopedia and Lessons

 
     
 

Lessons

Popular
Subjects

algebra
arithmetic
calculus
equations
geometry
differential equations
trigonometry
number theory
probability theory
more
 

References

applied mathematics
mathematical games
mathematicians
more
 
 

Scale-invariant feature transform

Scale-invariant feature transform (or SIFT) is a computer vision algorithm for extracting distinctive features from images, to be used in algorithms for tasks like matching different views of an object or scene (e.g. for stereo vision) and Object recognition . The features are invariant to image scale, rotation, and partially invariant (i.e. robost) to changing viewpoints, and change in illumination. The name Scale-invariant feature transform was chosen, as the algorithm transforms image data into scale-invariant coordinates relative to local features.

The algorithm was devised by David Lowe at the University of British Columbia, who has a US-patent on it.

First, the original image is progressively Gaussian blurred with sigma in a band from 1 to 2 resulting in a series of Gaussian blurred images (cascade filtering ). Then, these images are substracted from their direct neighbors (by sigma) to produce a new series of images (with difference of Gaussian ).

The major steps in the computation of the image features are

  1. Scale-space extrema detection - each pixel in the images are compared to its 8 neighbors and the 9 pixels each (corresponding pixel+8 neighbors) of the other pictures in the series.
  2. keypoint localization - keypoints are chosen from the extrema in scale space.
  3. orientation assignment - for each keypoint, in a 16x16 window, histograms of gradient directions are computed (using bilinear interpolation).
  4. keypoint descriptor - representation in a 128-dimensional vector.

For the application of SIFT keypoints in matching and object recognition, Lowe was applying a nearest neighbor algorithm, followed by a Hough transform for object recognition (as described in Lowe, 2004).

SIFT is a fundamental part of the visual pattern recognition (ViPR) and visual simultaneous localization and mapping (vSLAM) algorithms developed by Evolution Robotics .

The feature representations found by SIFT are thought to be analogous to those of neurons in inferior temporal cortex , a region used for object recognition in primate vision.

SIFT is used by Aibo to locate its charging station.

see also

External links

References

  • Lowe, D. G., "distinctive image features from scale-invariant keypoints", International Journal of Computer Vision, 2004
01-04-2007 01:18:14
The contents of this article are licensed from Wikipedia.org
under the GNU Free Documentation License. How to see transparent copy