Sub-linear Indexing for Large Scale Object Recognition


Obdržálek, Štěpán, Matas, Jiří
BMVC 2005, London, UK
BMVC 2005: Proceedings of the 16th British Machine Vision Conference
Volume 1, Pages 1-10
September, 2005

Abstract

Realistic approaches to large scale object recognition, i.e. for detection and localisation of hundreds or more objects, must support sub-linear time indexing. In the paper, we propose a method capable of recognising one of N objects in log(N) time. The visual memory. is organised as a binary decision tree that is built to minimise average time to decision. Leaves of the tree represent a few local image areas, and each non-terminal node is associated with a .weak classifier.. In the recognition phase, a single invariant measurement decides in which subtree a corresponding image area is sought. The method preserves all the strengths of local affine region methods . robustness to background clutter, occlusion, and large changes of viewpoints. Experimentally we show that it supports near real-time recognition of hundreds of objects with state-of-the-art recognition rates. After the test image is processed (in a second on a current PCs), the recognition via indexing into the visual memory requires milliseconds.

Keywords

object recognition, local affine frames, MSER, LAF


Bibtex entry

@InProceedings{obdrzalek-tree-bmvc05,
  author =      {Obdr{\vz}{\'a}lek, {\vS}t{\ve}p{\'a}n and  
                 Matas, Ji{\vr}{\'i}},
  title =       {Sub-linear Indexing for Large Scale Object Recognition},
  year =        {2005},
  pages =       {1--10},
  booktitle =   {BMVC 2005: Proceedings of the 16th British Machine Vision Conference},
  volume =      {1},
  editor =      {Clocksin, WF and Fitzgibbon, AW and Torr, PHS},
  isbn =        {1-901725-29-4},
  publisher =   {BMVA},
  address =     {London, UK},
  month =       {September},
  day =         {5--8},
  venue =       {Oxford, UK},
  annote = { Realistic approaches to large scale object recognition,
    i.e. for detection and localisation of hundreds or more objects,
    must support sub-linear time indexing. In the paper, we propose a
    method capable of recognising one of N objects in log(N) time. The
    visual memory.  is organised as a binary decision tree that is
    built to minimise average time to decision. Leaves of the tree
    represent a few local image areas, and each non-terminal node is
    associated with a .weak classifier.. In the recognition phase, a
    single invariant measurement decides in which subtree a
    corresponding image area is sought. The method preserves all the
    strengths of local affine region methods . robustness to
    background clutter, occlusion, and large changes of
    viewpoints. Experimentally we show that it supports near real-time
    recognition of hundreds of objects with state-of-the-art
    recognition rates. After the test image is processed (in a second
    on a current PCs), the recognition via indexing into the visual
    memory requires milliseconds. },
 keywords =    {object recognition, local affine frames, MSER, LAF},
}