Outline of object recognition explained

Object recognition  - technology in the field of computer vision for finding and identifying objects in an image or video sequence. Humans recognize a multitude of objects in images with little effort, despite the fact that the image of the objects may vary somewhat in different view points, in many different sizes and scales or even when they are translated or rotated. Objects can even be recognized when they are partially obstructed from view. This task is still a challenge for computer vision systems. Many approaches to the task have been implemented over multiple decades.

Approaches based on CAD-like object models

Recognition by parts

Appearance-based methods

Edge matching

Divide-and-Conquer search

Greyscale matching

Gradient matching

Histograms of receptive field responses

Large modelbases

Feature-based methods

Interpretation trees

Hypothesize and test

(1 – Wc)k = Z

Pose consistency

§ For example, in cases where, if the object was at that pose, the object frame group would be invisible.

Scale-invariant feature transform (SIFT)

Speeded Up Robust Features (SURF)

Bag of words representations

See also: Bag of words model in computer vision.

Genetic algorithm

Genetic algorithms can operate without prior knowledge of a given dataset and can develop recognition procedures without human intervention. A recent project achieved 100 percent accuracy on the benchmark motorbike, face, airplane and car image datasets from Caltech and 99.4 percent accuracy on fish species image datasets.[9] [10]

Other approaches

Applications

Object recognition methods has the following applications:

Surveys

See also

Lists

References

Notes and References

  1. Rahesh Mohan . Rakamant Nevatia . amp . Perceptual organization for scene segmentation and description. 1992. IEEE Trans Pattern Anal Mach Intell.
  2. Swain. Michael J.. Ballard. Dana H.. 1991-11-01. Color indexing. International Journal of Computer Vision. en. 7. 1. 11–32. 10.1007/BF00130487. 8167136 . 1573-1405.
  3. Schiele. Bernt. Crowley. James L.. 2000-01-01. Recognition without Correspondence using Multidimensional Receptive Field Histograms. International Journal of Computer Vision. en. 36. 1. 31–50. 10.1023/A:1008120406972. 2551159 . 1573-1405.
  4. ftp://ftp1.nada.kth.se/pub/documents/CVAP/reports/LinLin04-ICPR.pdf O. Linde and T. Lindeberg "Object recognition using composed receptive field histograms of higher dimensionality", Proc. International Conference on Pattern Recognition (ICPR'04), Cambridge, U.K. II:1-6, 2004.
  5. Composed Complex-Cue Histograms: An Investigation of the Information Content in Receptive Field Based Image Descriptors for Object Recognition. Computer Vision and Image Understanding . 116. 4 . 538–560. 2012 . O. Linde . T. Lindeberg. 10.1016/j.cviu.2011.12.003 .
  6. http://citeseer.ist.psu.edu/lowe04distinctive.html Lowe, D. G., "Distinctive image features from scale-invariant keypoints", International Journal of Computer Vision, 60, 2, pp. 91-110, 2004.
  7. Lindeberg, Tony. 2012. Scale invariant feature transform. Scholarpedia. 7. 5. 10491. 10.4249/scholarpedia.10491. 2012SchpJ...710491L. free.
  8. 10.1016/j.cviu.2007.09.014 . 110 . 3 . Speeded-Up Robust Features (SURF) . 2008 . Computer Vision and Image Understanding . 346–359 . Bay . Herbert . Ess . Andreas . Tuytelaars . Tinne . Van Gool . Luc. 10.1.1.205.738 . 14777911 .
  9. Web site: New object recognition algorithm learns on the fly . 20 January 2014 . Gizmag.com . 2014-01-21.
  10. Lillywhite . K. . Lee . D. J. . Tippetts . B. . Archibald . J. . A feature construction method for general object recognition . 10.1016/j.patcog.2013.06.002 . Pattern Recognition . 46 . 12 . 3300 . 2013 . 2013PatRe..46.3300L .
  11. Brown, Matthew, and David G. Lowe. "Unsupervised 3D object recognition and reconstruction in unordered datasets." 3-D Digital Imaging and Modeling, 2005. 3DIM 2005. Fifth International Conference on. IEEE, 2005.
  12. Oliva, Aude, and Antonio Torralba. "The role of context in object recognition." Trends in cognitive sciences 11.12 (2007): 520-527.
  13. Stein, Fridtjof, and Gérard Medioni. "Structural indexing: Efficient 3-D object recognition." IEEE Transactions on Pattern Analysis & Machine Intelligence 2 (1992): 125-145.
  14. Zhu, Song-Chun, and David Mumford. "A stochastic grammar of images." Foundations and Trends in Computer Graphics and Vision 2.4 (2007): 259-362.
  15. Nayar, Shree K., and Ruud M. Bolle. "Reflectance based object recognition." International journal of computer vision 17.3 (1996): 219-240.
  16. Worthington, Philip L., and Edwin R. Hancock. "Object recognition using shape-from-shading." IEEE Transactions on Pattern Analysis and Machine Intelligence 23.5 (2001): 535-542.
  17. Shotton, Jamie, et al. "Textonboost for image understanding: Multi-class object recognition and segmentation by jointly modeling texture, layout, and context." International journal of computer vision 81.1 (2009): 2-23.
  18. Niu, Zhenxing, et al. "Context aware topic model for scene recognition." 2012 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 2012.
  19. Web site: Better robot vision . KurzweilAI . 2013-10-09.
  20. Donahue, Jeffrey, et al. "Long-term recurrent convolutional networks for visual recognition and description." Proceedings of the IEEE conference on computer vision and pattern recognition. 2015.
  21. Karpathy, Andrej, and Li Fei-Fei. "Deep visual-semantic alignments for generating image descriptions." Proceedings of the IEEE conference on computer vision and pattern recognition. 2015.
  22. P Duygulu . K Barnard . N de Fretias . D Forsyth . amp . Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary . Proceedings of the European Conference on Computer Vision . 2002 . 97–112 . dead . https://web.archive.org/web/20050305174408/http://vision.cs.arizona.edu/kobus/research/publications/ECCV-02-1/ . 2005-03-05 .
  23. Web site: Android Eyes Computer Vision. Martha J. Farah "Visual Agnosia", Computer Vision Computing Cognitive Neuroscience, MIT Press, 2011-05-01, Pages 760-781, ISSN 1468-4233 https://play.google.com/store/apps/details?id=com.davecote.androideyes
  24. Esteva, Andre, et al. "Dermatologist-level classification of skin cancer with deep neural networks." Nature 542.7639 (2017): 115.
  25. Brown, M., and Lowe, D.G., "Recognising Panoramas," ICCV, p. 1218, Ninth IEEE International Conference on Computer Vision (ICCV'03) - Volume 2, Nice,France, 2003
  26. Li, L., Guo, B., and Shao, K., "Geometrically robust image watermarking using scale-invariant feature transform and Zernike moments," Chinese Optics Letters, Volume 5, Issue 6, pp. 332-335, 2007.
  27. Se,S., Lowe, D.G., and Little, J.J.,"Vision-based global localization and mapping for mobile robots", IEEE Transactions on Robotics, 21, 3 (2005), pp. 364-375.
  28. Thomas Serre, Maximillian Riesenhuber, Jennifer Louie, Tomaso Poggio, "On the Role of Object-Specific features for Real World Object Recognition in Biological Vision." Artificial Intelligence Lab, and Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Center for Biological and Computational Learning, Mc Govern Institute for Brain Research, Cambridge, MA, USA
  29. Permaloff. Anne. Grafton. Carl. 1992. Optical Character Recognition. PS: Political Science and Politics. 25. 3. 523–531. 10.2307/419444. 419444 . 64806776 . 1049-0965.
  30. Christian Demant, Bernd Streicher-Abel, Peter Waszkewitz, "Industrial image processing: visual quality control in manufacturing"
  31. Nuno Vasconcelos "Image Indexing with Mixture Hierarchies" Compaq Computer Corporation, Proc. IEEE Conference in Computer Vision and Pattern Recognition, Kauai, Hawaii, 2001
  32. 10.1016/j.imavis.2003.09.010 . 22 . 7 . A real-time system for monitoring of cyclists and pedestrians . 2004 . Image and Vision Computing . 563–570 . Heikkilä . Janne . Silvén . Olli.
  33. Book: Jung. Ho Gi. Kim. Dong Suk. Yoon. Pal Joo. Kim. Jaihie. 2006. Yeung. Dit-Yan. Kwok. James T.. Fred. Ana. Roli. Fabio. de Ridder. Dick. Structure Analysis Based Parking Slot Marking Recognition for Semi-automatic Parking System. Structural, Syntactic, and Statistical Pattern Recognition. Lecture Notes in Computer Science . 4109 . en. Berlin, Heidelberg. Springer. 384–393. 10.1007/11815921_42. 978-3-540-37241-7. free.
  34. S. K. Nayar, H. Murase, and S.A. Nene, "Learning, Positioning, and tracking Visual appearance" Proc. Of IEEE Intl. Conf. on Robotics and Automation, San Diego, May 1994
  35. Liu . F. . Gleicher . M. . Jin . H. . Agarwala . A. . Content-preserving warps for 3D video stabilization . 10.1145/1531326.1531350 . ACM Transactions on Graphics . 28 . 3 . 1 . 2009 . 10.1.1.678.3088 .