This course will cover
essentials of computer vision. We will learn basic principles of image
formation, image processing algorithms and different algorithms for
3D reconstruction and recognition from single or multiple
images (video). Apllications to 3D modelling, video analysis,
video surveillance, object recognition and vision based control will
This course is of interest to anyone seeking to process images or camera information, or to acquire a general background in issues related to real-world perception, image processing, object and scene recognition and multi-view geometry
1. Representation of 3-D scenes : rigid body motion, euclidean, affine and projective transformations.
2. Image formation: geometric and photometric aspects of image formation process, binary, grey level and color images
3. Image features and Correspondence: geometric and photometric features, feature detection and matching, optical flow
4. Stereo - Two view geometry : camera pose and 3D structure recovery from two views, camera calibration, 3-D reconstruction
6. Image Matching and Tracking : matching of multiple views, tracking and video analysis
8. Grouping and Segmentation : detection and recovery of multiple motions
9. Detection and Recognition of 0bjects in Images: object representations and classification methods
10. Selected topics: vision based control, image based rendering pipeline, vision for human computer intraction, recognition
Grading: Homeworks (about every 2 weeks) 40% Midterm:
30% Final project: 30%
Prerequisites: linear algebra, calculus
Lecture Materials: Lecture slides, lecture notes provided by instructor
Computer Vision: Stockman and Shapiro, Prentice Hall.
Invitation to 3D Vision: From Images to Geometric Models: Y. Ma, S. Soatto, J. Kosecka and S. Sastry (for part I of the course)
Introductory Techniques for 3D computer Vision. E. Trucco and A. Verri, Prentice-Hall, 1998
Computer Vision: A Modern Approach: D. Forsythe and J. Ponce, Prentice-Hall, 2003
Matlab, OpenCV. Homeworks will require using Matlab and OpenCV. You can buy a student version in Johnson center or use it remotely from ITE labs. OpenCV is an C/C++ open source computer vision library.
Outcome: Students will obtain basic understanding of images formation process, processing of digital images and video and will gain familiarity with different algorithms for 3D reconstruction and recognition of objects in images. In this context of these computer vision problems, the students will use elemetary geometry, linear algebra, probabilistic inferenec and basic machine learning and pattern recognition techniques. Students will obtain capabilities for implementing learned algorithms in C/C++ and Matlab.