Computer Vision and Image Processing – Spring 2021
- Instructor: Professor Zhigang Zhu
- The CUNY Graduate Center and City College
- Course Codes: GC 74030-1 (63185) and CNS 80300-1 (59866) Computer Vision and Image Processing
- Time and Location: Wednesday 9:30 – 11:30 am, The CUNY Graduate Center (Room TBD)
- Office Hours: TBD
Course Update Information
- 02/03/2021. The first day of this class.
This course covers fundamental work on color, shapes, stereo, and visual motion, addressing the problems of image understanding, 3D reconstruction from multiple images, and structure from motion in video sequences. In addition to these traditional problems, we will showcase a few examples of machine learning approaches that have been successful in computer vision tasks, such as facial computing and crowd analysis using CNNs, DBNs, LSTMs, and GANs. Moreover, the most successful vision system that computer vision researchers can learn from is the human visual system, so the course will also briefly discuss human vision science and explore how the brain sees the world.
Course Syllabus and Tentative Schedule (mm/dd)
Part I. Introduction and Human Vision
- I-1. Human and Machine Vision (slides); Human Eyes (slides) – 02/03
- I-2. Visual Brain (slides) – 02/10
- I-3. Depth (slides) (Assignment 1 on I-3 and I-4) – 02/17
- I-4. Color (slides) – 02/17
Part II. 3D Computer Vision
- II-1. Camera Models (slides) (lecture notes) (Assignment 2 on II-1 and II-2) – 02/24
- II-2. Camera Calibration (slides) (lecture notes) – 03/03
- II-3. Stereo Vision (slides) (lecture notes) (Assignment 3 on II-3 and II-4); Project Discussion – 03/10, 03/17
- II-4. Visual Motion (slides) (lecture notes); Exam Quick Review – 03/24, 04/07 (Spring Recess: 03/27-04/04)
Part III. Advanced Topics and Project Presentations
- III-1. Deep neural networks I: (1) From Linear Regression to CNN, RNN and LSTM; (2) Facial Computing – 04/14
- Mid-Term Exam in Class – 04/21
- III-2. Deep neural networks II: Multimodal classification with SVM, DBN and CNN – 04/28
- III-3. Deep neural networks III: GANs, Regression, and Crowd Analysis – 05/05
- III-4. All Student Project Presentations – 05/12; Project Reports
Textbook and References
- Computer Vision, in the form of lecture notes and slides provided by the instructor
- Vision and Brain – How We Perceive the World, by James V. Stone, The MIT Press. Paperback | $30.00 | ISBN: 9780262517737 | 264 pp. | 6 x 9 in | 25 color illus., 132 b&w illus. | September 2012 (Recommended for students with little experience in vision and neuroscience, as an introduction to human vision, the brain, and computational neuroscience)
- Online references and additional readings, when necessary
Grading and Prerequisites
The course will accommodate both PhD students in Computer Science and master's-level graduate students in Data Science and Cognitive Neuroscience at the CUNY Graduate Center. Students who take the course for credit will be required to complete three assignments (15% each), one midterm exam (30%), and one programming project (25%, comprising a written report (10%) and a presentation to the class at the end of the semester (15%)). Project topics will be given in the middle of the semester and will be related to the material presented in the lectures.
Students are required to have good preparation in both mathematics (linear algebra/numerical analysis) and advanced programming.