Computer Vision (Fall 2020)
Instructor: Professor Zhigang Zhu
The City College of New York
Course Update Information
- 08/20. CUNY faculty and students can download a standalone Matlab version by creating an account with their CCNY email address at this website. On this website, go to “Matlab Portal” to create your account and download the software installer. You will also need the account information to install and use the software. Downloading the software and tools during installation can be very slow, so please do not add new tools to the default selection; instead, remove those you won’t use, such as Simulink.
- 08/31: First day of our class. Here is the Zoom Link. The passcode will be sent before class via CUNYFirst to the email addresses that students have registered there.
- 10/06/2020. Grading for Assignment 1.
- 10/26/2020. Grading for Assignments 1-2.
- 11/12/2020. Grading for Assignments 1-3.
- 11/24/2020. Grading for Assignments 1-3 (so far) and Exam. We will discuss exam answers and project presentations next Monday. Happy Thanksgiving!
- 12/02/2020. Grading for Assignments 1-3 (so far), Exam and Quiz. Please get ready for your project presentation on December 07. We will start at 4:50 sharp and proceed without breaks, in the order in which I received your proposed topics (as I showed last class). A single-person team should prepare a 5-minute presentation, and a two-person team a 10-minute presentation. I will strictly enforce the time with a timer and ask you to stop when your time is up. Each team will have 5 minutes after its presentation for Q&A and transition.
- 12/09/2020. Grading for Assignments 1-4, Exam and Quiz.
- 12/19/2020. Final grades will be submitted on Monday, December 21, 2020. Happy Holidays to all!
Computer vision has a rich history of fundamental work on color, stereo and visual motion, which has dealt with the problems of color image understanding, 3D reconstruction from multiple images, and structure from motion from video sequences. Recently, in addition to these traditional problems, the stereo and motion information present in multiple images or a video sequence is also being used to solve several other interesting problems, for example, large-scale scene modeling and rendering, video mosaicing, video segmentation, video compression and transmission, video manipulation, mobile vision, and first-person vision. The most successful vision systems that computer vision researchers can learn from are human visual systems. Therefore, this course will also briefly discuss human vision science and explore how the brain sees the world, including introductory material on computational neuroscience, motion, color and several other topics.
Course Syllabus and Tentative Schedule (mm/dd)
Part 0. Introduction and Human Vision
- 0-1. Introduction (slides) & Human Eyes (slides) -08/31
- 0-2. Visual Brain (slides) -08/31, 09/14 (no class on 09/07)
- 0-3. Depth (slides) -09/14
- 0-4. Color (slides) -09/14
Part I. 2D Computer Vision Basics
- I-1. Image Formation: Digital Image Basics (slides) (Assignment 1) – 09/21
- I-2. Image Enhancement (slides) (Lecture notes on feature extraction: I-2 and I-3) – 09/29 (Tuesday on Monday Schedule)
- I-3. Edge Detection: (slides) (Assignment 2 on I-2 and I-3) – 10/05
Part II. 3D Computer Vision
- II-1. Camera Models (slides) (lecture notes) – 10/14 (Wed on Monday Schedule)
- II-2. Camera Calibration (slides) (lecture notes) (Assignment 3 on II-1 and II-2) – 10/19
- II-3. Stereo Vision (slides) (lecture notes) (Assignment 4 on II-3 and II-4), Project Discussion – 10/26, 11/02
- II-4. Visual Motion (slides) (lecture notes), Exam Quick Review – 11/09, 11/16 – Slides updated to this point
Part III. Exam, Projects and Project Presentations
- III-1. Exam – 11/23
- III-2. Exam, Assignments and Project Discussions + a quick pop quiz – 11/30
- III-3. All Student Project Presentations – 12/07; Project Reports due 12/13 (Sunday) midnight!
Textbook and References
- Computer Vision, in the form of lecture notes and slides provided by the instructor
- Vision and Brain – How We Perceive the World, by James V. Stone, The MIT Press. Paperback | $30.00 | ISBN: 9780262517737 | 264 pp. | 6 x 9 in | 25 color illus., 132 b&w illus. | September 2012 (For students with little experience in vision and neuroscience who want to learn about human vision, the brain and computational neuroscience)
- “Computer Vision – A Modern Approach”, David A. Forsyth, Jean Ponce, Prentice Hall, 2003 (ISBN: 0130851981, 693 pages).
- “Three Dimensional Computer Vision: A Geometric Viewpoint”, Olivier Faugeras, The MIT Press, November 19, 1993 (ISBN: 0262061589, 695 pages).
Online References and additional readings when necessary.
Grading and Prerequisites
The course will accommodate both graduate and senior undergraduate students with a background in computer science, electrical and computer engineering, biomedical engineering or applied mathematics. Students who take the course for credit will be required to complete four assignments (40%), one midterm exam (40%), and one programming project (20%, including a written report and a presentation to the class at the end of the semester). The project topics will be given in the middle of the semester and will be related to the material presented in the lectures.
Students are required to have good preparation in both mathematics (linear algebra/numerical analysis) and advanced programming.