- From www.udacity.com
Introduction to Computer Vision
- Self-paced
- Free Access
- 16 Sequences
- Introductive Level
Course details
Syllabus
1 Introduction
- 1A Introduction2 Image Processing for Computer Vision
- 2A Linear image processing- 2B Model fitting- 2C Frequency domain analysis3 Camera Models and Views
- 3A Camera models- 3B Stereo geometry- 3C Camera calibration- 3D Multiple views4 Image Features
- 4A Feature detection- 4B Feature descriptors- 4C Model fitting5 Lighting
- 5A Photometry- 5B Lightness- 5C Shape from shading6 Image Motion
- 6A Overview- 6B Optical flow7 Tracking
- 7A Introduction to tracking- 7B Parametric models- 7C Non-parametric models- 7D Tracking considerations8 Classification and Recognition
- 8A Introduction to recognition- 8B Classification: Generative models- 8C Classification: Discriminative models- 8D Action recognition9 Useful Methods
- 9A Color spaces and segmentation- 9B Binary morphology- 9C 3D perception10 Human Visual System
- 10A The retina- 10B Vision in the brain---**GT OMSCS Students**Note: Please refer to your course website/schedule for further details, assignments, etc.**Spring 2015 resources** (old):- [Schedule](https://docs.google.com/spreadsheets/d/1ecUGIyhYOfQPi3HPXb-7NndrLgpX_zgkwsqzfqHPaus/pubhtml): Suggested pace, assignments, deadlines, references.- [Course website](http://www.cc.gatech.edu/~afb/classes/CS4495-Spring2015-OMS/): Course information, problem sets, academic policies, grading scheme.- [Piazza forum](https://piazza.com/class/i4fze20e9lh1o9): Discussions, announcements, clarifications. - [T-Square site](https://t-square.gatech.edu/portal/site/gtc-4a26-e369-5db2-8cec-fb426ed49513): Problem set submissions.Note: This course was previously offered as CS 4495.Prerequisite
Instructors
- Aaron Bobick - Aaron Bobick, PhD, joined Washington University in St. Louis as Dean of the School of Engineering & Applied Science and the James M. McKelvey Professor July 1, 2015. Prior to Washington University, he was a professor and founding chair of the School of Interactive Computing at the Georgia Institute of Technology, where he was a member of the faculty since 1999. He has B.Sc. degrees from MIT in Mathematics (1981) and Computer Science (1981) and a Ph.D. from MIT in Cognitive Science (1987). He joined the MIT Media Laboratory faculty in 1992 where he was a pioneer in the area of action recognition by computer vision. In 1999 Prof. Bobick moved to Georgia Tech where he became the Director of the GVU Center, an internationally known research center in computer vision, graphics, ubiquitous computing, and HCI. In 2005 the School of Interactive Computing was created with Prof. Bobick serving as the founding Chair. Prof. Bobick is both an IEEE Fellow and an ACM Distinguished Scientist. He has served as a senior area chair for most international computer vision conferences including serving as Program Chair of IEEE Conference on Computer Vision and Pattern Recognition. He has also served on the advisory board or boards of directors of a variety of surveillance-focused computer vision and medical imaging technology companies.
- Irfan Essa - Irfan Essa is a Professor in the School of Interactive Computing (iC) and Associate Dean in the College of Computing (CoC), at the Georgia Institute of Technology (GA Tech), in Atlanta, Georgia, USA. Professor Essa works in the areas of Computer Vision, Computer Graphics, Computational Perception, Robotics and Computer Animation, Machine Learning, and Social Computing, with potential impact on Video Analysis and Production (e.g., Computational Photography & Video, Image-based Modeling and Rendering, etc.) Human Computer Interaction, Artificial Intelligence, Computational Behavioral/Social Sciences, and Computational Journalism research. He has published over 150 scholarly articles in leading journals and conference venues on these topics and several of his papers have also won best paper awards. He has been awarded the NSF CAREER and was elected to the grade of IEEE Fellow. He has held extended research consulting positions with Disney Research and Google Research and also was an Adjunct Faculty Member at Carnegie Mellon’s Robotics Institute. He joined GA Tech Faculty in 1996 after his earning his MS (1990), Ph.D. (1994), and holding research faculty position at the MIT Media Lab (1988-1996).
Editor
The Georgia Institute of Technology, also known as Georgia Tech or GT, is a co-educational public research university located in Atlanta, Georgia, USA. It is part of the wider University System of Georgia network. Georgia Tech has offices in Savannah (Georgia, USA), Metz (France), Athlone (Ireland), Shanghai (China), and Singapore.
Georgia Tech's reputation is built on its engineering and computer science programmes, which are among the best in the world5,6. The range of courses on offer is complemented by programmes in the sciences, architecture, humanities and management.
Platform
Udacity is a for-profit educational organization founded by Sebastian Thrun, David Stavens, and Mike Sokolsky offering massive open online courses (MOOCs). According to Thrun, the origin of the name Udacity comes from the company's desire to be "audacious for you, the student". While it originally focused on offering university-style courses, it now focuses more on vocational courses for professionals.