Study 10-15 hrs/week and complete in 3 months.
Learning material and communication in English
From computer graphics to social robotics to autonomous vehicles, computer vision is powering world-changing new technologies. In this program, you’ll write code to perform everything from facial recognition to scene-understanding to object tracking; by the end of this program, you’ll have a broad portfolio of applications that you’ve built!
Computer vision is a rapidly growing field that powers a variety of emerging technologies—from facial recognition to augmented reality to self-driving cars. Learn the latest deep learning architectures and image processing techniques today!
We collaborated with industry leaders from NVIDIA to Affectiva to build a program that showcases how computer vision is being applied on the front-lines of technology today.
You’ll learn how to program computer vision techniques in Python, and then use that knowledge to create your own applications! You’ll complete three major computer vision projects, and build a strong portfolio in the process.
Get personalized feedback on your computer vision projects from a team of technical reviewers. The invaluable reviews you receive mirror the experience of working on a team of engineers and mentors, and this feedback offers you unique and actionable insights as to how you should develop code!
See fewer details
This program requires experience with Python, statistics, machine learning, and deep learning.See detailed requirements.
Master computer vision and image processing essentials. Learn to extract important features from image data, and apply deep learning techniques to classification tasks.Facial Keypoint Detection
Learn to apply deep learning architectures to computer vision tasks. Discover how to combine CNN and RNN networks to build an automatic image captioning application.Automatic Image Captioning
Learn how to locate an object and track it over time. These techniques are used in a variety of moving systems, such as self-driving car navigation and drone flight.Landmark Detection & Tracking
“Computer vision is one of the most important elements of machine intelligence and the transformation of enterprise and companies.”— Fei-Fei Li, Chief Scientist, Google Cloud
Sebastian Thrun is a scientist, educator, inventor, and entrepreneur. Prior to founding Udacity, he launched Google’s self-driving car project.
Cezanne is an expert in computer vision with a Masters in Electrical Engineering from Stanford University. As a former researcher in genomics and biomedical imaging, she’s applied computer vision and deep learning to medical diagnostic applications.
Alexis is an applied mathematician with a Masters in computer science from Brown University and a Masters in applied mathematics from the University of Michigan. She was formerly a National Science Foundation Graduate Research Fellow.
Juan is a computational physicist with a Masters in Astronomy. He is finishing his PhD in Biophysics. He previously worked at NASA developing space instruments and writing software to analyze large amounts of scientific data using machine learning techniques.
Jay has a degree in computer science, loves visualizing machine learning concepts, and is the Investment Principal at STV, a $500 million venture capital fund focused on high-technology startups.
Ortal Arel has a PhD in Computer Engineering, and has been professor and researcher in the field of applied cryptography and embedded platforms. She has worked on design and analysis of intelligent algorithms for high-speed custom digital architectures.
Luis was formerly a Machine Learning Engineer at Google. He holds a PhD in mathematics from the University of Michigan, and a Postdoctoral Fellowship at the University of Quebec at Montreal.
Learn the essentials of computer vision, including image transformation, neural network architectures, and object recognition.