lec2
lec2
VISION
Labs 10% 5% -
Lab 1 2.5% 1.25% February 6 (due approximately one
week later)
Lab 2 2.5% 1.25% February 27 (due approximately one
week later)
Lab 3 2.5% 1.25% March 13 ((due approximately one
week later)
Lab 4 2.5% 1.25% March 27 (due approximately one
week later)
a simulation of eyes
Converting images to more understandable things like distance, edges,
directions etc.
Computer getting information out of images/video
Giving the computer "eyes" to see and identify as humans would.
Teach computer to interpret and understand our world through images.
What is computer vision?
Terminator 2
What is computer vision?
Terminator 2
Every picture tells a story
235 217 115 212 243 236 247 139 91 209 208 211
233 208 131 222 219 226 196 114 74 208 213 214
232 232 182 186 184 179 159 123 93 232 235 235
232 236 201 154 216 133 129 81 175 252 241 240
235 238 230 128 172 138 65 63 234 249 241 245
234 237 245 193 55 33 115 144 213 255 253 251
248 245 161 128 149 109 138 65 47 156 239 255
Apple Maps
2D Maps
Google Maps
Computational photography
Portrait mode
simulating wider aperture
Even wider aperture...
3D Photos on Facebook
Estimate depth from photo to create animation
https://ai.facebook.com/blog/-powered-by-ai-turning-any-2d
-photo-into-3d-using-convolutional-neural-nets/
Face recognition
Who is she?
Vision-based biometrics
“How the Afghan Girl was Identified by Her Iris Patterns” Read the story
Object recognition
Special effects: shape capture
Microsoft Hololens 2
HoloLens2 Sensors
4 head-tracking
8Mpix RGB cameras
camera (stereo + periphery)
+
IMU
IR eye cameras + IR
LEDs
5 microphone
array
Augmented Reality
Phone-based AR
https://www.youtube.com/watch?v=0Pj-jzy6ESE
Body Tracking
Robotics
Mobileye
• Vision systems currently in high-end BMW, GM, Volvo models
Self-driving cars
https://waymo.com/tech/
Drones
https://www.skydio.com/
Research: Timelapse
Research: Neural Rendering
Research: Yolo
Research: StyleGan
Current state of the art
You just saw examples of current systems.
• Many of these are less than 5 years old
Predictor
Low-level features
Mid-level High-level Process features
Lines, oriented
features features and predict output
edges
Combine edges: Combine shapes:
curves, shapes objects, scenes
Object Detection
https://heartbeat.fritz.ai/introduction-to-basic-object-detection-algorithms-b77295a95a63
Segmentation
https://gts.ai/how-do-we-solve-the-challenges-faced-due-to-semantic-segmentation/
Features
Optical Flow
https://www.commonlounge.com/discussion/1c2eaa85265f47a3a0a8ff1ac5fbce51
Stereo and Depth
3D Mapping (SLAM and SfM)
3D Shape and Appearance
Computational Photography
https://ai.googleblog.com/2017/10/portrait-mode-on-pixel-2-and-pixel-2-xl.html