0% found this document useful (0 votes)

55 views72 pages

Lec00 Intro For Web Highlighted

Uploaded by

abbasahmer734

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

55 views72 pages

Lec00 Intro For Web Highlighted

Uploaded by

abbasahmer734

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 72

CS5670: Intro to Computer Vision

(Cornell Tech)
Depth from a single image
Visualizing scenes from tourist
photos
Reconstructing dynamic 3D
scenes

DynIBaR: Neural Dynamic Image-Based Rendering [

https://dynibar.github.io/]
Zhengqi Li, Qianqian Wang, Forrester Cole, Richard Tucker, Noah Snavely
CVPR 2023
Today
1. What is computer vision?

2. Why study computer vision?

3. Course overview

4. Images & image filtering [time permitting]

Today
• Readings
– Szeliski, Chapter 1 (Introduction)
Every image tells a story
• Goal of computer vision:
perceive the “story”
behind the picture
• Compute properties of
the world
– 3D shape
– Names of people or
objects
– What happened?
The goal of computer vision
Can computers match human perception?
• Yes and no (mainly no)
– computers can be better at
“easy” things
– humans are better at
“hard” things

• But huge progress

– Accelerating in the last five
years due to deep learning
– What is considered “hard”
keeps changing
Human perception has its shortcomings

https://twitter.com/pickover/status/
1460275132958662657/
But humans can tell a lot about a scene
from a little information…

Source: “80 million tiny images” by Torralba, et al.

The goal of computer vision
The goal of computer vision
• Compute the 3D shape of the world

ZED 2i Camera
The goal of computer vision
• Recognize objects and people

Terminator 2, 1991
slide credit: Fei-Fei, Fergus & Torralba
sky
building

flag

face
banner
wall
street lamp
bus bus

cars slide credit: Fei-Fei, Fergus & Torralba

The goal of computer vision
• “Enhance” images
The goal of computer vision
• Forensics

Source: Nayar and Nishino, “Eyes for Relighting”

Source: Nayar and Nishino, “Eyes for Relighting”
Source: Nayar and Nishino, “Eyes for Relighting”
The goal of computer vision
• Improve photos (“Computational Photography”)

Super-resolution (source:
2d3)

Depth of field on cell phone

camera (source:
Google Research Blog) Removing objects (
Google Magic Erase
Low-light photography r
(credit: Hasinoff et al., SIGGRAPH ASIA 2016 )
)
April 10, 2019
Why study computer vision?
• Billions of images/videos captured per day

• Huge number of potential applications

• The next slides show the current state of
Optical character recognition
(OCR) • If you have a scanner, it probably came with OCR
software

Digit recognition, AT&T labs (1990’s) License plate readers

http://en.wikipedia.org/wiki/Automatic_number_plate_recognition
http://yann.lecun.com/exdb/lenet/

Sudoku grabber
http://sudokugrab.blogspot.com/

Automatic check processing

Face detection

• Nearly all cameras detect faces in real

time
– (Why?)
Face analysis and recognition
Vision-based biometrics

Who is she? Source: S. Seitz

Vision-based biometrics

“How the Afghan Girl was Identified by Her Iris Patterns” Read
the story

Source: S. Seitz
Login without a password

Fingerprint scanners Face unlock on Apple iPhone X

on many new See also
smartphones and http://www.sensiblevision.com/
other devices
New York Times, Jan. 18, 2020
by Kashmir Hill
Bird identification

Merlin Bird ID (based on Cornell Tech technology!)

Special effects: shape capture

The Matrix movies, ESC Entertainment, XYZRGB, NRC

Source: S. Seitz
Special effects: motion capture

Pirates of the Carribean, Industrial Light and Magic Source: S. Seitz

3D face tracking w/ consumer cameras

Snapchat Lenses

Face2Face system (Thies et

Image synthesis

Karras, et al., Progressive Growing of GANs for Improved Quality, Stability, and Variation, ICLR
Which face is real?

https://www.whichfaceisreal.com/
Image synthesis

“An astronaut riding a horse in a “A photo of a Corgi dog riding a bike in

photorealistic style” – DALL-E 2 Times Square. It is wearing sunglasses and
a beach hat” – Imagen
Sports

Sportvision first down line

Explanation on www.howstuffworks.com

Source: S. Seitz
Smart cars

• Mobileye
• Tesla Autopilot
• Safety features in many cars
Self-driving cars

Waymo
Robotics

NASA’s Mars Curiosity Rover Amazon Picking Challenge

https://en.wikipedia.org/wiki/Curiosity_(rover) http://www.robocup2016.org/en/events/amazon-picking-chal
lenge/

Amazon Prime Air Amazon Scout

Medical imaging

3D imaging
(MRI, CT) Skin cancer classification with deep learning
https://cs.stanford.edu/people/esteva/nature/
Virtual & Augmented Reality

6DoF head tracking Hand & body tracking

3D scene understanding 3D-360 video capture

Current state of the art
• You just saw many examples of current systems.
– Many of these are less than 5 years old

• Computer vision is an active research area, and rapidly

changing
– Many new apps in the next 5 years
– Deep learning and generative methods powering many modern
applications

• Many startups across a dizzying array of areas

– Generative AI, robotics, autonomous vehicles, medical
imaging, construction, inspection, VR/AR, …
Why is computer vision difficult?

Viewpoint variation

Credit: Flickr user michaelpaul

Scale
Illumination
Why is computer vision difficult?

Motion (Source: S. Lazebnik)

Intra-class variation

Background clutter Occlusion

Challenges: local ambiguity

slide credit: Fei-Fei, Fergus & Torralba

But there are lots of visual cues we can
use…

Source: S. Lazebnik
Bottom line
• Perception is an inherently ambiguous problem
– Many different 3D scenes could have given rise to a given 2D
image

Artist Julian Beever with his anamorphic Coke bottle

– We often must use prior knowledge about the world’s
structure Image source: F. Durand
CS5670: Introduction to Computer Vision

• Project-based course whose goal is to teach you

the basics of computer vision – image processing,
geometry, recognition – in a hands-on way
Course requirements
• Prerequisites
– Data structures
– Good working knowledge of Python programming
– Linear algebra
– Vector calculus

• Course does not assume prior imaging

experience
– computer vision, image processing, graphics, etc.
Course overview
(tentative)
1. Low-level vision
– image processing, edge detection,
feature detection, cameras, image
formation

2. Geometry & appearance

– projective geometry, stereo, structure
from motion, optimization, lighting &
materials

3. Recognition & generative

models
– object classification, deep learning,
1. Low-level vision
• Basic image processing and image formation

* =
Filtering, edge detection

Feature extraction Image formation

Project: Hybrid images
Project: Feature detection and matching
2. Geometry & appearance

Image credit: IDS Imaging

Projective geometry Stereo vision

Multi-view stereo Structure from motion

Project: Creating panoramas
Project: 3D reconstruction
3. Recognition, Deep Learning &
Generative Models

“dog”

Image classification Convolutional Neural Networks

“a class watching a computer vision lecture at Cornell Tech”

Image generation
Project: Neural Radiance Fields
(NeRFs)
Questions?

Lec00 Intro For Web
No ratings yet
Lec00 Intro For Web
81 pages
Lec00 Intro Computervision
No ratings yet
Lec00 Intro Computervision
58 pages
CV #1 Course Introduction-1
No ratings yet
CV #1 Course Introduction-1
61 pages
1 Intro Visión Artificial
No ratings yet
1 Intro Visión Artificial
50 pages
00CV Intro Full
No ratings yet
00CV Intro Full
58 pages
Intro to Computer Vision Course
No ratings yet
Intro to Computer Vision Course
76 pages
Computer Vision
100% (1)
Computer Vision
48 pages
Ilovepdf Merged Compressed
No ratings yet
Ilovepdf Merged Compressed
1,100 pages
Lec01 Intro
No ratings yet
Lec01 Intro
55 pages
Ch-3 Image AnalysisComputer Vision
No ratings yet
Ch-3 Image AnalysisComputer Vision
88 pages
1 Vision Lec 1
No ratings yet
1 Vision Lec 1
49 pages
Lec01 Intro
No ratings yet
Lec01 Intro
61 pages
Computer Vision
No ratings yet
Computer Vision
52 pages
1 Intro
No ratings yet
1 Intro
103 pages
Computer Vision ch1
No ratings yet
Computer Vision ch1
80 pages
Lecture 01 Introduction
No ratings yet
Lecture 01 Introduction
61 pages
CV Overview
No ratings yet
CV Overview
83 pages
Lect1 PDF
100% (1)
Lect1 PDF
45 pages
CV Module 1
100% (1)
CV Module 1
166 pages
1a. Introduction
No ratings yet
1a. Introduction
32 pages
Computer Vision Intorduction
No ratings yet
Computer Vision Intorduction
57 pages
Lec01 CT Intro
No ratings yet
Lec01 CT Intro
61 pages
Computer Vision: Linda Shapiro
No ratings yet
Computer Vision: Linda Shapiro
73 pages
CS 474 Lec 01 Introduction
No ratings yet
CS 474 Lec 01 Introduction
69 pages
Prerequisites: What Is Computer Vision? Vision For Measurement
No ratings yet
Prerequisites: What Is Computer Vision? Vision For Measurement
8 pages
Lecture1 - Introduction
No ratings yet
Lecture1 - Introduction
35 pages
Unit 1
No ratings yet
Unit 1
186 pages
803 (A) Image Processing and Computer Vision#: Subject In-Charge: Prof Shilpa Sharma
No ratings yet
803 (A) Image Processing and Computer Vision#: Subject In-Charge: Prof Shilpa Sharma
44 pages
Chapter 1 - Introduction To CV
No ratings yet
Chapter 1 - Introduction To CV
49 pages
CV Unit 1 Overview of Computer Vison and Application
No ratings yet
CV Unit 1 Overview of Computer Vison and Application
51 pages
Lec01 - Intro To Computer Vision
No ratings yet
Lec01 - Intro To Computer Vision
43 pages
CS7.505: Computer Vision: Spring 2022
No ratings yet
CS7.505: Computer Vision: Spring 2022
46 pages
Computer Vision Part1
No ratings yet
Computer Vision Part1
96 pages
CSE480: Machine Vision
No ratings yet
CSE480: Machine Vision
51 pages
Lecture 01 Introduction
No ratings yet
Lecture 01 Introduction
62 pages
Computer Vision for Beginners
No ratings yet
Computer Vision for Beginners
26 pages
Introduction To Computer Vision
No ratings yet
Introduction To Computer Vision
34 pages
Lecture 1
100% (1)
Lecture 1
21 pages
Computer Vision SM-1
No ratings yet
Computer Vision SM-1
26 pages
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
No ratings yet
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
61 pages
Introduction To Computer Vision: by James Hays
No ratings yet
Introduction To Computer Vision: by James Hays
32 pages
PDF Joiner
No ratings yet
PDF Joiner
38 pages
Computer Vision Presentation Updated
No ratings yet
Computer Vision Presentation Updated
15 pages
Computer Vision: Evolution and Promise
No ratings yet
Computer Vision: Evolution and Promise
5 pages
Comp Vis Week 1
No ratings yet
Comp Vis Week 1
39 pages
What Is Computer Vision
No ratings yet
What Is Computer Vision
18 pages
Department of Computer Science and Engineering - University of Bologna
No ratings yet
Department of Computer Science and Engineering - University of Bologna
23 pages
Lecture AI 15 23052025 112103am
No ratings yet
Lecture AI 15 23052025 112103am
69 pages
LectureNotes PDF
No ratings yet
LectureNotes PDF
212 pages
Deep Learning in Computer Vision Course
No ratings yet
Deep Learning in Computer Vision Course
35 pages
Computer Vision Basics for Beginners
No ratings yet
Computer Vision Basics for Beginners
21 pages
Introduction to Data Science: (Khoa học dữ liệu)
No ratings yet
Introduction to Data Science: (Khoa học dữ liệu)
91 pages
Computer Vision Presentation AI
No ratings yet
Computer Vision Presentation AI
16 pages
IT5409 Ch1 Intro New Template
No ratings yet
IT5409 Ch1 Intro New Template
14 pages
Visual Information Processing Course TDS3651
No ratings yet
Visual Information Processing Course TDS3651
73 pages
18cse390t U1 s1 Slo1 Content
No ratings yet
18cse390t U1 s1 Slo1 Content
15 pages
1 Intro24
No ratings yet
1 Intro24
79 pages
01 - Introduction
No ratings yet
01 - Introduction
37 pages
FC300X Camera Processing Report
No ratings yet
FC300X Camera Processing Report
6 pages
M. Zain Ul Abideen: Profile
No ratings yet
M. Zain Ul Abideen: Profile
1 page
Photoshop for Web Design Essentials
No ratings yet
Photoshop for Web Design Essentials
27 pages
Art Harmony: Elements & Techniques
No ratings yet
Art Harmony: Elements & Techniques
3 pages
DDI Dimensional Imaging DI3D
100% (1)
DDI Dimensional Imaging DI3D
4 pages
Composite Video Separation Techniques: Application Note October 1996 AN9644
No ratings yet
Composite Video Separation Techniques: Application Note October 1996 AN9644
8 pages
History of Photography
No ratings yet
History of Photography
19 pages
Prelim Medsurg 2 Lec
No ratings yet
Prelim Medsurg 2 Lec
18 pages
Visionix VX 650: Comprehensive Eye Screening
No ratings yet
Visionix VX 650: Comprehensive Eye Screening
8 pages
Optical Coherence Tomography
No ratings yet
Optical Coherence Tomography
33 pages
Berlin and Kay's Color Theory
No ratings yet
Berlin and Kay's Color Theory
4 pages
Miniature Photography Tips & Techniques
No ratings yet
Miniature Photography Tips & Techniques
31 pages
Aurosil - Issue 2
No ratings yet
Aurosil - Issue 2
1 page
A Review of Ocular Genetics and Inherited Eye Diseases: SD Mathebula
No ratings yet
A Review of Ocular Genetics and Inherited Eye Diseases: SD Mathebula
12 pages
Question 1365184
No ratings yet
Question 1365184
4 pages
2021 Axial Length Targets For Myopia Control
No ratings yet
2021 Axial Length Targets For Myopia Control
9 pages
Scanning Colour Negatives: by Ian Lyons
No ratings yet
Scanning Colour Negatives: by Ian Lyons
6 pages
Microtropia
100% (1)
Microtropia
24 pages
PixInsight Image Processing Guide
No ratings yet
PixInsight Image Processing Guide
3 pages
Near vs. Distance Vision in Cataracts
No ratings yet
Near vs. Distance Vision in Cataracts
4 pages
Signal Phrases & Visual Aids Guide
0% (1)
Signal Phrases & Visual Aids Guide
25 pages
Lecture Note 2-GenEd 108
No ratings yet
Lecture Note 2-GenEd 108
6 pages
Comprehensive Ophthalmic Examination Guide
No ratings yet
Comprehensive Ophthalmic Examination Guide
24 pages
99d BrandGuide Template
No ratings yet
99d BrandGuide Template
13 pages
Thermal Camera for Public Safety
No ratings yet
Thermal Camera for Public Safety
4 pages
Fevicryl Catalogue
100% (1)
Fevicryl Catalogue
64 pages
Canon EOS 1000FN User Manual
No ratings yet
Canon EOS 1000FN User Manual
109 pages
Central Corneal Thickness: Pachymetry vs OCT
No ratings yet
Central Corneal Thickness: Pachymetry vs OCT
5 pages
The Beginning Artist's Guide To Perspective Drawing
No ratings yet
The Beginning Artist's Guide To Perspective Drawing
19 pages
Correlation Between Near Point of Convergence and Stereoacuity
No ratings yet
Correlation Between Near Point of Convergence and Stereoacuity
6 pages

Lec00 Intro For Web Highlighted

Uploaded by

Lec00 Intro For Web Highlighted

Uploaded by

CS5670: Intro to Computer Vision

DynIBaR: Neural Dynamic Image-Based Rendering [

2. Why study computer vision?

4. Images & image filtering [time permitting]

• But huge progress

Source: “80 million tiny images” by Torralba, et al.

cars slide credit: Fei-Fei, Fergus & Torralba

Source: Nayar and Nishino, “Eyes for Relighting”

Depth of field on cell phone

• Huge number of potential applications

Digit recognition, AT&T labs (1990’s) License plate readers

Automatic check processing

• Nearly all cameras detect faces in real

Who is she? Source: S. Seitz

Fingerprint scanners Face unlock on Apple iPhone X

Merlin Bird ID (based on Cornell Tech technology!)

The Matrix movies, ESC Entertainment, XYZRGB, NRC

Pirates of the Carribean, Industrial Light and Magic Source: S. Seitz

Face2Face system (Thies et

“An astronaut riding a horse in a “A photo of a Corgi dog riding a bike in

Sportvision first down line

NASA’s Mars Curiosity Rover Amazon Picking Challenge

Amazon Prime Air Amazon Scout

6DoF head tracking Hand & body tracking

3D scene understanding 3D-360 video capture

• Computer vision is an active research area, and rapidly

• Many startups across a dizzying array of areas

Credit: Flickr user michaelpaul

Motion (Source: S. Lazebnik)

Background clutter Occlusion

slide credit: Fei-Fei, Fergus & Torralba

Artist Julian Beever with his anamorphic Coke bottle

• Project-based course whose goal is to teach you

• Course does not assume prior imaging

2. Geometry & appearance

3. Recognition & generative

Feature extraction Image formation

Image credit: IDS Imaging

Projective geometry Stereo vision

Multi-view stereo Structure from motion

Image classification Convolutional Neural Networks

“a class watching a computer vision lecture at Cornell Tech”

You might also like