SlideShare a Scribd company logo
Stereo
CSCI 455: Computer Vision
Single image stereogram, by Niklas Een
Mark Twain at Pool Table", no date, UCR Museum of Photography
Stereo
• Given two images from different viewpoints
– How can we compute the depth of each point in the image?
– Based on how much each pixel moves between the two images
epipolar
lines
Epipolar geometry
(x1, y1) (x2, y1)
x2 -x1 = the disparity of pixel (x1, y1)
Two images captured by a purely horizontal translating camera
(rectified stereo pair)
Your basic stereo matching algorithm
• Match Pixels in Conjugate Epipolar Lines
– Assume brightness constancy
– This is a challenging problem
– Hundreds of approaches
• A good survey and evaluation: http://www.middlebury.edu/stereo/
Your basic stereo algorithm
For each epipolar line
For each pixel in the left image
• compare with every pixel on same epipolar line in right image
• pick pixel with minimum match cost
Improvement: match windows
Stereo matching based on SSD
SSD
dmin d
Best matching disparity
Window size
– Smaller window
+
•
– Larger window
+
•
W = 3 W = 20
Better results with adaptive window
• T. Kanade and M. Okutomi, A Stereo Matching Algorithm
with an Adaptive Window: Theory and Experiment,,
Proc. International Conference on Robotics and
Automation, 1991.
• D. Scharstein and R. Szeliski. Stereo matching with
nonlinear diffusion. International Journal of Computer
Vision, 28(2):155-174, July 1998
Effect of window size
Stereo results
– Data from University of Tsukuba
– Similar results on other images without ground truth
Ground truthScene
Results with window search
Window-based matching
(best window size)
Ground truth
Better methods exist...
State of the art method
Boykov et al., Fast Approximate Energy Minimization via Graph Cuts,
International Conference on Computer Vision, September 1999.
Ground truth
For the latest and greatest: http://www.middlebury.edu/stereo/
Stereo as energy minimization
• What defines a good stereo correspondence?
1. Match quality
• Want each pixel to find a good match in the other image
2. Smoothness
• If two pixels are adjacent, they should (usually) move about
the same amount
Stereo as energy minimization
• Find disparity map d that minimizes an energy
function
• Simple pixel / window matching
SSD distance between windows
I(x, y) and J(x + d(x,y), y)=
Stereo as energy minimization
I(x, y) J(x, y)
y = 141
C(x, y, d); the disparity space image (DSI)x
d
Stereo as energy minimization
y = 141
x
d
Simple pixel / window matching: choose the minimum of each
column in the DSI independently:
Greedy selection of best match
Stereo as energy minimization
• Better objective function
{
{
match cost smoothness cost
Want each pixel to find a good
match in the other image
Adjacent pixels should (usually)
move about the same amount
Stereo as energy minimization
match cost:
smoothness cost:
4-connected
neighborhood
8-connected
neighborhood
: set of neighboring pixels
Smoothness cost
“Potts model”
L1 distance
How do we choose V?
Dynamic programming
• Can minimize this independently per scanline
using dynamic programming (DP)
• Basic idea: incrementally build a table of costs
D one column at a time
: minimum cost of solution such that d(x,y) = i
Recurrence:
Base case: (L = max disparity)
Dynamic programming
• Finds “smooth”, low-cost path through DPI from left
to right
y = 141
x
d
Dynamic Programming
Dynamic programming
• Can we apply this trick in 2D as well?
• No: dx,y-1 and dx-1,y may depend on different
values of dx-1,y-1
Slide credit: D. Huttenlocher
Stereo as a minimization problem
• The 2D problem has many local minima
– Gradient descent doesn’t work well
• And a large search space
– n x m image w/ k disparities has knm possible solutions
– Finding the global minimum is NP-hard in general
• Good approximations exist… we’ll see this soon
Questions?
Depth from disparity
f
x x’
baseline
z
C C’
X
f
Real-time stereo
• Used for robot navigation (and other tasks)
– Several real-time stereo techniques have been
developed (most based on simple discrete search)
Nomad robot searches for meteorites in Antartica
http://www.frc.ri.cmu.edu/projects/meteorobot/index.html
• Camera calibration errors
• Poor image resolution
• Occlusions
• Violations of brightness constancy (specular reflections)
• Large motions
• Low-contrast image regions
Stereo reconstruction pipeline
• Steps
– Calibrate cameras
– Rectify images
– Compute disparity
– Estimate depth
What will cause errors?
Active stereo with structured light
• Project “structured” light patterns onto the object
– simplifies the correspondence problem
– basis for active depth sensors, such as Kinect and iPhone X (using IR)
camera 2
camera 1
projector
camera 1
projector
Li Zhang’s one-shot stereo
Active stereo with structured light
https://ios.gadgethacks.com/news/watch-iphone-xs-30k-ir-dots-scan-your-face-0180944/
Laser scanning
• Optical triangulation
– Project a single stripe of laser light
– Scan it across the surface of the object
– This is a very precise version of structured light scanning
Digital Michelangelo Project
http://graphics.stanford.edu/projects/mich/
Laser scanned models
The Digital Michelangelo Project, Levoy et al.
Laser scanned models
The Digital Michelangelo Project, Levoy et al.
Laser scanned models
The Digital Michelangelo Project, Levoy et al.
Laser scanned models
The Digital Michelangelo Project, Levoy et al.
Questions?

More Related Content

PPTX
Computer vision
Kartik Kalpande Patil
 
PPT
Introduction to 2D/3D Graphics
Prabindh Sundareson
 
PPTX
2 d transformations and homogeneous coordinates
Tarun Gehlot
 
PPTX
Computer Vision Introduction
Camera Culture Group, MIT Media Lab
 
PPTX
Machine Learning - Convolutional Neural Network
Richard Kuo
 
PPTX
Semantic segmentation with Convolutional Neural Network Approaches
UMBC
 
PPT
Facial Recognition: The Science, The Technology, and Market Applications
Investorideas.com
 
PPTX
Computer vision
yusifagalar
 
Computer vision
Kartik Kalpande Patil
 
Introduction to 2D/3D Graphics
Prabindh Sundareson
 
2 d transformations and homogeneous coordinates
Tarun Gehlot
 
Computer Vision Introduction
Camera Culture Group, MIT Media Lab
 
Machine Learning - Convolutional Neural Network
Richard Kuo
 
Semantic segmentation with Convolutional Neural Network Approaches
UMBC
 
Facial Recognition: The Science, The Technology, and Market Applications
Investorideas.com
 
Computer vision
yusifagalar
 

What's hot (20)

PPTX
Motion Capture Technology Computer Graphics
Rohan Patel
 
PPTX
Deep Learning in Computer Vision
Sungjoon Choi
 
PDF
Feature detection and matching
Kuppusamy P
 
PPTX
Computer vision
Mahmoud Hussein
 
PPTX
Computer Vision harris
Wael Badawy
 
PPT
Moving object detection
Manav Mittal
 
PPTX
Ai lecture 03 computer vision
Ahmad sohail Kakar
 
PPTX
Computer vision introduction
Wael Badawy
 
PPTX
Introduction to Computer Graphics
Megha Sharma
 
PPTX
Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...
Sujit Pal
 
PDF
Automated Neural Image Caption Generator for Visually Impaired People
Christopher Mehdi Elamri
 
PPT
2 d geometric transformations
Mohd Arif
 
PPTX
Computer vision
pravindesai17
 
PDF
Lec14 multiview stereo
BaliThorat1
 
PPTX
Hermit curves & beizer curves
KKARUNKARTHIK
 
PDF
Convolutional neural network
Yan Xu
 
PDF
Deep learning for image video processing
Yu Huang
 
PPTX
Computer Vision transformations
Wael Badawy
 
PPT
3D transformation
Aditya Rawat
 
Motion Capture Technology Computer Graphics
Rohan Patel
 
Deep Learning in Computer Vision
Sungjoon Choi
 
Feature detection and matching
Kuppusamy P
 
Computer vision
Mahmoud Hussein
 
Computer Vision harris
Wael Badawy
 
Moving object detection
Manav Mittal
 
Ai lecture 03 computer vision
Ahmad sohail Kakar
 
Computer vision introduction
Wael Badawy
 
Introduction to Computer Graphics
Megha Sharma
 
Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...
Sujit Pal
 
Automated Neural Image Caption Generator for Visually Impaired People
Christopher Mehdi Elamri
 
2 d geometric transformations
Mohd Arif
 
Computer vision
pravindesai17
 
Lec14 multiview stereo
BaliThorat1
 
Hermit curves & beizer curves
KKARUNKARTHIK
 
Convolutional neural network
Yan Xu
 
Deep learning for image video processing
Yu Huang
 
Computer Vision transformations
Wael Badawy
 
3D transformation
Aditya Rawat
 
Ad

Similar to Computer Vision - Stereo Vision (20)

PPTX
Introduction to Binocular Stereo in Computer Vision
othersk46
 
PPTX
Lec13 stereo converted
BaliThorat1
 
PDF
Passive stereo vision with deep learning
Yu Huang
 
PPTX
Computer Vision panoramas
Wael Badawy
 
PDF
Computer Vision - Image Formation.pdf
AmmarahMajeed
 
PDF
PPT s12-machine vision-s2
Binus Online Learning
 
PDF
Dense Image Matching - Challenges and Potentials (Keynote 3D-ARCH 2015)
Konrad Wenzel
 
PDF
State of art pde based ip to bt vijayakrishna rowthu
vijayakrishna rowthu
 
PPT
24th IP_Fundamentals.ppt
Mphill2018
 
PDF
Week06 bme429-cbir
Ikram Moalla
 
PDF
Lecture1
Mobeen Mustafa
 
PPTX
IMAGE SEGMENTATION.
Tawose Olamide Timothy
 
PDF
digital image processing lecture notes e
DeviPriyaMohan1
 
PPT
2 basic imaging and radiometry
zjnsrbt
 
PPTX
Computer Vision - cameras
Wael Badawy
 
PPTX
Topic 1- computer vision and machine learning
asnamadathodika
 
PPTX
computer Vision and Machine learning Chapter 1
asnamadathodika
 
PDF
Astronomical data processing of ccd data.pdf
ZainRahim3
 
PPTX
Conventional Neural Networks and compute
YobuDJob1
 
PPT
Image processing 1-lectures
Taymoor Nazmy
 
Introduction to Binocular Stereo in Computer Vision
othersk46
 
Lec13 stereo converted
BaliThorat1
 
Passive stereo vision with deep learning
Yu Huang
 
Computer Vision panoramas
Wael Badawy
 
Computer Vision - Image Formation.pdf
AmmarahMajeed
 
PPT s12-machine vision-s2
Binus Online Learning
 
Dense Image Matching - Challenges and Potentials (Keynote 3D-ARCH 2015)
Konrad Wenzel
 
State of art pde based ip to bt vijayakrishna rowthu
vijayakrishna rowthu
 
24th IP_Fundamentals.ppt
Mphill2018
 
Week06 bme429-cbir
Ikram Moalla
 
Lecture1
Mobeen Mustafa
 
IMAGE SEGMENTATION.
Tawose Olamide Timothy
 
digital image processing lecture notes e
DeviPriyaMohan1
 
2 basic imaging and radiometry
zjnsrbt
 
Computer Vision - cameras
Wael Badawy
 
Topic 1- computer vision and machine learning
asnamadathodika
 
computer Vision and Machine learning Chapter 1
asnamadathodika
 
Astronomical data processing of ccd data.pdf
ZainRahim3
 
Conventional Neural Networks and compute
YobuDJob1
 
Image processing 1-lectures
Taymoor Nazmy
 
Ad

More from Wael Badawy (20)

PDF
HTML introduction
Wael Badawy
 
PPTX
Np complete reductions
Wael Badawy
 
PPTX
N F A - Non Deterministic Finite Automata
Wael Badawy
 
PPTX
Parsers -
Wael Badawy
 
PPTX
Computer Vision Cameras
Wael Badawy
 
PPTX
Computer Vision Gans
Wael Badawy
 
PPTX
Computer Vision image classification
Wael Badawy
 
PPTX
Computer Vision Structure from motion
Wael Badawy
 
PDF
Universal turing
Wael Badawy
 
PDF
Turing Machine
Wael Badawy
 
PDF
Turing variations
Wael Badawy
 
PDF
Time complexity
Wael Badawy
 
PDF
Regular pumping
Wael Badawy
 
PDF
Regular pumping examples
Wael Badawy
 
PDF
Regular properties
Wael Badawy
 
PDF
Regular expressions
Wael Badawy
 
PDF
Pushdown Automota
Wael Badawy
 
PDF
Pda accept context free
Wael Badawy
 
PPTX
Computer Vision sfm
Wael Badawy
 
PPTX
Computer vision - photometric
Wael Badawy
 
HTML introduction
Wael Badawy
 
Np complete reductions
Wael Badawy
 
N F A - Non Deterministic Finite Automata
Wael Badawy
 
Parsers -
Wael Badawy
 
Computer Vision Cameras
Wael Badawy
 
Computer Vision Gans
Wael Badawy
 
Computer Vision image classification
Wael Badawy
 
Computer Vision Structure from motion
Wael Badawy
 
Universal turing
Wael Badawy
 
Turing Machine
Wael Badawy
 
Turing variations
Wael Badawy
 
Time complexity
Wael Badawy
 
Regular pumping
Wael Badawy
 
Regular pumping examples
Wael Badawy
 
Regular properties
Wael Badawy
 
Regular expressions
Wael Badawy
 
Pushdown Automota
Wael Badawy
 
Pda accept context free
Wael Badawy
 
Computer Vision sfm
Wael Badawy
 
Computer vision - photometric
Wael Badawy
 

Recently uploaded (20)

PPTX
Information Texts_Infographic on Forgetting Curve.pptx
Tata Sevilla
 
PPTX
Virus sequence retrieval from NCBI database
yamunaK13
 
PDF
Antianginal agents, Definition, Classification, MOA.pdf
Prerana Jadhav
 
PPTX
HEALTH CARE DELIVERY SYSTEM - UNIT 2 - GNM 3RD YEAR.pptx
Priyanshu Anand
 
PPTX
INTESTINALPARASITES OR WORM INFESTATIONS.pptx
PRADEEP ABOTHU
 
PPTX
Tips Management in Odoo 18 POS - Odoo Slides
Celine George
 
DOCX
pgdei-UNIT -V Neurological Disorders & developmental disabilities
JELLA VISHNU DURGA PRASAD
 
PPTX
Dakar Framework Education For All- 2000(Act)
santoshmohalik1
 
PPTX
An introduction to Dialogue writing.pptx
drsiddhantnagine
 
PPTX
Continental Accounting in Odoo 18 - Odoo Slides
Celine George
 
PPTX
How to Track Skills & Contracts Using Odoo 18 Employee
Celine George
 
PPTX
BASICS IN COMPUTER APPLICATIONS - UNIT I
suganthim28
 
PPTX
Applications of matrices In Real Life_20250724_091307_0000.pptx
gehlotkrish03
 
PPTX
PROTIEN ENERGY MALNUTRITION: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
DOCX
Unit 5: Speech-language and swallowing disorders
JELLA VISHNU DURGA PRASAD
 
PPTX
Introduction to pediatric nursing in 5th Sem..pptx
AneetaSharma15
 
PPTX
A Smarter Way to Think About Choosing a College
Cyndy McDonald
 
PDF
Review of Related Literature & Studies.pdf
Thelma Villaflores
 
PPTX
Artificial-Intelligence-in-Drug-Discovery by R D Jawarkar.pptx
Rahul Jawarkar
 
PDF
Module 2: Public Health History [Tutorial Slides]
JonathanHallett4
 
Information Texts_Infographic on Forgetting Curve.pptx
Tata Sevilla
 
Virus sequence retrieval from NCBI database
yamunaK13
 
Antianginal agents, Definition, Classification, MOA.pdf
Prerana Jadhav
 
HEALTH CARE DELIVERY SYSTEM - UNIT 2 - GNM 3RD YEAR.pptx
Priyanshu Anand
 
INTESTINALPARASITES OR WORM INFESTATIONS.pptx
PRADEEP ABOTHU
 
Tips Management in Odoo 18 POS - Odoo Slides
Celine George
 
pgdei-UNIT -V Neurological Disorders & developmental disabilities
JELLA VISHNU DURGA PRASAD
 
Dakar Framework Education For All- 2000(Act)
santoshmohalik1
 
An introduction to Dialogue writing.pptx
drsiddhantnagine
 
Continental Accounting in Odoo 18 - Odoo Slides
Celine George
 
How to Track Skills & Contracts Using Odoo 18 Employee
Celine George
 
BASICS IN COMPUTER APPLICATIONS - UNIT I
suganthim28
 
Applications of matrices In Real Life_20250724_091307_0000.pptx
gehlotkrish03
 
PROTIEN ENERGY MALNUTRITION: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
Unit 5: Speech-language and swallowing disorders
JELLA VISHNU DURGA PRASAD
 
Introduction to pediatric nursing in 5th Sem..pptx
AneetaSharma15
 
A Smarter Way to Think About Choosing a College
Cyndy McDonald
 
Review of Related Literature & Studies.pdf
Thelma Villaflores
 
Artificial-Intelligence-in-Drug-Discovery by R D Jawarkar.pptx
Rahul Jawarkar
 
Module 2: Public Health History [Tutorial Slides]
JonathanHallett4
 

Computer Vision - Stereo Vision

  • 1. Stereo CSCI 455: Computer Vision Single image stereogram, by Niklas Een
  • 2. Mark Twain at Pool Table", no date, UCR Museum of Photography
  • 3. Stereo • Given two images from different viewpoints – How can we compute the depth of each point in the image? – Based on how much each pixel moves between the two images
  • 4. epipolar lines Epipolar geometry (x1, y1) (x2, y1) x2 -x1 = the disparity of pixel (x1, y1) Two images captured by a purely horizontal translating camera (rectified stereo pair)
  • 5. Your basic stereo matching algorithm • Match Pixels in Conjugate Epipolar Lines – Assume brightness constancy – This is a challenging problem – Hundreds of approaches • A good survey and evaluation: http://www.middlebury.edu/stereo/
  • 6. Your basic stereo algorithm For each epipolar line For each pixel in the left image • compare with every pixel on same epipolar line in right image • pick pixel with minimum match cost Improvement: match windows
  • 7. Stereo matching based on SSD SSD dmin d Best matching disparity
  • 8. Window size – Smaller window + • – Larger window + • W = 3 W = 20 Better results with adaptive window • T. Kanade and M. Okutomi, A Stereo Matching Algorithm with an Adaptive Window: Theory and Experiment,, Proc. International Conference on Robotics and Automation, 1991. • D. Scharstein and R. Szeliski. Stereo matching with nonlinear diffusion. International Journal of Computer Vision, 28(2):155-174, July 1998 Effect of window size
  • 9. Stereo results – Data from University of Tsukuba – Similar results on other images without ground truth Ground truthScene
  • 10. Results with window search Window-based matching (best window size) Ground truth
  • 11. Better methods exist... State of the art method Boykov et al., Fast Approximate Energy Minimization via Graph Cuts, International Conference on Computer Vision, September 1999. Ground truth For the latest and greatest: http://www.middlebury.edu/stereo/
  • 12. Stereo as energy minimization • What defines a good stereo correspondence? 1. Match quality • Want each pixel to find a good match in the other image 2. Smoothness • If two pixels are adjacent, they should (usually) move about the same amount
  • 13. Stereo as energy minimization • Find disparity map d that minimizes an energy function • Simple pixel / window matching SSD distance between windows I(x, y) and J(x + d(x,y), y)=
  • 14. Stereo as energy minimization I(x, y) J(x, y) y = 141 C(x, y, d); the disparity space image (DSI)x d
  • 15. Stereo as energy minimization y = 141 x d Simple pixel / window matching: choose the minimum of each column in the DSI independently:
  • 16. Greedy selection of best match
  • 17. Stereo as energy minimization • Better objective function { { match cost smoothness cost Want each pixel to find a good match in the other image Adjacent pixels should (usually) move about the same amount
  • 18. Stereo as energy minimization match cost: smoothness cost: 4-connected neighborhood 8-connected neighborhood : set of neighboring pixels
  • 19. Smoothness cost “Potts model” L1 distance How do we choose V?
  • 20. Dynamic programming • Can minimize this independently per scanline using dynamic programming (DP) • Basic idea: incrementally build a table of costs D one column at a time : minimum cost of solution such that d(x,y) = i Recurrence: Base case: (L = max disparity)
  • 21. Dynamic programming • Finds “smooth”, low-cost path through DPI from left to right y = 141 x d
  • 23. Dynamic programming • Can we apply this trick in 2D as well? • No: dx,y-1 and dx-1,y may depend on different values of dx-1,y-1 Slide credit: D. Huttenlocher
  • 24. Stereo as a minimization problem • The 2D problem has many local minima – Gradient descent doesn’t work well • And a large search space – n x m image w/ k disparities has knm possible solutions – Finding the global minimum is NP-hard in general • Good approximations exist… we’ll see this soon
  • 26. Depth from disparity f x x’ baseline z C C’ X f
  • 27. Real-time stereo • Used for robot navigation (and other tasks) – Several real-time stereo techniques have been developed (most based on simple discrete search) Nomad robot searches for meteorites in Antartica http://www.frc.ri.cmu.edu/projects/meteorobot/index.html
  • 28. • Camera calibration errors • Poor image resolution • Occlusions • Violations of brightness constancy (specular reflections) • Large motions • Low-contrast image regions Stereo reconstruction pipeline • Steps – Calibrate cameras – Rectify images – Compute disparity – Estimate depth What will cause errors?
  • 29. Active stereo with structured light • Project “structured” light patterns onto the object – simplifies the correspondence problem – basis for active depth sensors, such as Kinect and iPhone X (using IR) camera 2 camera 1 projector camera 1 projector Li Zhang’s one-shot stereo
  • 30. Active stereo with structured light https://ios.gadgethacks.com/news/watch-iphone-xs-30k-ir-dots-scan-your-face-0180944/
  • 31. Laser scanning • Optical triangulation – Project a single stripe of laser light – Scan it across the surface of the object – This is a very precise version of structured light scanning Digital Michelangelo Project http://graphics.stanford.edu/projects/mich/
  • 32. Laser scanned models The Digital Michelangelo Project, Levoy et al.
  • 33. Laser scanned models The Digital Michelangelo Project, Levoy et al.
  • 34. Laser scanned models The Digital Michelangelo Project, Levoy et al.
  • 35. Laser scanned models The Digital Michelangelo Project, Levoy et al.