SlideShare a Scribd company logo
Computer Vision
Pattern Recognition
Credit: Saturday Morning Breakfast Cereal
http://www.smbc-comics.com/?id=2088
Definitions
Computer vision: the ability for a computer to understand
the contents of images
Pattern recognition: Classification of images
Image processing: preparing images for human or
computer consumption
Computer Vision: Applications
Object recognition
Facial recognition and expression recognition
Surveillance
Camera stabilization
GIS
Crowd / moving object tracking
Robotics
Computer graphics & Games
Tools
Mahotas
SciPy ndimage
OpenCV
Pillow
scikit-image
NumPy, Matplotlib, scikit-learn
http://www.pyimagesearch.com/2014/01/12/my-top-9-favorite-python-libraries-
for-building-image-search-engines/
Mathematical Concepts
Convolution & Correlation
Discrete Differentiation / Image Gradient
Determinants & Trace
Eigenvectors & Eigenvalues
Statistics
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Convolution vs Correlation
Doesn’t matter which one you choose as long
as your kernel is symmetric
Convolution is associative & commutative
Credit: Mubarak Shah, University of Central Florida
Gradient Magnitude & Direction
2x2:
3x3:
Eigenvalues:
Determinants
Main diagonal:
Eigenvalues:
Trace
Credit: Mubarak Shah, University of Central Florida
Image processing: Components
Smoothing filters
Edge detection
Interest point detection
Descriptors
Smooth Filtering
Averaging
Gaussian Filter
Median Filter
Bilateral Filter
Averaging
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Median Filter
Non-linear filter
Take all intensities within window
Sort intensities
Use intensity in the middle
Bilateral Filter
Non-linear filter
Uses Gaussian filtering, but adjusts the weights
according to how closely a pixel within the
window matches the intensity of the center
pixel
Smooth Filtering: Summary
Averaging: Blurs edges the most
Gaussian Filter: Good for Gaussian noise, used often
Median Filter: Good for salt & pepper noise
Bilateral Filter: Preserves edges the most
Thresholding
Otsu
Ridler-Calvard
Edge Detection
Prewitt
Sobel
Marr-Hildreth
Canny
Prewitt & Sobel Edge Detectors
Prewitt Edge Detector (1970)
- Gradient operator w/ averaging
Paper: J.M.S. Prewitt. “Object Enhancement and Extraction”. 1970
Sobel Edge Detector (1968)
- Gradient operator w/ weighted average
Talk: I. Sobel. “Isotropic 3x3 Image Gradient Operator”. 1968
Computer Vision: Pattern Recognition
Computer Vision: Pattern Recognition
Computer Vision: Pattern Recognition
Computer Vision: Pattern Recognition
Computer Vision: Pattern Recognition
Marr-Hildreth Edge Detector (1980)
Laplacian of Gaussian
Inspired by neurophysiology:
Neurons for the eyes perform Gaussian filtering
Paper: D. Marr, E. Hildreth. “Theory of Edge Detection”. 1980
Marr-Hildreth: Algorithm
Computer Vision: Pattern Recognition
Computer Vision: Pattern Recognition
Canny Edge Detector (1986)
Gradient of Gaussian
Pure mathematical approach
Paper: J. Canny. “A Computational Approach To Edge Detection”. 1986
Computer Vision: Pattern Recognition
Computer Vision: Pattern Recognition
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Computer Vision: Pattern Recognition
Computer Vision: Pattern Recognition
Computer Vision: Pattern Recognition
Computer Vision: Pattern Recognition
Computer Vision: Pattern Recognition
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Haralick Texture Descriptor (1973)
Paper: R.M. Haralick. “Textural Features for Image Classification”. 1973
Haralick: Algorithm
Gray-level co-occurrence matrix
- Element [i, j]: number of times pixel of
intensity i is adjacent to intensity j
- Divide matrix by number of comparisons
made (~ number of pixels in image)
http://murphylab.web.cmu.edu/publications/boland/boland_node26.html
Haralick: Algorithm
4 co-occurrence matrices, one for each
adjacency direction:
horizontal, vertical, left and right diagonals
Haralick: Zernike moments (Features)
● Angular second moment
● Contrast
● Correlation
● Sum of differences:
variance
● Inverse difference
moment
● Sum average
● Sum variance
● Sum entropy
● Entropy
● Difference variance
● Difference entropy
● Info. measure of
correlation 1
● Info. measure of
correlation 2
● Max. correlation coeff.
Pyramids (1983)
Gaussian Pyramid
Application: Image compositing
Laplacian Pyramid
Application: Image compression
Paper: E.H. Adelson, P.J. Burt. “The Laplacian Pyramid as a Compact Image
Code”. 1983
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Reduce & Expand operations performed using:
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
SIFT & SURF
SIFT: Scale-Invariant Feature Transform
SURF: Speeded-Up Robust Features
SIFT (1999)
Inspired by primate visual system (human, ape,
monkey, etc.)
Paper: D. Lowe. “Distinctive Image Features from Scale-Invariant Keypoints”.
2004
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
Credit: Mubarak Shah, University of Central Florida
SURF (2006)
Paper: Herbert Ray, et al. “SURF: Speeded Up Robust Features”. 2006
SIFT vs SURF
SURF has similar performance to SIFT, but is
much faster
When time isn’t critical, SIFT tends to
outperform SURF
Examples
Photosynth
https://photosynth.net/view.aspx?cid=6b7abc6f-d6a0-43f5-
9632-b28905fc0ed1
Barbie Digital Makeover
https://youtu.be/Ta1GmdKj9Qc?t=4m40s
Acknowledgements: Videos
M. Shah, University of Central Florida- Center for Research
in Computer Vision YouTube Videos
https://www.youtube.com/user/UCFCRCV
A. Ng, Stanford University
https://youtu.be/n1ViNeWhC24
Acknowledgements: Books
J.E. Solem. Programming Computer Vision with Python.
2012
W. Richert, L.P. Coelho. Building Machine Learning
Systems with Python. 2013
A. Rosebrock. Practical Python and OpenCV. 2014

More Related Content

PPTX
Ai lecture 03 computer vision
Ahmad sohail Kakar
 
PPTX
IRIS RECOGNITION
Ankit Kumar
 
PPTX
Facial recognition system
Divya Sushma
 
PPTX
Computer Vision
ArtiKhanchandani
 
PPTX
Canny Edge Detection
SN Chakraborty
 
PPT
Fields of digital image processing slides
Srinath Dhayalamoorthy
 
PPTX
Object Recognition
Eman Abed AlWahhab
 
PPTX
Computer vision
pravindesai17
 
Ai lecture 03 computer vision
Ahmad sohail Kakar
 
IRIS RECOGNITION
Ankit Kumar
 
Facial recognition system
Divya Sushma
 
Computer Vision
ArtiKhanchandani
 
Canny Edge Detection
SN Chakraborty
 
Fields of digital image processing slides
Srinath Dhayalamoorthy
 
Object Recognition
Eman Abed AlWahhab
 
Computer vision
pravindesai17
 

What's hot (20)

PPTX
Computer vision
Kartik Kalpande Patil
 
PPTX
Face recognization
leenak770
 
PDF
Computer Vision
Kan-Han (John) Lu
 
PPTX
Computer Vision Introduction
Camera Culture Group, MIT Media Lab
 
PPTX
Image Processing and Computer Vision
Silicon Mentor
 
PPTX
Computer vision introduction
Wael Badawy
 
PPTX
Face recognigion system ppt
Ravi Kumar
 
PPTX
Computer Vision
Ameer Mohamed Rajah
 
PPT
Computer Vision Basics
Suren Kumar
 
PPTX
face recognition
vipin varghese
 
PPTX
Face recognition attendance system
Naomi Kulkarni
 
PPTX
Iris Recognition Technology
RutikBhoyar
 
PPT
Face recognition ppt
Santosh Kumar
 
PDF
Image recognition
Nikhil Singh
 
ODP
An Introduction to Computer Vision
guestd1b1b5
 
PPTX
AI Computer vision
Kashafnaz2
 
PPT
Natural Interfaces for Augmented Reality
Mark Billinghurst
 
PPTX
1ère Présentation Atelier Vision par ordinateur
Samir TABIB
 
PPSX
Image Processing Basics
Dr. A. B. Shinde
 
PPTX
What is computer vision?
Qentinel
 
Computer vision
Kartik Kalpande Patil
 
Face recognization
leenak770
 
Computer Vision
Kan-Han (John) Lu
 
Computer Vision Introduction
Camera Culture Group, MIT Media Lab
 
Image Processing and Computer Vision
Silicon Mentor
 
Computer vision introduction
Wael Badawy
 
Face recognigion system ppt
Ravi Kumar
 
Computer Vision
Ameer Mohamed Rajah
 
Computer Vision Basics
Suren Kumar
 
face recognition
vipin varghese
 
Face recognition attendance system
Naomi Kulkarni
 
Iris Recognition Technology
RutikBhoyar
 
Face recognition ppt
Santosh Kumar
 
Image recognition
Nikhil Singh
 
An Introduction to Computer Vision
guestd1b1b5
 
AI Computer vision
Kashafnaz2
 
Natural Interfaces for Augmented Reality
Mark Billinghurst
 
1ère Présentation Atelier Vision par ordinateur
Samir TABIB
 
Image Processing Basics
Dr. A. B. Shinde
 
What is computer vision?
Qentinel
 
Ad

Viewers also liked (15)

PDF
Letter
Salim Salim
 
PDF
Python in Computer Vision
Brian Thorne
 
PPTX
Kuldeep
kuldeep shukla
 
PPT
Images and Vision in Python
streety
 
PDF
CVPR2010: higher order models in computer vision: Part 3
zukun
 
PPT
General introduction to computer vision
butest
 
PDF
Brunelli 2008: template matching techniques in computer vision
zukun
 
PDF
Applying Computer Vision to Art History
jeresig
 
PPTX
Augmented Reality
Tareq Mulla
 
PDF
How Computer Vision is Reshaping Real Estate Search - Andrew Flachner
Inman News
 
PDF
Video Analysis with Convolutional Neural Networks (Master Computer Vision Bar...
Universitat Politècnica de Catalunya
 
PDF
"Computer Vision and Artificial Intelligence: Market Trends and Implications,...
Edge AI and Vision Alliance
 
PPTX
Computer Vision Crash Course
台灣資料科學年會
 
PPTX
Cisco Web and Email Security Overview
Cisco Security
 
PPTX
Optimization algorithms for solving computer vision problems
Krzysztof Wegner
 
Letter
Salim Salim
 
Python in Computer Vision
Brian Thorne
 
Images and Vision in Python
streety
 
CVPR2010: higher order models in computer vision: Part 3
zukun
 
General introduction to computer vision
butest
 
Brunelli 2008: template matching techniques in computer vision
zukun
 
Applying Computer Vision to Art History
jeresig
 
Augmented Reality
Tareq Mulla
 
How Computer Vision is Reshaping Real Estate Search - Andrew Flachner
Inman News
 
Video Analysis with Convolutional Neural Networks (Master Computer Vision Bar...
Universitat Politècnica de Catalunya
 
"Computer Vision and Artificial Intelligence: Market Trends and Implications,...
Edge AI and Vision Alliance
 
Computer Vision Crash Course
台灣資料科學年會
 
Cisco Web and Email Security Overview
Cisco Security
 
Optimization algorithms for solving computer vision problems
Krzysztof Wegner
 
Ad

Similar to Computer Vision: Pattern Recognition (20)

PDF
Machine Learning:
butest
 
DOC
Elegant Resume
butest
 
DOC
Elegant Resume
butest
 
PPT
Ben Shneiderman: Thrill of Discovery
russ9595
 
PDF
divita_resume_finalv2
Joseph DiVita
 
PDF
Adaptive survey design 1st Edition Peytchev
yvdbsuxc0277
 
PPTX
The UVA School of Data Science
Philip Bourne
 
PDF
CV
jasonj383
 
PDF
Introduction to Data Mining
AbcdDcba12
 
DOC
FayinLi_CV_Full.doc
butest
 
PDF
Adaptive survey design 1st Edition Peytchev
bruhnlautzdb
 
PPTX
6. Work6 Social Distancing.pptx
Vanditha11
 
PPT
Data Mining introduction and basic concepts
PritiRishi
 
PDF
Computational Studies of Human Motion David Forsyth
calskievit
 
PDF
jlkcv
Jeff Krichmar
 
PDF
Petascale Analytics Largescale Machine Learning In The Earth Sciences 1st Edi...
baladadzekui
 
PDF
resume_Yuli_Liang
Yuli Liang
 
PPT
Contractor-Borner-SNA-SAC
webuploader
 
PDF
Don't blindly trust your ML System, it may change your life (Azzurra Ragone, ...
Data Driven Innovation
 
PPTX
Data mining
Dharani Dhanaraj
 
Machine Learning:
butest
 
Elegant Resume
butest
 
Elegant Resume
butest
 
Ben Shneiderman: Thrill of Discovery
russ9595
 
divita_resume_finalv2
Joseph DiVita
 
Adaptive survey design 1st Edition Peytchev
yvdbsuxc0277
 
The UVA School of Data Science
Philip Bourne
 
Introduction to Data Mining
AbcdDcba12
 
FayinLi_CV_Full.doc
butest
 
Adaptive survey design 1st Edition Peytchev
bruhnlautzdb
 
6. Work6 Social Distancing.pptx
Vanditha11
 
Data Mining introduction and basic concepts
PritiRishi
 
Computational Studies of Human Motion David Forsyth
calskievit
 
Petascale Analytics Largescale Machine Learning In The Earth Sciences 1st Edi...
baladadzekui
 
resume_Yuli_Liang
Yuli Liang
 
Contractor-Borner-SNA-SAC
webuploader
 
Don't blindly trust your ML System, it may change your life (Azzurra Ragone, ...
Data Driven Innovation
 
Data mining
Dharani Dhanaraj
 

Recently uploaded (20)

PDF
Key Features to Look for in Arizona App Development Services
Net-Craft.com
 
PDF
lesson-2-rules-of-netiquette.pdf.bshhsjdj
jasmenrojas249
 
PDF
49785682629390197565_LRN3014_Migrating_the_Beast.pdf
Abilash868456
 
PPTX
AI-Ready Handoff: Auto-Summaries & Draft Emails from MQL to Slack in One Flow
bbedford2
 
PDF
ShowUs: Pharo Stream Deck (ESUG 2025, Gdansk)
ESUG
 
PPTX
Role Of Python In Programing Language.pptx
jaykoshti048
 
PDF
Bandai Playdia The Book - David Glotz
BluePanther6
 
PPTX
Presentation about Database and Database Administrator
abhishekchauhan86963
 
DOCX
Can You Build Dashboards Using Open Source Visualization Tool.docx
Varsha Nayak
 
PPTX
ConcordeApp: Engineering Global Impact & Unlocking Billions in Event ROI with AI
chastechaste14
 
PPTX
PFAS Reporting Requirements 2026 Are You Submission Ready Certivo.pptx
Certivo Inc
 
PDF
New Download MiniTool Partition Wizard Crack Latest Version 2025
imang66g
 
PDF
49784907924775488180_LRN2959_Data_Pump_23ai.pdf
Abilash868456
 
PPT
Why Reliable Server Maintenance Service in New York is Crucial for Your Business
Sam Vohra
 
PPT
Activate_Methodology_Summary presentatio
annapureddyn
 
PPTX
Odoo Integration Services by Candidroot Solutions
CandidRoot Solutions Private Limited
 
PPTX
The-Dawn-of-AI-Reshaping-Our-World.pptxx
parthbhanushali307
 
PPTX
Explanation about Structures in C language.pptx
Veeral Rathod
 
PPTX
oapresentation.pptx
mehatdhavalrajubhai
 
PPTX
slidesgo-unlocking-the-code-the-dynamic-dance-of-variables-and-constants-2024...
kr2589474
 
Key Features to Look for in Arizona App Development Services
Net-Craft.com
 
lesson-2-rules-of-netiquette.pdf.bshhsjdj
jasmenrojas249
 
49785682629390197565_LRN3014_Migrating_the_Beast.pdf
Abilash868456
 
AI-Ready Handoff: Auto-Summaries & Draft Emails from MQL to Slack in One Flow
bbedford2
 
ShowUs: Pharo Stream Deck (ESUG 2025, Gdansk)
ESUG
 
Role Of Python In Programing Language.pptx
jaykoshti048
 
Bandai Playdia The Book - David Glotz
BluePanther6
 
Presentation about Database and Database Administrator
abhishekchauhan86963
 
Can You Build Dashboards Using Open Source Visualization Tool.docx
Varsha Nayak
 
ConcordeApp: Engineering Global Impact & Unlocking Billions in Event ROI with AI
chastechaste14
 
PFAS Reporting Requirements 2026 Are You Submission Ready Certivo.pptx
Certivo Inc
 
New Download MiniTool Partition Wizard Crack Latest Version 2025
imang66g
 
49784907924775488180_LRN2959_Data_Pump_23ai.pdf
Abilash868456
 
Why Reliable Server Maintenance Service in New York is Crucial for Your Business
Sam Vohra
 
Activate_Methodology_Summary presentatio
annapureddyn
 
Odoo Integration Services by Candidroot Solutions
CandidRoot Solutions Private Limited
 
The-Dawn-of-AI-Reshaping-Our-World.pptxx
parthbhanushali307
 
Explanation about Structures in C language.pptx
Veeral Rathod
 
oapresentation.pptx
mehatdhavalrajubhai
 
slidesgo-unlocking-the-code-the-dynamic-dance-of-variables-and-constants-2024...
kr2589474
 

Computer Vision: Pattern Recognition