SlideShare a Scribd company logo
Intro to Semantic Segmentation Using Deep Learning
================================================================
Semantic segmentation is the task of classifying each and every pixel in an image
into a class as shown in the image below. Here you can see that all persons are red,
the road is purple, the vehicles are blue, street signs are yellow etc.
Semantic segmentation is different from instance segmentation which is that
different objects of the same class will have different labels as in person1, person2
and hence different colours. The picture below very crisply illustrates the difference
between instance and semantic segmentation. If you are interested in learning more
about classification and object detection, please check out my blog here.
One important question can be why do we need this granularity of understanding
pixel by pixel location?
Some examples that come to mind are:
i) Self Driving Cars — May need to know exactly where another car is on the road or
the location of a human crossing the road
ii) Robotic systems — Robots that say join two parts together will perform better if
they know the exact locations of the two parts
iii) Damage Detection - It may be important in this case to know the exact extent of
damage
Deep Learning Model Architectures for Semantic Segmentation
Lets now talk about 3 model architectures that do semantic segmentation.
1. Fully Convolutional Network (FCN)
FCN is a popular algorithm for doing semantic segmentation. This model uses
various blocks of convolution and max pool layers to first decompress an image to
1/32th of its original size. It then makes a class prediction at this level of granularity.
Finally it uses up sampling and deconvolution layers to resize the image to its
original dimensions.
These models typically don't have any fully connected layers. The goal of down
sampling steps is to capture semantic/contextual information while the goal of up
sampling is to recover spatial information. Also there are no limitations on image
size. The final image is the same size as the original image. To fully recover the fine
grained spatial information lost in down sampling, skip connections are used. A skip
connection is a connection that bypasses at least one layer. Here it is used to pass
information from the down sampling step to the up sampling step. Merging features
from various resolution levels helps combining context information with spatial
information
Contacts Us:-
Address: - 110 Fontainbleau Drive, Toronto
Telephone: - 647-550-0256
Email: - deeplearning33@gmail.com

More Related Content

What's hot (12)

PDF
NEW ONTOLOGY RETRIEVAL IMAGE METHOD IN 5K COREL IMAGES
ijcax
 
DOC
SchuurmansLecture.doc
butest
 
PPTX
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...
Cheriyan K M
 
DOCX
Multiview alignment hashing for
jpstudcorner
 
PPT
Rafi Zachut's slides on class specific segmentation
wolf
 
PDF
Nips 2016 tutorial generative adversarial networks review
Minho Heo
 
PDF
Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...
IJECEIAES
 
PDF
IEEE PROJECT TOPICS &ABSTRACTS on image processing
aswin tbbc
 
PPTX
Static Spatial Graph Features
Niklas Elmqvist
 
PPT
A Review of Relational Machine Learning(SRL) for Knowledge Graphs
yalda akbarzadeh
 
DOCX
Bt9301 computer graphics
smumbahelp
 
PDF
A MULTI-STREAM HMM APPROACH TO OFFLINE HANDWRITTEN ARABIC WORD RECOGNITION
ijnlc
 
NEW ONTOLOGY RETRIEVAL IMAGE METHOD IN 5K COREL IMAGES
ijcax
 
SchuurmansLecture.doc
butest
 
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...
Cheriyan K M
 
Multiview alignment hashing for
jpstudcorner
 
Rafi Zachut's slides on class specific segmentation
wolf
 
Nips 2016 tutorial generative adversarial networks review
Minho Heo
 
Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...
IJECEIAES
 
IEEE PROJECT TOPICS &ABSTRACTS on image processing
aswin tbbc
 
Static Spatial Graph Features
Niklas Elmqvist
 
A Review of Relational Machine Learning(SRL) for Knowledge Graphs
yalda akbarzadeh
 
Bt9301 computer graphics
smumbahelp
 
A MULTI-STREAM HMM APPROACH TO OFFLINE HANDWRITTEN ARABIC WORD RECOGNITION
ijnlc
 

Similar to Intro to Semantic Segmentation Using Deep Learning (20)

PPTX
AaSeminar_Template.pptx
ManojGowdaKb
 
PPTX
Introduction to Segmentation in Computer vision
ParrotAI
 
PDF
Semantic Segmentation - Míriam Bellver - UPC Barcelona 2018
Universitat Politècnica de Catalunya
 
PDF
SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
ynxm25hpxp
 
PPTX
Image Segmentation: Approaches and Challenges
Apache MXNet
 
PPTX
Semantic segmentation with Convolutional Neural Network Approaches
UMBC
 
PDF
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Universitat Politècnica de Catalunya
 
PDF
IRJET- Semantic Segmentation using Deep Learning
IRJET Journal
 
PDF
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
岳華 杜
 
PPTX
Image Segmentation Using Deep Learning : A survey
NUPUR YADAV
 
PPTX
Review-image-segmentation-by-deep-learning
Trong-An Bui
 
PPTX
Semantic Segmentation on Satellite Imagery
RAHUL BHOJWANI
 
PPTX
DefenseTalk_Trimmed
Abhishek Sharma
 
PDF
Optimisation of semantic segmentation algorithm for autonomous driving using ...
IAESIJAI
 
PPTX
cityscapes Semantic Segmentation using FCN, U Net and U Net++.pptx
faizalmistry5
 
PDF
A brief introduction to recent segmentation methods
Shunta Saito
 
PDF
Semantic Video Segmentation with Using Ensemble of Particular Classifiers and...
ITIIIndustries
 
PDF
The Future of Health Monitoring: Advances in Wearable Sensor Data Processing
IgMin Publications Inc.
 
PPTX
Image segmentation hj_cho
Hyungjoo Cho
 
PDF
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Universitat Politècnica de Catalunya
 
AaSeminar_Template.pptx
ManojGowdaKb
 
Introduction to Segmentation in Computer vision
ParrotAI
 
Semantic Segmentation - Míriam Bellver - UPC Barcelona 2018
Universitat Politècnica de Catalunya
 
SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
ynxm25hpxp
 
Image Segmentation: Approaches and Challenges
Apache MXNet
 
Semantic segmentation with Convolutional Neural Network Approaches
UMBC
 
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Universitat Politècnica de Catalunya
 
IRJET- Semantic Segmentation using Deep Learning
IRJET Journal
 
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
岳華 杜
 
Image Segmentation Using Deep Learning : A survey
NUPUR YADAV
 
Review-image-segmentation-by-deep-learning
Trong-An Bui
 
Semantic Segmentation on Satellite Imagery
RAHUL BHOJWANI
 
DefenseTalk_Trimmed
Abhishek Sharma
 
Optimisation of semantic segmentation algorithm for autonomous driving using ...
IAESIJAI
 
cityscapes Semantic Segmentation using FCN, U Net and U Net++.pptx
faizalmistry5
 
A brief introduction to recent segmentation methods
Shunta Saito
 
Semantic Video Segmentation with Using Ensemble of Particular Classifiers and...
ITIIIndustries
 
The Future of Health Monitoring: Advances in Wearable Sensor Data Processing
IgMin Publications Inc.
 
Image segmentation hj_cho
Hyungjoo Cho
 
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Universitat Politècnica de Catalunya
 
Ad

Recently uploaded (20)

PPTX
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
PDF
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
PDF
HubSpot Main Hub: A Unified Growth Platform
Jaswinder Singh
 
PDF
From Code to Challenge: Crafting Skill-Based Games That Engage and Reward
aiyshauae
 
PDF
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
PDF
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
PDF
LLMs.txt: Easily Control How AI Crawls Your Site
Keploy
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
PPTX
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
PDF
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
PPTX
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
PDF
CIFDAQ Weekly Market Wrap for 11th July 2025
CIFDAQ
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PDF
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
PDF
Complete JavaScript Notes: From Basics to Advanced Concepts.pdf
haydendavispro
 
PDF
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
PDF
Blockchain Transactions Explained For Everyone
CIFDAQ
 
PDF
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
HubSpot Main Hub: A Unified Growth Platform
Jaswinder Singh
 
From Code to Challenge: Crafting Skill-Based Games That Engage and Reward
aiyshauae
 
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
LLMs.txt: Easily Control How AI Crawls Your Site
Keploy
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
CIFDAQ Weekly Market Wrap for 11th July 2025
CIFDAQ
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
Complete JavaScript Notes: From Basics to Advanced Concepts.pdf
haydendavispro
 
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
Blockchain Transactions Explained For Everyone
CIFDAQ
 
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
Ad

Intro to Semantic Segmentation Using Deep Learning

  • 1. Intro to Semantic Segmentation Using Deep Learning ================================================================ Semantic segmentation is the task of classifying each and every pixel in an image into a class as shown in the image below. Here you can see that all persons are red, the road is purple, the vehicles are blue, street signs are yellow etc. Semantic segmentation is different from instance segmentation which is that different objects of the same class will have different labels as in person1, person2 and hence different colours. The picture below very crisply illustrates the difference between instance and semantic segmentation. If you are interested in learning more about classification and object detection, please check out my blog here.
  • 2. One important question can be why do we need this granularity of understanding pixel by pixel location? Some examples that come to mind are: i) Self Driving Cars — May need to know exactly where another car is on the road or the location of a human crossing the road ii) Robotic systems — Robots that say join two parts together will perform better if they know the exact locations of the two parts iii) Damage Detection - It may be important in this case to know the exact extent of damage Deep Learning Model Architectures for Semantic Segmentation Lets now talk about 3 model architectures that do semantic segmentation. 1. Fully Convolutional Network (FCN) FCN is a popular algorithm for doing semantic segmentation. This model uses various blocks of convolution and max pool layers to first decompress an image to 1/32th of its original size. It then makes a class prediction at this level of granularity. Finally it uses up sampling and deconvolution layers to resize the image to its original dimensions. These models typically don't have any fully connected layers. The goal of down sampling steps is to capture semantic/contextual information while the goal of up sampling is to recover spatial information. Also there are no limitations on image size. The final image is the same size as the original image. To fully recover the fine grained spatial information lost in down sampling, skip connections are used. A skip
  • 3. connection is a connection that bypasses at least one layer. Here it is used to pass information from the down sampling step to the up sampling step. Merging features from various resolution levels helps combining context information with spatial information Contacts Us:- Address: - 110 Fontainbleau Drive, Toronto Telephone: - 647-550-0256 Email: - [email protected]