Intro to Semantic Segmentation Using Deep Learning

0 likes128 views

The document introduces semantic segmentation, a deep learning task that classifies each pixel in an image into different classes, contrasting it with instance segmentation. It highlights the importance of pixel-level understanding for applications like self-driving cars and robotic systems, and discusses the Fully Convolutional Network (FCN) model architecture used for this task. The FCN processes images through down sampling and up sampling to maintain semantic and spatial information, utilizing skip connections to aid in feature merging.

Technology

Intro to Semantic Segmentation Using Deep Learning
================================================================
Semantic segmentation is the task of classifying each and every pixel in an image
into a class as shown in the image below. Here you can see that all persons are red,
the road is purple, the vehicles are blue, street signs are yellow etc.
Semantic segmentation is different from instance segmentation which is that
different objects of the same class will have different labels as in person1, person2
and hence different colours. The picture below very crisply illustrates the difference
between instance and semantic segmentation. If you are interested in learning more
about classification and object detection, please check out my blog here.

One important question can be why do we need this granularity of understanding
pixel by pixel location?
Some examples that come to mind are:
i) Self Driving Cars — May need to know exactly where another car is on the road or
the location of a human crossing the road
ii) Robotic systems — Robots that say join two parts together will perform better if
they know the exact locations of the two parts
iii) Damage Detection - It may be important in this case to know the exact extent of
damage
Deep Learning Model Architectures for Semantic Segmentation
Lets now talk about 3 model architectures that do semantic segmentation.
1. Fully Convolutional Network (FCN)
FCN is a popular algorithm for doing semantic segmentation. This model uses
various blocks of convolution and max pool layers to first decompress an image to
1/32th of its original size. It then makes a class prediction at this level of granularity.
Finally it uses up sampling and deconvolution layers to resize the image to its
original dimensions.
These models typically don't have any fully connected layers. The goal of down
sampling steps is to capture semantic/contextual information while the goal of up
sampling is to recover spatial information. Also there are no limitations on image
size. The final image is the same size as the original image. To fully recover the fine
grained spatial information lost in down sampling, skip connections are used. A skip

connection is a connection that bypasses at least one layer. Here it is used to pass
information from the down sampling step to the up sampling step. Merging features
from various resolution levels helps combining context information with spatial
information
Contacts Us:-
Address: - 110 Fontainbleau Drive, Toronto
Telephone: - 647-550-0256
Email: - deeplearning33@gmail.com

More Related Content

What's hot (12)

PDF

NEW ONTOLOGY RETRIEVAL IMAGE METHOD IN 5K COREL IMAGESijcax

DOC

SchuurmansLecture.docbutest

PPTX

SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...Cheriyan K M

DOCX

Multiview alignment hashing forjpstudcorner

PPT

Rafi Zachut's slides on class specific segmentationwolf

PDF

Nips 2016 tutorial generative adversarial networks reviewMinho Heo

PDF

Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...IJECEIAES

PDF

IEEE PROJECT TOPICS &ABSTRACTS on image processingaswin tbbc

PPTX

Static Spatial Graph FeaturesNiklas Elmqvist

PPT

A Review of Relational Machine Learning(SRL) for Knowledge Graphsyalda akbarzadeh

DOCX

Bt9301 computer graphicssmumbahelp

PDF

A MULTI-STREAM HMM APPROACH TO OFFLINE HANDWRITTEN ARABIC WORD RECOGNITIONijnlc

NEW ONTOLOGY RETRIEVAL IMAGE METHOD IN 5K COREL IMAGESijcax

SchuurmansLecture.docbutest

SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...Cheriyan K M

Multiview alignment hashing forjpstudcorner

Rafi Zachut's slides on class specific segmentationwolf

Nips 2016 tutorial generative adversarial networks reviewMinho Heo

Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...IJECEIAES

IEEE PROJECT TOPICS &ABSTRACTS on image processingaswin tbbc

Static Spatial Graph FeaturesNiklas Elmqvist

A Review of Relational Machine Learning(SRL) for Knowledge Graphsyalda akbarzadeh

Bt9301 computer graphicssmumbahelp

A MULTI-STREAM HMM APPROACH TO OFFLINE HANDWRITTEN ARABIC WORD RECOGNITIONijnlc

Similar to Intro to Semantic Segmentation Using Deep Learning (20)

PPTX

AaSeminar_Template.pptxManojGowdaKb

PPTX

Introduction to Segmentation in Computer vision ParrotAI

PDF

Semantic Segmentation - Míriam Bellver - UPC Barcelona 2018Universitat Politècnica de Catalunya

PDF

SimCLR: A Simple Framework for Contrastive Learning of Visual Representationsynxm25hpxp

PPTX

Image Segmentation: Approaches and ChallengesApache MXNet

PPTX

Semantic segmentation with Convolutional Neural Network ApproachesUMBC

PDF

Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Universitat Politècnica de Catalunya

PDF

IRJET- Semantic Segmentation using Deep LearningIRJET Journal

PDF

Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation岳華杜

PPTX

Image Segmentation Using Deep Learning : A surveyNUPUR YADAV

PPTX

Review-image-segmentation-by-deep-learningTrong-An Bui

PPTX

Semantic Segmentation on Satellite ImageryRAHUL BHOJWANI

PPTX

DefenseTalk_TrimmedAbhishek Sharma

PDF

Optimisation of semantic segmentation algorithm for autonomous driving using ...IAESIJAI

PPTX

cityscapes Semantic Segmentation using FCN, U Net and U Net++.pptxfaizalmistry5

PDF

A brief introduction to recent segmentation methodsShunta Saito

PDF

Semantic Video Segmentation with Using Ensemble of Particular Classifiers and...ITIIIndustries

PDF

The Future of Health Monitoring: Advances in Wearable Sensor Data ProcessingIgMin Publications Inc.

PPTX

Image segmentation hj_choHyungjoo Cho

PDF

Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...Universitat Politècnica de Catalunya

AaSeminar_Template.pptxManojGowdaKb

Introduction to Segmentation in Computer vision ParrotAI

Semantic Segmentation - Míriam Bellver - UPC Barcelona 2018Universitat Politècnica de Catalunya

SimCLR: A Simple Framework for Contrastive Learning of Visual Representationsynxm25hpxp

Image Segmentation: Approaches and ChallengesApache MXNet

Semantic segmentation with Convolutional Neural Network ApproachesUMBC

Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Universitat Politècnica de Catalunya

IRJET- Semantic Segmentation using Deep LearningIRJET Journal

Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation岳華杜

Image Segmentation Using Deep Learning : A surveyNUPUR YADAV

Review-image-segmentation-by-deep-learningTrong-An Bui

Semantic Segmentation on Satellite ImageryRAHUL BHOJWANI

DefenseTalk_TrimmedAbhishek Sharma

Optimisation of semantic segmentation algorithm for autonomous driving using ...IAESIJAI

cityscapes Semantic Segmentation using FCN, U Net and U Net++.pptxfaizalmistry5

A brief introduction to recent segmentation methodsShunta Saito

Semantic Video Segmentation with Using Ensemble of Particular Classifiers and...ITIIIndustries

The Future of Health Monitoring: Advances in Wearable Sensor Data ProcessingIgMin Publications Inc.

Image segmentation hj_choHyungjoo Cho

Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...Universitat Politècnica de Catalunya

Recently uploaded (20)

PPTX

Building Search Using OpenSearch: Limitations and WorkaroundsSease

PDF

DevBcn - Building 10x Organizations Using Modern Productivity MetricsJustin Reock

PDF

HubSpot Main Hub: A Unified Growth PlatformJaswinder Singh

PDF

From Code to Challenge: Crafting Skill-Based Games That Engage and Rewardaiyshauae

PDF

"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...Fwdays

PDF

Chris Elwell Woburn, MA - Passionate About IT InnovationChris Elwell Woburn, MA

PPTX

Q2 FY26 Tableau User Group Leader Quarterly Calllward7

PDF

LLMs.txt: Easily Control How AI Crawls Your SiteKeploy

PDF

[Newgen] NewgenONE Marvin Brochure 1.pdfdarshakparmar

PDF

Building Real-Time Digital Twins with IBM Maximo & ArcGIS IndoorsSafe Software

PPTX

AI Penetration Testing Essentials: A Cybersecurity Guide for 2025defencerabbit Team

PDF

CIFDAQ Token Spotlight for 9th July 2025CIFDAQ

PPTX

Webinar: Introduction to LF Energy EVerestDanBrown980551

PDF

CIFDAQ Weekly Market Wrap for 11th July 2025CIFDAQ

PDF

Bitcoin for Millennials podcast with Bram, Power Laws of BitcoinStephen Perrenod

PDF

Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdfdarshakparmar

PDF

Complete JavaScript Notes: From Basics to Advanced Concepts.pdfhaydendavispro

PDF

NewMind AI - Journal 100 Insights After The 100th IssueNewMind AI

PDF

Blockchain Transactions Explained For EveryoneCIFDAQ

PDF

Presentation - Vibe Coding The Future of Techyanuarsinggih1

Building Search Using OpenSearch: Limitations and WorkaroundsSease

DevBcn - Building 10x Organizations Using Modern Productivity MetricsJustin Reock

HubSpot Main Hub: A Unified Growth PlatformJaswinder Singh

From Code to Challenge: Crafting Skill-Based Games That Engage and Rewardaiyshauae

"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...Fwdays

Chris Elwell Woburn, MA - Passionate About IT InnovationChris Elwell Woburn, MA

Q2 FY26 Tableau User Group Leader Quarterly Calllward7

LLMs.txt: Easily Control How AI Crawls Your SiteKeploy

[Newgen] NewgenONE Marvin Brochure 1.pdfdarshakparmar

Building Real-Time Digital Twins with IBM Maximo & ArcGIS IndoorsSafe Software

AI Penetration Testing Essentials: A Cybersecurity Guide for 2025defencerabbit Team

CIFDAQ Token Spotlight for 9th July 2025CIFDAQ

Webinar: Introduction to LF Energy EVerestDanBrown980551

CIFDAQ Weekly Market Wrap for 11th July 2025CIFDAQ

Bitcoin for Millennials podcast with Bram, Power Laws of BitcoinStephen Perrenod

Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdfdarshakparmar

Complete JavaScript Notes: From Basics to Advanced Concepts.pdfhaydendavispro

NewMind AI - Journal 100 Insights After The 100th IssueNewMind AI

Blockchain Transactions Explained For EveryoneCIFDAQ

Presentation - Vibe Coding The Future of Techyanuarsinggih1

Intro to Semantic Segmentation Using Deep Learning

1. Intro to Semantic Segmentation Using Deep Learning ================================================================ Semantic segmentation is the task of classifying each and every pixel in an image into a class as shown in the image below. Here you can see that all persons are red, the road is purple, the vehicles are blue, street signs are yellow etc. Semantic segmentation is different from instance segmentation which is that different objects of the same class will have different labels as in person1, person2 and hence different colours. The picture below very crisply illustrates the difference between instance and semantic segmentation. If you are interested in learning more about classification and object detection, please check out my blog here.

2. One important question can be why do we need this granularity of understanding pixel by pixel location? Some examples that come to mind are: i) Self Driving Cars — May need to know exactly where another car is on the road or the location of a human crossing the road ii) Robotic systems — Robots that say join two parts together will perform better if they know the exact locations of the two parts iii) Damage Detection - It may be important in this case to know the exact extent of damage Deep Learning Model Architectures for Semantic Segmentation Lets now talk about 3 model architectures that do semantic segmentation. 1. Fully Convolutional Network (FCN) FCN is a popular algorithm for doing semantic segmentation. This model uses various blocks of convolution and max pool layers to first decompress an image to 1/32th of its original size. It then makes a class prediction at this level of granularity. Finally it uses up sampling and deconvolution layers to resize the image to its original dimensions. These models typically don't have any fully connected layers. The goal of down sampling steps is to capture semantic/contextual information while the goal of up sampling is to recover spatial information. Also there are no limitations on image size. The final image is the same size as the original image. To fully recover the fine grained spatial information lost in down sampling, skip connections are used. A skip

3. connection is a connection that bypasses at least one layer. Here it is used to pass information from the down sampling step to the up sampling step. Merging features from various resolution levels helps combining context information with spatial information Contacts Us:- Address: - 110 Fontainbleau Drive, Toronto Telephone: - 647-550-0256 Email: - [email protected]