© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Semantic Segmentation
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Problem statement: Pixel-level classification task
Applications: Brain tissue segmentation
U-Net: Convolutional Networks for Biomedical Image Segmentation Olaf Ronneberger, Philipp Fischer, Thomas Brox, 2015
source: https://github.com/reachsumit/deep-unet-for-satellite-image-segmentation
Applications: Satellite image land use
Applications: Self-driving cars
source: https://www.youtube.com/watch?v=ATlcEDSPWXY
How does it work?
Source: Fully Convolutional Networks for Semantic Segmentation, Long et al. 2015
Deep Neural Network
Input
RGB or Grayscale Images
Unsigned integer [0,255]
N classes
Output: predict one “heat-map” per class
Softmax across class axis
How does it work?
Trained to minimize the softmax cross entropy loss for each pixel i,j
predictions among the N different classes:
𝑙𝑜𝑠𝑠 = −
𝑖,𝑗
𝐻,𝑊
𝑐
𝑁
𝑦𝑖,𝑗,𝑐 ∗ log(𝑝𝑖,𝑗,𝑐)
𝑙𝑜𝑠𝑠 = −
𝑖,𝑗
𝐻,𝑊
log(𝑝𝑖,𝑗,𝑐=𝑦 𝑖,𝑗
)
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Main challenge: capturing multi-scale context
cow?
Source: Deep LabV3 Rethinking Atrous Convolution for Semantic Image Segmentation, Chen et al. 2017
Strategies for capturing multi-scale context
Architectures: HourGlass
Architecture of the full network. The convolution network is based on the VGG16 architecture. The deconvolution
network uses unpooling and deconvolution layers. Source: H. Noh et al. (2015)
Architectures: U-Net
U-Net: Convolutional Networks for Biomedical Image Segmentation Olaf Ronneberger, Philipp Fischer, Thomas Brox, 2015
Architectures: DeepLab V3
Source: Rethinking Atrous Convolution for Semantic Image Segmentation Liang-Chieh Chen, George Papandreou,
Florian Schroff, Hartwig Adam, 2017
Architectures: DeepLab V3+
Source: Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation, Liang-Chieh Chen, Yukun Zhu,
George Papandreou, Florian Schroff, and Hartwig Adam, 2018
Architectures: and more
See this medium blog post: Review of deep learning algorithm for semantic
segmentation
Fully Convolutional Network
ParseNet
Feature Pyramid Network
Pyramid Scene Parsing network (PSPNet)
Path Aggregation Network (PANet)
Context Encoding Network (EncNet)
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Conclusion
The key challenge in semantic segmentation is to
efficiently mix local and global context for pixel-wise
predictions
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
Thank you!
Go Build! https://gluon-cv.mxnet.io/build/examples_segmentation/index.html

More Related Content

PDF
Image segmentation with deep learning
PPTX
Image Segmentation Using Deep Learning : A survey
PPTX
Application of-image-segmentation-in-brain-tumor-detection
PDF
SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用
PPTX
Deep Learning in Computer Vision
PDF
Research of adversarial example on a deep neural network
PDF
Lec8: Medical Image Segmentation (II) (Region Growing/Merging)
PDF
Mask R-CNN
Image segmentation with deep learning
Image Segmentation Using Deep Learning : A survey
Application of-image-segmentation-in-brain-tumor-detection
SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用
Deep Learning in Computer Vision
Research of adversarial example on a deep neural network
Lec8: Medical Image Segmentation (II) (Region Growing/Merging)
Mask R-CNN

What's hot (20)

PPTX
Object Detection Methods using Deep Learning
PDF
Deep Learning for Computer Vision: Medical Imaging (UPC 2016)
PPTX
Otsu binarization
PPT
Image segmentation ppt
PDF
[5분 논문요약] Structured Knowledge Distillation for Semantic Segmentation
PPTX
CNN Tutorial
PPTX
Object Detection using Deep Neural Networks
PPTX
IMAGE SEGMENTATION.
PPTX
Canny Edge Detection
PDF
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
PDF
yolov3-4-5.pdf
PPTX
Facial expression recognition based on image feature
PDF
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
PPTX
Tutorial on Object Detection (Faster R-CNN)
PDF
A survey of deep learning approaches to medical applications
PDF
Data Science - Part XVII - Deep Learning & Image Processing
PDF
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...
PPTX
BRAIN TUMOR MRI IMAGE SEGMENTATION AND DETECTION IN IMAGE PROCESSING
PPT
Image segmentation
PDF
Lec15: Medical Image Registration (Introduction)
Object Detection Methods using Deep Learning
Deep Learning for Computer Vision: Medical Imaging (UPC 2016)
Otsu binarization
Image segmentation ppt
[5분 논문요약] Structured Knowledge Distillation for Semantic Segmentation
CNN Tutorial
Object Detection using Deep Neural Networks
IMAGE SEGMENTATION.
Canny Edge Detection
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
yolov3-4-5.pdf
Facial expression recognition based on image feature
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
Tutorial on Object Detection (Faster R-CNN)
A survey of deep learning approaches to medical applications
Data Science - Part XVII - Deep Learning & Image Processing
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...
BRAIN TUMOR MRI IMAGE SEGMENTATION AND DETECTION IN IMAGE PROCESSING
Image segmentation
Lec15: Medical Image Registration (Introduction)
Ad

Similar to Image Segmentation: Approaches and Challenges (20)

PDF
IRJET- Semantic Segmentation using Deep Learning
PDF
SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
PPTX
Semantic segmentation with Convolutional Neural Network Approaches
PPTX
AaSeminar_Template.pptx
PPTX
Introduction to Segmentation in Computer vision
PDF
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
PPTX
Review-image-segmentation-by-deep-learning
PDF
The Future of Health Monitoring: Advances in Wearable Sensor Data Processing
PDF
Semantic Segmentation - Míriam Bellver - UPC Barcelona 2018
PPTX
Semantic Segmentation on Satellite Imagery
PDF
Intro to Semantic Segmentation Using Deep Learning
PDF
#6 PyData Warsaw: Deep learning for image segmentation
PDF
Stadnford University practical presentation.pdf
PPTX
cityscapes Semantic Segmentation using FCN, U Net and U Net++.pptx
PPTX
U-Netpresentation.pptx
PDF
Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...
PDF
Optimisation of semantic segmentation algorithm for autonomous driving using ...
PDF
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
PPTX
UNetEliyaLaialy (2).pptx
PPTX
U-Net (1).pptx
IRJET- Semantic Segmentation using Deep Learning
SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
Semantic segmentation with Convolutional Neural Network Approaches
AaSeminar_Template.pptx
Introduction to Segmentation in Computer vision
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Review-image-segmentation-by-deep-learning
The Future of Health Monitoring: Advances in Wearable Sensor Data Processing
Semantic Segmentation - Míriam Bellver - UPC Barcelona 2018
Semantic Segmentation on Satellite Imagery
Intro to Semantic Segmentation Using Deep Learning
#6 PyData Warsaw: Deep learning for image segmentation
Stadnford University practical presentation.pdf
cityscapes Semantic Segmentation using FCN, U Net and U Net++.pptx
U-Netpresentation.pptx
Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...
Optimisation of semantic segmentation algorithm for autonomous driving using ...
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
UNetEliyaLaialy (2).pptx
U-Net (1).pptx
Ad

More from Apache MXNet (20)

PPTX
Recent Advances in Natural Language Processing
PPTX
Fine-tuning BERT for Question Answering
PPTX
Introduction to GluonNLP
PPTX
Introduction to object tracking with Deep Learning
PPTX
Introduction to GluonCV
PPTX
Introduction to Computer Vision
PPTX
Introduction to Deep face detection and recognition
PPTX
Generative Adversarial Networks (GANs) using Apache MXNet
PPTX
Deep Learning With Apache MXNet On Video by Ben Taylor @ ziff.ai
PDF
Using Java to deploy Deep Learning models with MXNet
PPTX
AI powered emotion recognition: From Inception to Production - Global AI Conf...
PPTX
MXNet Paris Workshop - Intro To MXNet
PDF
Apache MXNet ODSC West 2018
PDF
DeepLearning001&ApacheMXNetWithSparkForInference-ACNA2018
PDF
Apache MXNet EcoSystem - ACNA2018
PDF
ONNX and Edge Deployments
PDF
Distributed Inference with MXNet and Spark
PDF
Multivariate Time Series
PDF
AI On the Edge: Model Compression
PDF
Building Content Recommendation Systems using MXNet Gluon
Recent Advances in Natural Language Processing
Fine-tuning BERT for Question Answering
Introduction to GluonNLP
Introduction to object tracking with Deep Learning
Introduction to GluonCV
Introduction to Computer Vision
Introduction to Deep face detection and recognition
Generative Adversarial Networks (GANs) using Apache MXNet
Deep Learning With Apache MXNet On Video by Ben Taylor @ ziff.ai
Using Java to deploy Deep Learning models with MXNet
AI powered emotion recognition: From Inception to Production - Global AI Conf...
MXNet Paris Workshop - Intro To MXNet
Apache MXNet ODSC West 2018
DeepLearning001&ApacheMXNetWithSparkForInference-ACNA2018
Apache MXNet EcoSystem - ACNA2018
ONNX and Edge Deployments
Distributed Inference with MXNet and Spark
Multivariate Time Series
AI On the Edge: Model Compression
Building Content Recommendation Systems using MXNet Gluon

Recently uploaded (20)

PDF
Chapter 3 - Human Development Poweroint presentation
PDF
The Physiology Of The Red Blood Cells pdf
PPTX
Cells and Organs of the Immune System (Unit-2) - Majesh Sir.pptx
PDF
Sujay Rao Mandavilli IJISRT25AUG764 context based approaches to population ma...
PDF
Exploring PCR Techniques and Applications
PDF
Sustainable Biology- Scopes, Principles of sustainiability, Sustainable Resou...
PPTX
BPharm_Hospital_Organization_Complete_PPT.pptx
PDF
Sumer, Akkad and the mythology of the Toradja Sa'dan.pdf
PPTX
Spectroscopic Techniques for M Tech Civil Engineerin .pptx
PPT
Chapter 6 Introductory course Biology Camp
PPTX
Preformulation.pptx Preformulation studies-Including all parameter
PDF
Traditional Healing Practices: A Model for Integrative Care in Diabetes Mana...
PDF
No dilute core produced in simulations of giant impacts on to Jupiter
PDF
CuO Nps photocatalysts 15156456551564161
PPTX
EPILEPSY UPDATE in kkm malaysia today new
PDF
From Molecular Interactions to Solubility in Deep Eutectic Solvents: Explorin...
PDF
CHEM - GOC general organic chemistry.ppt
PPTX
Chapter 1 Introductory course Biology Camp
PDF
Integrative Oncology: Merging Conventional and Alternative Approaches (www.k...
PPTX
Cutaneous tuberculosis Dermatology
Chapter 3 - Human Development Poweroint presentation
The Physiology Of The Red Blood Cells pdf
Cells and Organs of the Immune System (Unit-2) - Majesh Sir.pptx
Sujay Rao Mandavilli IJISRT25AUG764 context based approaches to population ma...
Exploring PCR Techniques and Applications
Sustainable Biology- Scopes, Principles of sustainiability, Sustainable Resou...
BPharm_Hospital_Organization_Complete_PPT.pptx
Sumer, Akkad and the mythology of the Toradja Sa'dan.pdf
Spectroscopic Techniques for M Tech Civil Engineerin .pptx
Chapter 6 Introductory course Biology Camp
Preformulation.pptx Preformulation studies-Including all parameter
Traditional Healing Practices: A Model for Integrative Care in Diabetes Mana...
No dilute core produced in simulations of giant impacts on to Jupiter
CuO Nps photocatalysts 15156456551564161
EPILEPSY UPDATE in kkm malaysia today new
From Molecular Interactions to Solubility in Deep Eutectic Solvents: Explorin...
CHEM - GOC general organic chemistry.ppt
Chapter 1 Introductory course Biology Camp
Integrative Oncology: Merging Conventional and Alternative Approaches (www.k...
Cutaneous tuberculosis Dermatology

Image Segmentation: Approaches and Challenges

  • 1. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Semantic Segmentation
  • 2. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Problem statement: Pixel-level classification task
  • 3. Applications: Brain tissue segmentation U-Net: Convolutional Networks for Biomedical Image Segmentation Olaf Ronneberger, Philipp Fischer, Thomas Brox, 2015
  • 5. Applications: Self-driving cars source: https://www.youtube.com/watch?v=ATlcEDSPWXY
  • 6. How does it work? Source: Fully Convolutional Networks for Semantic Segmentation, Long et al. 2015 Deep Neural Network
  • 7. Input RGB or Grayscale Images Unsigned integer [0,255]
  • 8. N classes Output: predict one “heat-map” per class Softmax across class axis
  • 9. How does it work? Trained to minimize the softmax cross entropy loss for each pixel i,j predictions among the N different classes: 𝑙𝑜𝑠𝑠 = − 𝑖,𝑗 𝐻,𝑊 𝑐 𝑁 𝑦𝑖,𝑗,𝑐 ∗ log(𝑝𝑖,𝑗,𝑐) 𝑙𝑜𝑠𝑠 = − 𝑖,𝑗 𝐻,𝑊 log(𝑝𝑖,𝑗,𝑐=𝑦 𝑖,𝑗 )
  • 10. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Main challenge: capturing multi-scale context cow?
  • 11. Source: Deep LabV3 Rethinking Atrous Convolution for Semantic Image Segmentation, Chen et al. 2017 Strategies for capturing multi-scale context
  • 12. Architectures: HourGlass Architecture of the full network. The convolution network is based on the VGG16 architecture. The deconvolution network uses unpooling and deconvolution layers. Source: H. Noh et al. (2015)
  • 13. Architectures: U-Net U-Net: Convolutional Networks for Biomedical Image Segmentation Olaf Ronneberger, Philipp Fischer, Thomas Brox, 2015
  • 14. Architectures: DeepLab V3 Source: Rethinking Atrous Convolution for Semantic Image Segmentation Liang-Chieh Chen, George Papandreou, Florian Schroff, Hartwig Adam, 2017
  • 15. Architectures: DeepLab V3+ Source: Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation, Liang-Chieh Chen, Yukun Zhu, George Papandreou, Florian Schroff, and Hartwig Adam, 2018
  • 16. Architectures: and more See this medium blog post: Review of deep learning algorithm for semantic segmentation Fully Convolutional Network ParseNet Feature Pyramid Network Pyramid Scene Parsing network (PSPNet) Path Aggregation Network (PANet) Context Encoding Network (EncNet)
  • 17. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark
  • 18. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Conclusion The key challenge in semantic segmentation is to efficiently mix local and global context for pixel-wise predictions
  • 19. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Trademark Thank you! Go Build! https://gluon-cv.mxnet.io/build/examples_segmentation/index.html

Editor's Notes

  • #2: First call deck for a high level introduction to Apache MXNet.
  • #3: Pixel house landscape by 8bitnoob