Semantic image segmentation with word embeddings uses language to provide semantic context for image regions. Convolutional neural networks extract visual features at multiple levels, from low-level edges and textures up to higher semantic levels, and segmentation architectures such as SegNet and pyramid pooling networks (e.g., PSPNet) produce dense per-pixel predictions from those features. Word embeddings represent words as vectors in a continuous concept space in which cosine similarity reflects semantic relatedness. Each image class is mapped to its word embedding, so during training the loss measures the cosine distance between the predicted vector and the true class's word vector at each image region. Because predictions live in embedding space rather than over a fixed label set, this allows vocabulary-free semantic segmentation based on the conceptual relationships between words and image regions.
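As an illustration, here is a minimal sketch of the training loss and inference step described above, assuming PyTorch; the tensor shapes, function names, and toy dimensions are my own for the example, not taken from any specific paper's implementation:

```python
import torch
import torch.nn.functional as F

def cosine_distance_loss(pred, class_embeddings, labels):
    """Per-pixel cosine-distance loss against class word vectors.

    pred:             (B, D, H, W) predicted embedding per pixel
    class_embeddings: (C, D) word embedding for each of C classes
    labels:           (B, H, W) integer class index per pixel
    """
    # Look up the target word vector for each pixel -> (B, H, W, D)
    targets = class_embeddings[labels]
    # Move the embedding dimension last to match -> (B, H, W, D)
    pred = pred.permute(0, 2, 3, 1)
    # Loss = 1 - cosine similarity, averaged over all pixels
    return (1.0 - F.cosine_similarity(pred, targets, dim=-1)).mean()

def predict_classes(pred, class_embeddings):
    """Label each pixel with the class whose word vector is nearest in cosine similarity."""
    pred = F.normalize(pred.permute(0, 2, 3, 1), dim=-1)  # (B, H, W, D)
    emb = F.normalize(class_embeddings, dim=-1)           # (C, D)
    sims = torch.einsum("bhwd,cd->bhwc", pred, emb)       # (B, H, W, C)
    return sims.argmax(dim=-1)                            # (B, H, W)

# Toy usage: 2 images, 300-dim embeddings (a common word2vec/GloVe size), 5 classes
pred = torch.randn(2, 300, 32, 32, requires_grad=True)
class_embeddings = torch.randn(5, 300)
labels = torch.randint(0, 5, (2, 32, 32))

loss = cosine_distance_loss(pred, class_embeddings, labels)
loss.backward()
print(loss.item(), predict_classes(pred, class_embeddings).shape)
```

Note the design consequence: because inference is a nearest-neighbor lookup in embedding space, new classes can in principle be recognized simply by appending their word vectors to class_embeddings, without retraining, which is what makes the approach vocabulary-free.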