SlideShare a Scribd company logo
Generative Adversarial
Networks
Palacode Narayana Iyer Anantharaman
29 Oct 2018
References
• https://github.com/lukedeo/keras-acgan/blob/master/acgan-analysis.ipynb
• https://github.com/keras-team/keras/blob/master/examples/mnist_acgan.py
• https://skymind.ai/wiki/generative-adversarial-network-gan
• https://junyanz.github.io/CycleGAN/
• Self Attention Generative Adversarial Networks: Zhang et al
Why GAN?
• GANs can learn to mimic any distribution and generate data
• The data may be images, speech or music
• The outputs from GANs are found to be quite realistic and impressive
• Thus, GANs have a number of applications: From being a feature in products like
Photoshop to generating synthetic datasets for image augmentation
GAN Architecture
GAN Architecture
GAN Workflow
Generator
• Generates synthetic images given the input noise z
• G is differentiable
• Typically a Gaussian distribution
Training
• Train on 2 mini batches simultaneously
• Training samples
• Generated samples
• Cost
Cost Function
Different Variants of GAN
Ref: https://github.com/lukedeo/keras-acgan/blob/master/acgan-analysis.ipynb
Cycle GAN (2017)
• Original Paper: “Unpaired Image-to-Image Translation using Cycle-Consistent
Adversarial Networks”, Zhu et al
Image to Image Translation
• Image to image translation is aimed at finding a mapping
between an input image (X) and its corresponding output
image (Y), where the pair X, Y are provided in the dataset
• This assumes that we are provided with such a labelled
dataset with pairings
• CycleGAN attempts to find a mapping between images from
source and target domains in the absence of paired
examples
Learn G: X → Y such that the distribution of images from G(X) is
indistinguishable from the distribution Y using an adversarial
loss.
Couple this with an inverse mapping F: Y → X and enforce a
cycle consistency loss to enforce F(G(X)) ≈ X
CycleGAN Approach
Cycle GAN: Objective Function
• Two discriminators: Dx and Dy where Dx aims to distinguish between images {x}
and translated images {F(y)}. In the same way Dy aims to discriminate between {y}
and {G(x)}
• The objective function has 2 parts representing the losses:
• adversarial losses for matching the distribution of generated images to the data distribution
in the target domain
• Cycle consistency losses that prevent the learned mappings G and F from contradicting each
other
Losses
Exercises
• Go through the original paper and answer the following:
• How is the model evaluated? What are the metrics?
• What are the main applications discussed in the paper?
• What are the limitations and future work?
SAGAN (2018) Zhang et al Abstract
• GANs often use a CNN as a generator
• CNNs capture short range dependencies very well (local receptive fields) but not
effective to capture long distance correlations
• Self Attention Generative Adversarial Networks (SAGAN) is aimed at generating
images that take in to account both short and long distance dependencies in the
source images
SAGAN
SAGAN Architecture
Generative Adversarial Networks : Basic architecture and variants

More Related Content

What's hot (20)

PPTX
Core Java
Priyanka Pradhan
 
PPTX
Calibrating Lighting and Materials in Far Cry 3
stevemcauley
 
PPTX
Snake PY Game.pptx
Lovely professinal university
 
PPTX
Deep Semi-supervised Learning methods
Princy Joy
 
PPTX
Sigmoid function machine learning made simple
Devansh16
 
DOC
Snake project report
Manju Rajput
 
PDF
Files in java
Muthukumaran Subramanian
 
PDF
A Basic Django Introduction
Ganga Ram
 
PDF
Django Introduction & Tutorial
之宇 趙
 
PDF
C Programming Project
Vijayananda Mohire
 
PPT
First Review(Ppt)
smjagadish
 
PDF
GDSC IIT Goa Info Session Slides
SEJALGUPTA44
 
PPTX
Final Year Project Presentation
Syed Absar
 
DOCX
INTERNSHIP REPORT
Mufaddal Vasi
 
PDF
Neural Network from Scratch in Python
Dhirajk7
 
PPTX
oop.pptx
KabitaParajuli3
 
PPTX
String in java
Ideal Eyes Business College
 
PPTX
Image to image translation with Pix2Pix GAN
S.Shayan Daneshvar
 
PDF
Image-to-Image Translation with Conditional Adversarial Nets (UPC Reading Group)
Universitat Politècnica de Catalunya
 
PPTX
Bsc. Final Project Presentation
Mohammad Ali Khan
 
Core Java
Priyanka Pradhan
 
Calibrating Lighting and Materials in Far Cry 3
stevemcauley
 
Snake PY Game.pptx
Lovely professinal university
 
Deep Semi-supervised Learning methods
Princy Joy
 
Sigmoid function machine learning made simple
Devansh16
 
Snake project report
Manju Rajput
 
A Basic Django Introduction
Ganga Ram
 
Django Introduction & Tutorial
之宇 趙
 
C Programming Project
Vijayananda Mohire
 
First Review(Ppt)
smjagadish
 
GDSC IIT Goa Info Session Slides
SEJALGUPTA44
 
Final Year Project Presentation
Syed Absar
 
INTERNSHIP REPORT
Mufaddal Vasi
 
Neural Network from Scratch in Python
Dhirajk7
 
oop.pptx
KabitaParajuli3
 
Image to image translation with Pix2Pix GAN
S.Shayan Daneshvar
 
Image-to-Image Translation with Conditional Adversarial Nets (UPC Reading Group)
Universitat Politècnica de Catalunya
 
Bsc. Final Project Presentation
Mohammad Ali Khan
 

Similar to Generative Adversarial Networks : Basic architecture and variants (20)

PDF
Unpaired Image Translations Using GANs: A Review
IRJET Journal
 
PDF
Volodymyr Lyubinets “Generative models for images”
Lviv Startup Club
 
PDF
Jakub Langr (University of Oxford) - Overview of Generative Adversarial Netwo...
Codiax
 
PDF
Finding connections among images using CycleGAN
NAVER Engineering
 
PPTX
Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...
Catalina Arango
 
PPTX
Generative adversarial networks in computer vision
ShreeGowriRadhakrish
 
PPTX
Image colorization
Yash Saraf
 
PPTX
Image colorization
Yash Saraf
 
PDF
Generative adversarial network_Ayadi_Alaeddine
Deep Learning Italia
 
PDF
Top Blockchain Development Services | Build Your Blockchain Today
Qubited
 
PPTX
Generative Adversarial Networks (GAN)
Manohar Mukku
 
PDF
Google Dev Group Yangon (2020) AI Talk (Creative AI in Action)
WaiYanNyeinNaing
 
PPTX
Generative Adversarial Network (GAN) for Image Synthesis
Riwaz Mahat
 
PDF
lec19.pdf
ssuser0d87ff
 
PDF
“Making GANs Much Better, or If at First You Don’t Succeed, Try, Try a GAN,” ...
Edge AI and Vision Alliance
 
PPTX
Face-GAN project report.pptx
AndleebFatima16
 
PPTX
Face-GAN project report
AndleebFatima16
 
PDF
Generative Adversarial Networksの基礎と応用について
So Hasegawa
 
PDF
Generative Adversarial Networks and Their Medical Imaging Applications
Kyuhwan Jung
 
PDF
An Introduction To Generative Adversarial Networks
Bluebash
 
Unpaired Image Translations Using GANs: A Review
IRJET Journal
 
Volodymyr Lyubinets “Generative models for images”
Lviv Startup Club
 
Jakub Langr (University of Oxford) - Overview of Generative Adversarial Netwo...
Codiax
 
Finding connections among images using CycleGAN
NAVER Engineering
 
Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...
Catalina Arango
 
Generative adversarial networks in computer vision
ShreeGowriRadhakrish
 
Image colorization
Yash Saraf
 
Image colorization
Yash Saraf
 
Generative adversarial network_Ayadi_Alaeddine
Deep Learning Italia
 
Top Blockchain Development Services | Build Your Blockchain Today
Qubited
 
Generative Adversarial Networks (GAN)
Manohar Mukku
 
Google Dev Group Yangon (2020) AI Talk (Creative AI in Action)
WaiYanNyeinNaing
 
Generative Adversarial Network (GAN) for Image Synthesis
Riwaz Mahat
 
lec19.pdf
ssuser0d87ff
 
“Making GANs Much Better, or If at First You Don’t Succeed, Try, Try a GAN,” ...
Edge AI and Vision Alliance
 
Face-GAN project report.pptx
AndleebFatima16
 
Face-GAN project report
AndleebFatima16
 
Generative Adversarial Networksの基礎と応用について
So Hasegawa
 
Generative Adversarial Networks and Their Medical Imaging Applications
Kyuhwan Jung
 
An Introduction To Generative Adversarial Networks
Bluebash
 
Ad

More from ananth (20)

PDF
Convolutional Neural Networks : Popular Architectures
ananth
 
PDF
Foundations: Artificial Neural Networks
ananth
 
PDF
Overview of Convolutional Neural Networks
ananth
 
PDF
Artificial Intelligence Course: Linear models
ananth
 
PDF
An Overview of Naïve Bayes Classifier
ananth
 
PDF
Mathematical Background for Artificial Intelligence
ananth
 
PDF
Search problems in Artificial Intelligence
ananth
 
PDF
Introduction to Artificial Intelligence
ananth
 
PDF
Word representation: SVD, LSA, Word2Vec
ananth
 
PDF
Deep Learning For Speech Recognition
ananth
 
PDF
Overview of TensorFlow For Natural Language Processing
ananth
 
PDF
Convolutional Neural Networks: Part 1
ananth
 
PDF
Machine Learning Lecture 3 Decision Trees
ananth
 
PDF
Machine Learning Lecture 2 Basics
ananth
 
PDF
Introduction To Applied Machine Learning
ananth
 
PDF
Recurrent Neural Networks, LSTM and GRU
ananth
 
PDF
MaxEnt (Loglinear) Models - Overview
ananth
 
PDF
An overview of Hidden Markov Models (HMM)
ananth
 
PDF
L06 stemmer and edit distance
ananth
 
PDF
L05 language model_part2
ananth
 
Convolutional Neural Networks : Popular Architectures
ananth
 
Foundations: Artificial Neural Networks
ananth
 
Overview of Convolutional Neural Networks
ananth
 
Artificial Intelligence Course: Linear models
ananth
 
An Overview of Naïve Bayes Classifier
ananth
 
Mathematical Background for Artificial Intelligence
ananth
 
Search problems in Artificial Intelligence
ananth
 
Introduction to Artificial Intelligence
ananth
 
Word representation: SVD, LSA, Word2Vec
ananth
 
Deep Learning For Speech Recognition
ananth
 
Overview of TensorFlow For Natural Language Processing
ananth
 
Convolutional Neural Networks: Part 1
ananth
 
Machine Learning Lecture 3 Decision Trees
ananth
 
Machine Learning Lecture 2 Basics
ananth
 
Introduction To Applied Machine Learning
ananth
 
Recurrent Neural Networks, LSTM and GRU
ananth
 
MaxEnt (Loglinear) Models - Overview
ananth
 
An overview of Hidden Markov Models (HMM)
ananth
 
L06 stemmer and edit distance
ananth
 
L05 language model_part2
ananth
 
Ad

Recently uploaded (20)

PDF
How to Hire AI Developers_ Step-by-Step Guide in 2025.pdf
DianApps Technologies
 
PPTX
AEM User Group: India Chapter Kickoff Meeting
jennaf3
 
PDF
Why Businesses Are Switching to Open Source Alternatives to Crystal Reports.pdf
Varsha Nayak
 
PPTX
Empowering Asian Contributions: The Rise of Regional User Groups in Open Sour...
Shane Coughlan
 
PDF
4K Video Downloader Plus Pro Crack for MacOS New Download 2025
bashirkhan333g
 
PDF
Open Chain Q2 Steering Committee Meeting - 2025-06-25
Shane Coughlan
 
PPTX
Foundations of Marketo Engage - Powering Campaigns with Marketo Personalization
bbedford2
 
PDF
MiniTool Partition Wizard 12.8 Crack License Key LATEST
hashhshs786
 
PDF
Generic or Specific? Making sensible software design decisions
Bert Jan Schrijver
 
PPTX
Homogeneity of Variance Test Options IBM SPSS Statistics Version 31.pptx
Version 1 Analytics
 
PPTX
Change Common Properties in IBM SPSS Statistics Version 31.pptx
Version 1 Analytics
 
PPTX
Hardware(Central Processing Unit ) CU and ALU
RizwanaKalsoom2
 
PPTX
Agentic Automation: Build & Deploy Your First UiPath Agent
klpathrudu
 
PPTX
Help for Correlations in IBM SPSS Statistics.pptx
Version 1 Analytics
 
PDF
Alexander Marshalov - How to use AI Assistants with your Monitoring system Q2...
VictoriaMetrics
 
PDF
Unlock Efficiency with Insurance Policy Administration Systems
Insurance Tech Services
 
PPTX
Tally_Basic_Operations_Presentation.pptx
AditiBansal54083
 
PPTX
Finding Your License Details in IBM SPSS Statistics Version 31.pptx
Version 1 Analytics
 
PDF
AI + DevOps = Smart Automation with devseccops.ai.pdf
Devseccops.ai
 
PPTX
OpenChain @ OSS NA - In From the Cold: Open Source as Part of Mainstream Soft...
Shane Coughlan
 
How to Hire AI Developers_ Step-by-Step Guide in 2025.pdf
DianApps Technologies
 
AEM User Group: India Chapter Kickoff Meeting
jennaf3
 
Why Businesses Are Switching to Open Source Alternatives to Crystal Reports.pdf
Varsha Nayak
 
Empowering Asian Contributions: The Rise of Regional User Groups in Open Sour...
Shane Coughlan
 
4K Video Downloader Plus Pro Crack for MacOS New Download 2025
bashirkhan333g
 
Open Chain Q2 Steering Committee Meeting - 2025-06-25
Shane Coughlan
 
Foundations of Marketo Engage - Powering Campaigns with Marketo Personalization
bbedford2
 
MiniTool Partition Wizard 12.8 Crack License Key LATEST
hashhshs786
 
Generic or Specific? Making sensible software design decisions
Bert Jan Schrijver
 
Homogeneity of Variance Test Options IBM SPSS Statistics Version 31.pptx
Version 1 Analytics
 
Change Common Properties in IBM SPSS Statistics Version 31.pptx
Version 1 Analytics
 
Hardware(Central Processing Unit ) CU and ALU
RizwanaKalsoom2
 
Agentic Automation: Build & Deploy Your First UiPath Agent
klpathrudu
 
Help for Correlations in IBM SPSS Statistics.pptx
Version 1 Analytics
 
Alexander Marshalov - How to use AI Assistants with your Monitoring system Q2...
VictoriaMetrics
 
Unlock Efficiency with Insurance Policy Administration Systems
Insurance Tech Services
 
Tally_Basic_Operations_Presentation.pptx
AditiBansal54083
 
Finding Your License Details in IBM SPSS Statistics Version 31.pptx
Version 1 Analytics
 
AI + DevOps = Smart Automation with devseccops.ai.pdf
Devseccops.ai
 
OpenChain @ OSS NA - In From the Cold: Open Source as Part of Mainstream Soft...
Shane Coughlan
 

Generative Adversarial Networks : Basic architecture and variants

  • 2. References • https://github.com/lukedeo/keras-acgan/blob/master/acgan-analysis.ipynb • https://github.com/keras-team/keras/blob/master/examples/mnist_acgan.py • https://skymind.ai/wiki/generative-adversarial-network-gan • https://junyanz.github.io/CycleGAN/ • Self Attention Generative Adversarial Networks: Zhang et al
  • 3. Why GAN? • GANs can learn to mimic any distribution and generate data • The data may be images, speech or music • The outputs from GANs are found to be quite realistic and impressive • Thus, GANs have a number of applications: From being a feature in products like Photoshop to generating synthetic datasets for image augmentation
  • 7. Generator • Generates synthetic images given the input noise z • G is differentiable • Typically a Gaussian distribution
  • 8. Training • Train on 2 mini batches simultaneously • Training samples • Generated samples • Cost
  • 10. Different Variants of GAN Ref: https://github.com/lukedeo/keras-acgan/blob/master/acgan-analysis.ipynb
  • 11. Cycle GAN (2017) • Original Paper: “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”, Zhu et al
  • 12. Image to Image Translation • Image to image translation is aimed at finding a mapping between an input image (X) and its corresponding output image (Y), where the pair X, Y are provided in the dataset • This assumes that we are provided with such a labelled dataset with pairings • CycleGAN attempts to find a mapping between images from source and target domains in the absence of paired examples Learn G: X → Y such that the distribution of images from G(X) is indistinguishable from the distribution Y using an adversarial loss. Couple this with an inverse mapping F: Y → X and enforce a cycle consistency loss to enforce F(G(X)) ≈ X
  • 14. Cycle GAN: Objective Function • Two discriminators: Dx and Dy where Dx aims to distinguish between images {x} and translated images {F(y)}. In the same way Dy aims to discriminate between {y} and {G(x)} • The objective function has 2 parts representing the losses: • adversarial losses for matching the distribution of generated images to the data distribution in the target domain • Cycle consistency losses that prevent the learned mappings G and F from contradicting each other
  • 16. Exercises • Go through the original paper and answer the following: • How is the model evaluated? What are the metrics? • What are the main applications discussed in the paper? • What are the limitations and future work?
  • 17. SAGAN (2018) Zhang et al Abstract • GANs often use a CNN as a generator • CNNs capture short range dependencies very well (local receptive fields) but not effective to capture long distance correlations • Self Attention Generative Adversarial Networks (SAGAN) is aimed at generating images that take in to account both short and long distance dependencies in the source images
  • 18. SAGAN