Deep Learning for Computer Vision: Backward Propagation (UPC 2016)


The document discusses backpropagation and optimization techniques in neural networks, emphasizing the use of supervised and unsupervised learning methods to train models effectively. It details the backpropagation algorithm's functionality, which involves minimizing a loss function using stochastic gradient descent and its variants. Additionally, it illustrates how to compute gradients iteratively through the network layers to improve parameter fitting.


Day 1 Lecture 4
Backward Propagation
Elisa Sayrol
[course site]

Learning
Purely Supervised
• Typically Backpropagation + Stochastic Gradient Descent (SGD)
• Good when there are lots of labeled data
Layer-wise Unsupervised + Supervised classifier
• Train each layer in sequence, using regularized auto-encoders or Restricted Boltzmann Machines (RBM)
• Hold the feature extractor fixed and train a linear classifier on top of the features
• Good when labeled data is scarce but there are lots of unlabeled data
Layer-wise Unsupervised + Supervised Backprop
• Train each layer in sequence
• Backprop through the whole system
• Good when the learning problem is very difficult
Slide Credit: LeCun

From Lecture 3
L Hidden Layers
Hidden pre-activation (k>0)
Hidden activation (k=1,…,L)
Output activation (k=L+1)
Figure Credit: Hugo Larochelle NN course
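The formulas that accompanied these three labels did not survive extraction. As a hedged reconstruction, the block below uses the notation that is standard for this kind of feed-forward network. The symbols g (hidden activation function), o (output activation function), f(x) (network output) and the convention h^(0)(x) = x are assumptions, not text recovered from the slide.

```latex
\begin{aligned}
a^{(k)}(x) &= b^{(k)} + W^{(k)} h^{(k-1)}(x), \qquad h^{(0)}(x) = x && (k > 0) \\
h^{(k)}(x) &= g\big(a^{(k)}(x)\big) && (k = 1, \dots, L) \\
h^{(L+1)}(x) &= o\big(a^{(L+1)}(x)\big) = f(x) && (k = L+1)
\end{aligned}
```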

Backpropagation algorithm
The output of the network gives class scores that depend on the input and the parameters.
• Define a loss function that quantifies our unhappiness with the scores across the training data.
• Come up with a way of efficiently finding the parameters that minimize the loss function (optimization).

Forward Pass
• Probability of a class given an input (softmax)
• Loss function; e.g., negative log-likelihood (good for classification)
• Regularization term (L2 norm), also known as weight decay
• Minimize the loss (plus some regularization term) w.r.t. the parameters over the whole training set.
[Figure: forward pass. Input x -> W1 -> a2, h2 (Hidden) -> W2 -> a3, h3 (Hidden) -> W3 -> a4, h4 (Output) -> Loss]
Figure Credit: Kevin McGuinness
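The forward pass described on this slide can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the lecture's own code: the tanh hidden activation, the layer sizes, the weight values and the function names (`matvec`, `softmax`, `forward`, `loss`) are all made up for the example. It computes softmax class probabilities, the negative log-likelihood of the correct class, and an L2 penalty on the weights.

```python
import math

def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def softmax(a):
    """Turn class scores into probabilities (shifted by the max for stability)."""
    m = max(a)
    exps = [math.exp(v - m) for v in a]
    total = sum(exps)
    return [e / total for e in exps]

def forward(x, params):
    """params is a list of (W, b) pairs; returns the class probabilities."""
    h = x
    for k, (W, b) in enumerate(params):
        a = [s + bi for s, bi in zip(matvec(W, h), b)]  # pre-activation
        is_output = k == len(params) - 1
        # Assumption: tanh in the hidden layers, softmax at the output.
        h = softmax(a) if is_output else [math.tanh(v) for v in a]
    return h

def loss(x, y, params, weight_decay=0.0):
    """Negative log-likelihood of class y plus an L2 penalty on all weights."""
    probs = forward(x, params)
    nll = -math.log(probs[y])
    l2 = sum(w * w for W, _ in params for row in W for w in row)
    return nll + 0.5 * weight_decay * l2

# Illustrative network: 2 inputs -> 3 hidden -> 3 hidden -> 2 classes.
params = [
    ([[0.1, -0.2], [0.3, 0.4], [-0.5, 0.2]], [0.0, 0.0, 0.0]),              # W1, b1
    ([[0.2, -0.1, 0.3], [0.0, 0.4, -0.2], [0.1, 0.1, 0.1]], [0.0, 0.0, 0.0]),  # W2, b2
    ([[0.3, -0.3, 0.2], [-0.1, 0.2, 0.4]], [0.0, 0.0]),                     # W3, b3
]
x, y = [1.0, -0.5], 1

print("class probabilities:", forward(x, params))
print("loss without weight decay:", loss(x, y, params))
print("loss with weight decay:", loss(x, y, params, weight_decay=0.1))
```

The regularized loss is always at least as large as the unregularized one, since the L2 term only adds a non-negative penalty.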

Backpropagation algorithm
• We need a way to fit the model to data: find parameters (W^(k), b^(k)) of the network that (locally) minimize the loss function.
• We can use stochastic gradient descent. Or better yet, mini-batch stochastic gradient descent.
• To do this, we need to find the gradient of the loss function with respect to all the parameters of the model (W^(k), b^(k)).
• These can be found using the chain rule of differentiation.
• The calculations reveal that the gradient w.r.t. the parameters in layer k depends only on the error from the layer above and the output from the layer below.
• This means that the gradients for each layer can be computed iteratively, starting at the last layer and propagating the error back through the network. This is known as the backpropagation algorithm.
Slide Credit: Kevin McGuinness
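The chain-rule result stated in the bullets can be written out explicitly. The block below is a sketch under assumptions: it takes a^(k) = b^(k) + W^(k) h^(k-1) and h^(k) = g(a^(k)) as the layer definitions, a softmax output trained with the negative log-likelihood ℓ, and introduces δ^(k) for the error at layer k and e(y) for the one-hot vector of the correct class. None of these symbols appear in the extracted slide text.

```latex
\begin{aligned}
\delta^{(L+1)} &= \frac{\partial \ell}{\partial a^{(L+1)}} = f(x) - e(y)
  && \text{(error in the top layer)} \\
\frac{\partial \ell}{\partial W^{(k)}} &= \delta^{(k)} \, h^{(k-1)}(x)^{\top},
  \qquad \frac{\partial \ell}{\partial b^{(k)}} = \delta^{(k)}
  && \text{(gradients for layer } k\text{)} \\
\delta^{(k)} &= \big(W^{(k+1)\top} \delta^{(k+1)}\big) \odot g'\big(a^{(k)}(x)\big)
  && \text{(error passed to the layer below)}
\end{aligned}
```

The second line is the claim from the slide in symbols: the gradient for layer k needs only the error δ^(k), which comes from the layer above, and the output h^(k-1) of the layer below.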

Backward Pass
1. Find the error in the top layer
2. Compute weight updates
3. Backpropagate error to layer below
[Figure: backward pass. The error flows from the Loss back through h4, a4 (Output), W3, h3, a3 (Hidden), W2, h2, a2 (Hidden) and W1 towards the input x]
Figure Credit: Kevin McGuinness
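The three steps of the backward pass can be sketched in plain Python and checked against a finite-difference estimate. This is an illustrative sketch, not the lecture's implementation: the tanh hidden activation, the softmax output with negative log-likelihood, the network sizes, the weight values and all function names are assumptions made for the example.

```python
import math

def matvec(W, x):
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def softmax(a):
    m = max(a)
    exps = [math.exp(v - m) for v in a]
    total = sum(exps)
    return [e / total for e in exps]

def forward(x, params):
    """Return the activations of every layer; hs[0] is the input itself."""
    hs = [x]
    for k, (W, b) in enumerate(params):
        a = [s + bi for s, bi in zip(matvec(W, hs[-1]), b)]
        is_output = k == len(params) - 1
        hs.append(softmax(a) if is_output else [math.tanh(v) for v in a])
    return hs

def nll(x, y, params):
    """Negative log-likelihood of the correct class y."""
    return -math.log(forward(x, params)[-1][y])

def backward(x, y, params):
    """Gradients of the negative log-likelihood w.r.t. every (W, b) pair."""
    hs = forward(x, params)
    # 1. Error in the top layer: softmax output minus the one-hot target.
    delta = [p - (1.0 if i == y else 0.0) for i, p in enumerate(hs[-1])]
    grads = [None] * len(params)
    for k in reversed(range(len(params))):
        W, _ = params[k]
        h_below = hs[k]
        # 2. Weight gradient: outer product of the error and the output below.
        grad_W = [[d * h for h in h_below] for d in delta]
        grads[k] = (grad_W, list(delta))
        if k > 0:
            # 3. Backpropagate: W^T delta, scaled by tanh'(a) = 1 - h^2.
            back = [sum(W[i][j] * delta[i] for i in range(len(delta)))
                    for j in range(len(h_below))]
            delta = [bj * (1.0 - h * h) for bj, h in zip(back, h_below)]
    return grads

# Illustrative network: 2 inputs -> 3 hidden -> 2 classes.
params = [
    ([[0.1, -0.2], [0.3, 0.4], [-0.5, 0.2]], [0.0, 0.1, -0.1]),
    ([[0.3, -0.3, 0.2], [-0.1, 0.2, 0.4]], [0.05, -0.05]),
]
x, y = [1.0, -0.5], 1
grads = backward(x, y, params)

# Gradient check on one entry of the first weight matrix (central differences).
eps = 1e-6
W1 = params[0][0]
W1[0][1] += eps
plus = nll(x, y, params)
W1[0][1] -= 2 * eps
minus = nll(x, y, params)
W1[0][1] += eps  # restore the original value
numeric = (plus - minus) / (2 * eps)
analytic = grads[0][0][0][1]
print(f"analytic={analytic:.8f} numeric={numeric:.8f}")
```

Comparing the analytic gradient with a numerical estimate like this is a common way to catch mistakes in a hand-written backward pass.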

Optimization
Stochastic Gradient Descent
Stochastic Gradient Descent with momentum
Stochastic Gradient Descent with L2 regularization
Hyperparameters: learning rate, weight decay
Recommended lectures:
http://cs231n.github.io/optimization-1/
http://cs231n.github.io/optimization-2/
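The update equations for the three variants were lost in extraction. The sketch below shows one common formulation of each, which may differ in detail from what the slide showed. The function names, the hyperparameter values and the toy quadratic objective are assumptions chosen for illustration, and the gradient here is exact rather than a mini-batch estimate.

```python
def sgd_step(w, grad, lr):
    """Plain gradient step: w <- w - lr * grad."""
    return [wi - lr * gi for wi, gi in zip(w, grad)]

def sgd_momentum_step(w, v, grad, lr, momentum):
    """Velocity accumulates past gradients: v <- momentum * v - lr * grad; w <- w + v."""
    v = [momentum * vi - lr * gi for vi, gi in zip(v, grad)]
    w = [wi + vi for wi, vi in zip(w, v)]
    return w, v

def sgd_l2_step(w, grad, lr, weight_decay):
    """L2 regularization adds weight_decay * w to the gradient (weight decay)."""
    return [wi - lr * (gi + weight_decay * wi) for wi, gi in zip(w, grad)]

# Toy objective: 0.5 * ||w - target||^2, whose gradient is (w - target).
target = [3.0, -1.0]

def gradient(w):
    return [wi - ti for wi, ti in zip(w, target)]

w_sgd = [0.0, 0.0]
for _ in range(100):
    w_sgd = sgd_step(w_sgd, gradient(w_sgd), lr=0.1)

w_mom, velocity = [0.0, 0.0], [0.0, 0.0]
for _ in range(300):
    w_mom, velocity = sgd_momentum_step(w_mom, velocity, gradient(w_mom),
                                        lr=0.1, momentum=0.9)

w_l2 = [0.0, 0.0]
for _ in range(100):
    w_l2 = sgd_l2_step(w_l2, gradient(w_l2), lr=0.1, weight_decay=0.5)

print("SGD:", w_sgd)
print("SGD with momentum:", w_mom)
print("SGD with L2 regularization:", w_l2)
```

On this toy problem the first two variants approach the target, while the L2 variant settles at target / (1 + weight_decay), which shows how weight decay pulls the parameters towards zero.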

